Wen Liu
doubility123
AI & ML interests
Generative AI, Large Multi-Modality Models, 2D/3D Generation
Recent Activity
authored
a paper
4 days ago
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding
and Generation
authored
a paper
4 days ago
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified
Multimodal Understanding and Generation
authored
a paper
4 days ago
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced
Multimodal Understanding
Organizations
doubility123's activity
Update README.md
#1 opened 22 days ago
by
reach-vb
Update README.md
#1 opened 22 days ago
by
reach-vb
Update README.md
#1 opened 22 days ago
by
reach-vb
Model is not inferencing on multiple images; is this the right template?
4
#4 opened 10 months ago
by
ltbd78
Rename README.md to 总感觉有点道理
#5 opened 10 months ago
by
poiuy741741
Rename README.md to 我们地球上有汽车。老虎.md
#6 opened 10 months ago
by
poiuy741741
Rename README.md to 我们地球上有汽车。老虎.md
#7 opened 10 months ago
by
poiuy741741
Add tag for VLM
#2 opened 10 months ago
by
osanseviero
Add tag for VLM
#2 opened 10 months ago
by
osanseviero
data preprocessing tools
1
#2 opened almost 2 years ago
by
doubility123
data preprocessing tools
1
#2 opened almost 2 years ago
by
doubility123
data preprocessing tools
1
#2 opened almost 2 years ago
by
doubility123