arxiv:2412.10302
Wen Liu
doubility123
AI & ML interests
Generative AI, Large Multi-Modality Models, 2D/3D Generation
Recent Activity
authored a paper 1 day ago
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
authored a paper 1 day ago
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation
authored a paper 1 day ago
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Papers: 10
Models: None public yet
Datasets: None public yet