arxiv:2412.09604
Zhaokai Wang
wzk1015
AI & ML interests
Computer Vision
Music Generation
Multimodal Large Language Models
Recent Activity
liked a model 8 days ago: favor123/llava-hr-7b-sft-1024
commented on a paper 18 days ago: SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
Organizations
Papers: 10
Models: None public yet
Datasets: None public yet