arxiv:2412.19806
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Model, Multimodal Learning, Scene Graph Generation
Recent Activity
authored a paper 4 days ago
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
new activity 15 days ago
Bin1117/AnyEdit: wrong format of data
Organizations
Papers
3
Datasets
None public yet