arxiv:2412.05271
Zhaoyang Liu
zyliu
AI & ML interests
Video understanding, 3D Perception, Autonomous driving, Foundation models, AIGC
Recent Activity
authored
a paper
19 days ago
InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots
Beyond Language
authored
a paper
19 days ago
Learning Human Motion Representations: A Unified Perspective
authored
a paper
19 days ago
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model
for Hundreds of Vision-Language Tasks
Organizations
spaces
2
models
11
zyliu/tmp_model11
Updated
•
5
zyliu/tmp_model10
Updated
•
2
zyliu/tmp_model9
Updated
•
3
zyliu/vllm3_tmp1
Updated
•
3
zyliu/tmp_model8
Updated
•
2
zyliu/tmp_model7
Updated
•
3
zyliu/tmp_model6
Updated
•
5
zyliu/tmp_model5
Updated
•
2
zyliu/tmp_model4
Updated
•
14
zyliu/tmp_gen_edit_model
Updated
•
17
datasets
None public yet