arxiv:2412.02611
Jiaming Han
csuhan
AI & ML interests
Computer Vision
Recent Activity
upvoted
a
paper
about 1 month ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand
Audio-Visual Information?
authored
a paper
about 1 month ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand
Audio-Visual Information?
Organizations
models
12
csuhan/temp
Updated
csuhan/t2i
Updated
csuhan/LLaVA_EF
Updated
csuhan/OneLLM-7B-x-text
Updated
csuhan/OneLLM-7B-image-text
Updated
csuhan/OneLLM-7B
Updated
โข
4
csuhan/OneLLM-7B-hf-v1.1
Updated
โข
2
csuhan/OneLLM-7B-hf
Updated
csuhan/OneLLM-7B-backup
Updated
โข
3
csuhan/blip2_opt2.7b
Feature Extraction
โข
Updated
โข
9