5 11 32

Jiaming Han

csuhan

https://csuhan.com

csuhan

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 7 days ago

Diffusion Adversarial Post-Training for One-Step Video Generation

upvoted a paper 8 days ago

VideoAuteur: Towards Long Narrative Video Generation

upvoted a paper 27 days ago

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

View all activity

Organizations

csuhan's activity

upvoted a paper 7 days ago

Diffusion Adversarial Post-Training for One-Step Video Generation

Paper • 2501.08316 • Published 8 days ago • 30

upvoted a paper 8 days ago

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published 12 days ago • 31

upvoted a paper 27 days ago

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Paper • 2412.18597 • Published 29 days ago • 19

upvoted a paper about 2 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 23

authored a paper about 2 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 23

upvoted 3 papers 3 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 66

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 80

authored a paper 3 months ago

Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant

Paper • 2410.13360 • Published Oct 17, 2024 • 8

upvoted a paper 3 months ago

Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant

Paper • 2410.13360 • Published Oct 17, 2024 • 8

updated 2 models 4 months ago

csuhan/temp

Updated Oct 9, 2024

csuhan/t2i

Updated Sep 27, 2024

updated a model 5 months ago

csuhan/LLaVA_EF

Updated Aug 14, 2024

liked 2 models 7 months ago

stabilityai/stable-diffusion-2-1-unclip

Text-to-Image • Updated Apr 12, 2023 • 14.8k • 279

Intel/llava-gemma-2b

Image-Text-to-Text • Updated Jun 11, 2024 • 4.37k • 43

liked a dataset 7 months ago

UCSC-VLAA/Recap-DataComp-1B

Viewer • Updated 13 days ago • 1.88B • 2.12k • 162

updated a model 7 months ago

csuhan/OneLLM-7B-x-text

Updated Jun 27, 2024

liked a dataset 8 months ago

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 48.9k • 357

updated a model 10 months ago

csuhan/OneLLM-7B-image-text

Updated Mar 21, 2024

liked a dataset 10 months ago

fnlp/AnyInstruct

Updated Jul 30, 2024 • 60 • 40