Jiasenlu's picture

2 2 2

Jiasenlu

Jiasenlu

·

https://jiasenlu.github.io/

AI & ML interests

Vision and Language

Recent Activity

liked a model 25 days ago

lehduong/OneDiffusion

authored a paper 26 days ago

STIV: Scalable Text and Image Conditioned Video Generation

authored a paper about 1 month ago

One Diffusion to Generate Them All

View all activity

Organizations

None yet

Jiasenlu's activity

liked a model 25 days ago

lehduong/OneDiffusion

Updated 21 days ago • 33 • 40

authored a paper 26 days ago

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published 27 days ago • 70

authored a paper about 1 month ago

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 26

upvoted a paper about 1 month ago

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 26

commented a paper about 1 month ago

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 26 •

authored 2 papers 3 months ago

MM-Ego: Towards Building Egocentric Multimodal LLMs

Paper • 2410.07177 • Published Oct 9, 2024 • 21

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 106

upvoted a paper about 1 year ago

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 27

liked a Space over 2 years ago

Unicl Zero-Shot Image Recognition Demo