Xiaolong Li

littledragon

https://dragonlong.github.io

dragonlong

AI & ML interests

3d deep learning

Recent Activity

upvoted a paper 2 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

upvoted a collection 30 days ago

Eagle 2

upvoted a collection 3 months ago

MIT Talk 31/10 Papers

View all activity

Organizations

None yet

littledragon's activity

upvoted a paper 2 days ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 5 days ago • 30

upvoted a collection 30 days ago

Eagle 2

Collection

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 8 items • Updated about 8 hours ago • 10

upvoted a collection 3 months ago

MIT Talk 31/10 Papers

Collection

14 items • Updated Oct 28, 2024 • 31

liked a dataset 3 months ago

sled-umich/3D-GRAND

Updated 14 days ago • 77 • 7

liked a model 3 months ago

robotics-diffusion-transformer/rdt-1b

Robotics • Updated Oct 17, 2024 • 3.19k • 67

liked a Space 3 months ago

Running

🥇

MEGA-Bench

A leaderboard for multimodal models

liked a model 3 months ago

rhymes-ai/Aria

Image-Text-to-Text • Updated 1 day ago • 27.5k • 608

liked a model 4 months ago

NVEagle/Eagle-X5-13B-Chat

Image-Text-to-Text • Updated Sep 16, 2024 • 422 • 28

liked a dataset 4 months ago

MSheng-Lee/M3DBench

Preview • Updated Oct 1, 2024 • 59 • 2

liked a dataset 5 months ago

shi-labs/Eagle-1.8M

Updated Aug 29, 2024 • 147 • 7

liked a model 5 months ago

alvdansen/flux-koda

Text-to-Image • Updated Aug 16, 2024 • 13.4k • • 222

upvoted a paper 5 months ago

Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches

Paper • 2408.04567 • Published Aug 8, 2024 • 25

upvoted a paper 6 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

upvoted a paper 7 months ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 60

upvoted a paper 8 months ago

WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space

Paper • 2311.13570 • Published Nov 22, 2023 • 3

liked a model 11 months ago

bigcode/starcoder2-15b

Text Generation • Updated Jun 5, 2024 • 29.4k • • 580

liked a dataset about 1 year ago

lyx97/FETV

Viewer • Updated Jun 15, 2023 • 619 • 65 • 6

liked a Space about 1 year ago

Build error

283

🌒

Zero123++ Demo Space

upvoted 2 papers over 1 year ago

Video Language Planning

Paper • 2310.10625 • Published Oct 16, 2023 • 9

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Paper • 2305.07015 • Published May 11, 2023 • 4