Yongming Rao's picture

6 2

Yongming Rao

raoyongming

·

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

upvoted a collection about 2 months ago

upvoted a paper about 2 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

View all activity

Organizations

None yet

raoyongming's activity

upvoted a collection about 2 months ago

Insight-V

Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models • 5 items • Updated Nov 22, 2024 • 9

upvoted a paper about 2 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 22

upvoted a paper 3 months ago

MaskBit: Embedding-free Image Generation via Bit Tokens

Paper • 2409.16211 • Published Sep 24, 2024 • 17

upvoted a paper 4 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19, 2024 • 25

upvoted a paper 5 months ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 22

upvoted a paper 6 months ago

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

Paper • 2407.18121 • Published Jul 25, 2024 • 17