Rongyao Fang's picture

1 4

Rongyao Fang

LucasFang

·

https://rongyaofang.github.io/

rongyaofang

AI & ML interests

Multimodal Large Language Model targeting AGI

Recent Activity

upvoted a paper 26 days ago

StreamChat: Chatting with Streaming Video

authored a paper 26 days ago

StreamChat: Chatting with Streaming Video

upvoted a paper about 2 months ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

View all activity

Organizations

None yet

LucasFang's activity

upvoted a paper 26 days ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published 27 days ago • 17

authored a paper 26 days ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published 27 days ago • 17

upvoted a paper about 2 months ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 44

authored 3 papers 3 months ago

Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking

Paper • 2303.05475 • Published Mar 9, 2023

RBGNet: Ray-based Grouping for 3D Object Detection

Paper • 2204.02251 • Published Apr 5, 2022

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53

upvoted a paper 3 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53

commented a paper 3 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53 •

authored a paper 10 months ago

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

Paper • 2403.12963 • Published Mar 19, 2024 • 7

upvoted a paper 11 months ago

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Paper • 2401.15977 • Published Jan 29, 2024 • 37