11 15 12

Xiangtai Li

LXT

https://lxtgh.github.io/

AI & ML interests

Computer Vision, Multi-Modal Understanding, Generative AI

Recent Activity

liked a dataset 18 days ago

zhangtao-whu/OMG-LLaVA

upvoted a paper 23 days ago

Multimodal Latent Language Modeling with Next-Token Diffusion

commented a paper 25 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

View all activity

Organizations

LXT's activity

liked a dataset 18 days ago

zhangtao-whu/OMG-LLaVA

Updated Jul 3, 2024 • 740 • 3

upvoted a paper 23 days ago

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published 25 days ago • 41

commented a paper 25 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 27 days ago • 46 •

authored 2 papers 26 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 27 days ago • 46

EMOv2: Pushing 5M Vision Model Frontier

Paper • 2412.06674 • Published 28 days ago • 13

updated a collection 26 days ago

Research Paper

Collection

Research Papers from Researcher/Member of MeissonFlow. • 1 item • Updated 26 days ago

liked a dataset 26 days ago

jianzongwu/MangaZero

Viewer • Updated 26 days ago • 32.7k • 169 • 20

upvoted 2 papers 26 days ago

EMOv2: Pushing 5M Vision Model Frontier

Paper • 2412.06674 • Published 28 days ago • 13

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 27 days ago • 46

commented 2 papers 26 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 27 days ago • 46 •

EMOv2: Pushing 5M Vision Model Frontier

Paper • 2412.06674 • Published 28 days ago • 13 •

upvoted a paper 27 days ago

HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing

Paper • 2412.04280 • Published Dec 5, 2024 • 13

authored a paper about 1 month ago

HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing

Paper • 2412.04280 • Published Dec 5, 2024 • 13

liked a model 2 months ago

Collov-Labs/Monetico

Text-to-Image • Updated Oct 28, 2024 • 28 • 65

upvoted 2 papers 3 months ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 32

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 90

liked a Space 3 months ago

Running on Zero

🚀

Meissonic Flow

authored 3 papers 3 months ago