Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Paper • 2501.01423 • Published 4 days ago • 31
MLLM-as-a-Judge for Image Safety without Human Labeling Paper • 2501.00192 • Published 6 days ago • 22
ProgCo: Program Helps Self-Correction of Large Language Models Paper • 2501.01264 • Published 4 days ago • 23
Slow Perception: Let's Perceive Geometric Figures Step-by-step Paper • 2412.20631 • Published 7 days ago • 12
PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published 7 days ago • 14
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks Paper • 2412.18072 • Published 13 days ago • 14
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Paper • 2412.15797 • Published 17 days ago • 16
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published 13 days ago • 34
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Paper • 2412.18450 • Published 13 days ago • 32
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published 14 days ago • 23
NILE: Internal Consistency Alignment in Large Language Models Paper • 2412.16686 • Published 16 days ago • 8
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Paper • 2412.17153 • Published 14 days ago • 33
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 14 days ago • 44
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 14 days ago • 41
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 17 days ago • 36
Flowing from Words to Pixels: A Framework for Cross-Modality Evolution Paper • 2412.15213 • Published 18 days ago • 25
Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion Paper • 2412.14462 • Published 18 days ago • 15