Collections
Discover the best community collections!
Collections including paper arxiv:2411.11844
-
Video Creation by Demonstration
Paper • 2412.09551 • Published • 9 -
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Paper • 2412.07589 • Published • 45 -
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Paper • 2412.06531 • Published • 71 -
APOLLO: SGD-like Memory, AdamW-level Performance
Paper • 2412.05270 • Published • 38
-
Generative World Explorer
Paper • 2411.11844 • Published • 75 -
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation
Paper • 2501.04144 • Published • 18 -
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Paper • 2501.04689 • Published • 17 -
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
Paper • 2501.01320 • Published • 11
-
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
Paper • 2411.02959 • Published • 66 -
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
Paper • 2411.03047 • Published • 8 -
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Paper • 2411.02336 • Published • 23 -
GenXD: Generating Any 3D and 4D Scenes
Paper • 2411.02319 • Published • 20
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 65 -
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Paper • 2411.06559 • Published • 13 -
Generative World Explorer
Paper • 2411.11844 • Published • 75 -
GUI Agents: A Survey
Paper • 2412.13501 • Published • 25
-
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
Paper • 2411.05738 • Published • 14 -
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Paper • 2410.22476 • Published • 25 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 46 -
Training-free Regional Prompting for Diffusion Transformers
Paper • 2411.02395 • Published • 25