Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion Paper • 2501.09019 • Published 6 days ago • 11
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Paper • 2501.08809 • Published 6 days ago • 9
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Paper • 2501.08292 • Published 7 days ago • 16
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering Paper • 2501.05131 • Published 12 days ago • 33
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 7 days ago • 53
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 7 days ago • 263
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 12 days ago • 80
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Paper • 2501.02393 • Published 16 days ago • 8
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 14 days ago • 64
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 17 days ago • 83
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 27 days ago • 95
MotiF: Making Text Count in Image Animation with Motion Focal Loss Paper • 2412.16153 • Published Dec 20, 2024 • 6
In Case You Missed It: ARC 'Challenge' Is Not That Challenging Paper • 2412.17758 • Published 29 days ago • 16
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published 28 days ago • 19
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 29 days ago • 39
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Paper • 2412.18450 • Published 28 days ago • 32