GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 15 days ago • 60
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Paper • 2501.09781 • Published 13 days ago • 24
SEAL: Entangled White-box Watermarks on Low-Rank Adaptation Paper • 2501.09284 • Published 14 days ago • 10
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 7 days ago • 261
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published 6 days ago • 7
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback Paper • 2501.10799 • Published 11 days ago • 14
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published 5 days ago • 29
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models Paper • 2412.18609 • Published Dec 24, 2024 • 16
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization Paper • 2412.18525 • Published Dec 24, 2024 • 72
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Paper • 2501.01423 • Published 27 days ago • 36
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 23 days ago • 52
Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models Paper • 2501.02376 • Published 25 days ago • 3
MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting Paper • 2501.03714 • Published 22 days ago • 9
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization Paper • 2501.03271 • Published 25 days ago • 11
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation Paper • 2501.04144 • Published 22 days ago • 18
Building Foundations for Natural Language Processing of Historical Turkish: Resources and Models Paper • 2501.04828 • Published 21 days ago • 11
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models Paper • 2501.05767 • Published 20 days ago • 28
Evaluating Sample Utility for Data Selection by Mimicking Model Weights Paper • 2501.06708 • Published 18 days ago • 5