On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published 7 days ago • 39
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 10 days ago • 82
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published 19 days ago • 44
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Paper • 2412.17483 • Published 12 days ago • 29
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks Paper • 2412.18072 • Published 11 days ago • 14
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 12 days ago • 21
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published 13 days ago • 27
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 12 days ago • 40
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 12 days ago • 42
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 16 days ago • 82
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published 16 days ago • 14
Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation Paper • 2412.15255 • Published 19 days ago • 3
In Case You Missed It: ARC 'Challenge' Is Not That Challenging Paper • 2412.17758 • Published 11 days ago • 15
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 11 days ago • 37
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Paper • 2412.18450 • Published 11 days ago • 32
Generative Agents: Interactive Simulacra of Human Behavior Paper • 2304.03442 • Published Apr 7, 2023 • 12