HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 25 days ago • 94
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 5 days ago • 259
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 8 days ago • 29
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 11 days ago • 83
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 12 days ago • 77
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Paper • 2501.01904 • Published 16 days ago • 31
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 14 days ago • 40
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 6 days ago • 80