MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents Paper • 2501.08828 • Published 2 days ago • 24
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 3 days ago • 37
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 7 days ago • 29
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 7 days ago • 54
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 3 days ago • 254
Multi-task retriever fine-tuning for domain-specific and efficient RAG Paper • 2501.04652 • Published 9 days ago • 10
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 8 days ago • 75
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 10 days ago • 77
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published 10 days ago • 33
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 9 days ago • 83
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 11 days ago • 63
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 13 days ago • 40
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding Paper • 2411.18462 • Published Nov 27, 2024 • 6
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Paper • 2411.17691 • Published Nov 26, 2024 • 11
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 49