Lauromon
's Collections
Time to read
updated
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Paper
•
2501.04519
•
Published
•
230
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta
Chain-of-Though
Paper
•
2501.04682
•
Published
•
83
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level
Mathematical Reasoning
Paper
•
2410.02884
•
Published
•
53
Think Before You Speak: Cultivating Communication Skills of Large
Language Models via Inner Monologue
Paper
•
2311.07445
•
Published
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper
•
2408.06195
•
Published
•
68
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Paper
•
2405.06682
•
Published
•
3
Scaling LLM Test-Time Compute Optimally can be More Effective than
Scaling Model Parameters
Paper
•
2408.03314
•
Published
•
54
Training Large Language Models to Reason in a Continuous Latent Space
Paper
•
2412.06769
•
Published
•
74
Stream of Search (SoS): Learning to Search in Language
Paper
•
2404.03683
•
Published
•
30
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Paper
•
2408.07199
•
Published
•
21
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
•
2312.10003
•
Published
•
37
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
•
2501.04227
•
Published
•
77
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
Framework
Paper
•
2308.08155
•
Published
•
5
GAIA: a benchmark for General AI Assistants
Paper
•
2311.12983
•
Published
•
187
More Agents Is All You Need
Paper
•
2402.05120
•
Published
•
52