Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Paper • 2407.07086 • Published Jul 9, 2024
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 11 days ago • 83
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 11 days ago • 83
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 11 days ago • 83
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 11 days ago • 83
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 11 days ago • 83
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 12
PERSONA Collection Collection of various datasets related to the PERSONA paper. • 5 items • Updated Oct 28, 2024 • 2
Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation Paper • 2410.02725 • Published Oct 3, 2024 • 1
Open X-Embodiment: Robotic Learning Datasets and RT-X Models Paper • 2310.08864 • Published Oct 13, 2023 • 2
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control Paper • 2307.15818 • Published Jul 28, 2023 • 29
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning Paper • 2408.08441 • Published Aug 15, 2024 • 8
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data Paper • 2404.14367 • Published Apr 22, 2024 • 1
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models Paper • 2311.18232 • Published Nov 30, 2023 • 1