The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 8 days ago • 83
Reasoning Datasets Collection Reasoning datasets that are trending 🔥 • 10 items • Updated 18 days ago • 24
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 58
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 48
SelfCodeAlign: Self-Alignment for Code Generation Paper • 2410.24198 • Published Oct 31, 2024 • 23