Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models Paper • 2310.14491 • Published Oct 23, 2023
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper • 2405.00332 • Published May 1, 2024 • 30
Continued Pretraining for Better Zero- and Few-Shot Promptability Paper • 2210.10258 • Published Oct 19, 2022