MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 8 days ago • 267
YuLan-Mini: An Open Data-efficient Language Model Paper • 2412.17743 • Published 30 days ago • 64
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper • 2412.00869 • Published Dec 1, 2024 • 4
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models Paper • 2411.19477 • Published Nov 29, 2024 • 6