Establishing Task Scaling Laws via Compute-Efficient Model Ladders Paper • 2412.04403 • Published 27 days ago • 2
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens Paper • 2401.17377 • Published Jan 30, 2024 • 35