Latxa: An Open Language Model and Evaluation Suite for Basque Paper • 2403.20266 • Published Mar 29, 2024 • 3
TrustLLM: Trustworthiness in Large Language Models Paper • 2401.05561 • Published Jan 10, 2024 • 66
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2, 2024 • 119
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory Paper • 2405.08707 • Published May 14, 2024 • 27
tinyBenchmarks: evaluating LLMs with fewer examples Paper • 2402.14992 • Published Feb 22, 2024 • 11