view article Article ๐งโโ๏ธ "Replacing Judges with Juries" using distilabel By alvarobartt โข May 3, 2024 โข 17
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper โข 2404.18796 โข Published Apr 29, 2024 โข 68
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper โข 2405.01535 โข Published May 2, 2024 โข 119
Open LLM Leaderboard best models โค๏ธโ๐ฅ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: โข 64 items โข Updated about 1 hour ago โข 497