JudgeLM: Fine-tuned Large Language Models are Scalable Judges Paper โข 2310.17631 โข Published Oct 26, 2023 โข 34
Running on CPU Upgrade 12.3k ๐ Open LLM Leaderboard Track, rank and evaluate open LLMs and chatbots