SondosMB committed · verified
Commit bc21111 · 1 Parent(s): 86d655e

Update constants.py

Files changed (1): constants.py +2 -2
constants.py CHANGED
@@ -10,7 +10,7 @@ INTRODUCTION_TEXT= """
 # OS Benchmark (Evaluating LLMs with OS and MCQ)
 🔗 [Website](https://github.com/VILA-Lab/MBZUAI-LLM-Leaderboard) | 💻 [GitHub](https://github.com/VILA-Lab/MBZUAI-LLM-Leaderboard) | 📖 [Paper](#) | 🐦 [Tweet 1](#) | 🐦 [Tweet 2](#)

- > ### MBZUAI-LLM-Leaderboard, a new framework for evaluating large language models (LLMs) by transitioning from multiple-choice questions (MCQs) to open-style questions.
+ > ### Open-LLM-Leaderboard,for evaluating large language models (LLMs) by transitioning from multiple-choice questions (MCQs) to open-style questions.
 This approach addresses the inherent biases and limitations of MCQs, such as selection bias and the effect of random guessing. By utilizing open-style questions,
 the framework aims to provide a more accurate assessment of LLMs' abilities across various benchmarks and ensure that the evaluation reflects true capabilities,
 particularly in terms of language understanding and reasoning.
@@ -18,7 +18,7 @@ particularly in terms of language understanding and reasoning.
 """

 CITATION_TEXT = """@artical{..,
- title={MBZUAI-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena},
+ title={Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena},
 author={},
 year={2024},
 archivePrefix={arXiv}
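
Both edited strings are plain module-level constants, so the rename takes effect everywhere `constants.py` is imported, with no other code changes. Below is a minimal sketch of how a Gradio Space typically consumes such constants; the file name `app.py`, the accordion label, and the component choices are illustrative assumptions, not code from this repository.

```python
# app.py — hypothetical consumer of constants.py (sketch, not from this repo).
import gradio as gr

from constants import INTRODUCTION_TEXT, CITATION_TEXT

with gr.Blocks() as demo:
    # The intro constant is markdown, so headings and links render directly.
    gr.Markdown(INTRODUCTION_TEXT)
    # Leaderboard tables, plots, etc. would go here.
    with gr.Accordion("Citation", open=False):
        # The BibTeX constant is shown verbatim for easy copying.
        gr.Textbox(value=CITATION_TEXT, label="BibTeX", lines=6, show_copy_button=True)

if __name__ == "__main__":
    demo.launch()
```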