Adam Jirkovsky
committed on
Commit · c54febc
1 Parent(s): 8a4bce6
Edit introduction text
- src/display/about.py +2 -2
src/display/about.py
CHANGED
@@ -34,14 +34,14 @@ TITLE = """<h1 align="center" id="space-title">🇨🇿 CzechBench Leaderboard</
 TABLE_DESC = "The values presented in the table represent the accuracy metric."
 
 # What does your leaderboard evaluate?
-
+INTRODUCTION_OLD = """
 Czech-Bench is a collection of LLM benchmarks available for the Czech language. It currently consists of 15 Czech benchmarks, including new machine translations of the popular ARC, GSM8K, MMLU, and TruthfulQA datasets.
 
 Czech-Bench is developed by <a href="https://huggingface.co/CIIRC-NLP">CIIRC-NLP</a>.
 """
 
 # Which evaluations are you running? how can people reproduce what you have?
-
+INTRODUCTION_TEXT = f"""
 ## Basic Information
 The goal of the CzechBench project is to provide a comprehensive and practical benchmark for evaluating Czech language models.
 Our [evaluation suite](https://github.com/jirkoada/czechbench_eval_harness/tree/main/lm_eval/tasks/czechbench#readme)