Update src/display/about.py
Browse files- src/display/about.py +14 -7
src/display/about.py
CHANGED
@@ -48,13 +48,20 @@ currently consists of 15 individual tasks, leveraging pre-existing Czech dataset
|
|
48 |
including ARC, GSM8K, MMLU, and TruthfulQA. This work is brought to you by CIIRC CTU and VSB Ostrava.
|
49 |
|
50 |
Key Features and Benefits:
|
51 |
-
- **Tailored for the Czech Language:**
|
52 |
-
|
53 |
-
- **
|
54 |
-
|
55 |
-
- **
|
56 |
-
|
57 |
-
This
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
58 |
|
59 |
Below, you can find the up-to-date loaderboard of models evaluated on CzechBench.
|
60 |
For more information on the included benchmarks and instructions on evaluating your own models, please visit the "About" section below.
|
|
|
48 |
including ARC, GSM8K, MMLU, and TruthfulQA. This work is brought to you by CIIRC CTU and VSB Ostrava.
|
49 |
|
50 |
Key Features and Benefits:
|
51 |
+
- **Tailored for the Czech Language:**
|
52 |
+
CzechBench includes both original Czech datasets and adapted versions of international datasets, ensuring relevant evaluation of model performance in the Czech context.
|
53 |
+
- **Wide Range of Tasks:**
|
54 |
+
It contains 15 different tasks that cover various aspects of language understanding and text generation, enabling a comprehensive assessment of the model's capabilities.
|
55 |
+
- **Bilingual performance analysis:**
|
56 |
+
CzechBench also offers a parallel collection of 9 English tasks corresponding to the Czech versions included in the main suite.
|
57 |
+
This allows for direct comparison of model performance across both languages with equivalent conditions in terms of prompt formulation and few-shot example selection.
|
58 |
+
- **Universal model support:**
|
59 |
+
The universal text-to-text evaluation approach adopted in CzechBench allows for direct comparison of models with varying levels of internal access, including commercial APIs.
|
60 |
+
- **Ease of Use:**
|
61 |
+
The benchmark is built upon a commonly used evaluation framework with wide support for state-of-the-art models and inference acceleration tools.
|
62 |
+
- **Empowering decisions:**
|
63 |
+
Whether you are a business looking for the best LLM solution to base your application on, or a research team trying to maximize the capabilities of the models they are developing,
|
64 |
+
CzechBench will help you gain insights into particular strengths and weeknesses of individual models and better focus on key areas for optimization.
|
65 |
|
66 |
Below, you can find the up-to-date loaderboard of models evaluated on CzechBench.
|
67 |
For more information on the included benchmarks and instructions on evaluating your own models, please visit the "About" section below.
|