Merge branch 'main' of https://huggingface.co/Badgids/Gonzo-Chat-7B
Browse files
README.md
CHANGED
@@ -160,13 +160,13 @@ dtype: bfloat16
|
|
160 |
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
161 |
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Badgids__Gonzo-Chat-7B)
|
162 |
|
163 |
-
|
|
164 |
-
|
165 |
-
|Avg.
|
166 |
-
|AI2 Reasoning Challenge (25-Shot)|65.02|
|
167 |
-
|HellaSwag (10-Shot)
|
168 |
-
|MMLU (5-Shot)
|
169 |
-
|TruthfulQA (0-shot)
|
170 |
-
|Winogrande (5-shot)
|
171 |
-
|GSM8k (5-shot)
|
172 |
|
|
|
160 |
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
161 |
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Badgids__Gonzo-Chat-7B)
|
162 |
|
163 |
+
| Metric | Value |
|
164 |
+
| --------------------------------- | ----: |
|
165 |
+
| Avg. | 66.63 |
|
166 |
+
| AI2 Reasoning Challenge (25-Shot) | 65.02 |
|
167 |
+
| HellaSwag (10-Shot) | 85.40 |
|
168 |
+
| MMLU (5-Shot) | 63.75 |
|
169 |
+
| TruthfulQA (0-shot) | 60.23 |
|
170 |
+
| Winogrande (5-shot) | 77.74 |
|
171 |
+
| GSM8k (5-shot) | 47.61 |
|
172 |
|