leaderboard-pr-bot committed on
Commit 7b551cc · verified · 1 Parent(s): dc989ca

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +14 -1
README.md CHANGED
@@ -200,4 +200,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 
 If you would like to support me:
 
-[☕ Buy Me a Coffee](https://www.buymeacoffee.com/weyaxi)
+[☕ Buy Me a Coffee](https://www.buymeacoffee.com/weyaxi)
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Weyaxi__Bagel-Hermes-2x34b)
+
+|             Metric              |Value|
+|---------------------------------|----:|
+|Avg.                             |75.10|
+|AI2 Reasoning Challenge (25-Shot)|69.80|
+|HellaSwag (10-Shot)              |85.26|
+|MMLU (5-Shot)                    |77.24|
+|TruthfulQA (0-shot)              |64.82|
+|Winogrande (5-shot)              |84.77|
+|GSM8k (5-shot)                   |68.69|
+
+