zhangirazerbayev commited on
Commit
75251ab
·
verified ·
1 Parent(s): ca19162

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - EleutherAI/muInstruct
5
+ - camel-ai/math
6
+ language:
7
+ - en
8
+ tags:
9
+ - math
10
+ ---
11
+
12
+ `llemma_7b_muinstruct_camelmath` is an instruction-following finetune of [Llemma 7B](https://huggingface.co/EleutherAI/llemma_7b), trained on the [μInstruct](https://huggingface.co/datasets/EleutherAI/muInstruct) and [camel-ai/math](https://huggingface.co/datasets/camel-ai/math) datasets.
13
+
14
+ ## Input Formatting
15
+ Format input queries as follows:
16
+ ```
17
+ input_text = f"Input:{input}\n\nResponse:"
18
+ ```
19
+
20
+ Note that due to an error during training, this model's end-of-sequence token ID is `0` instead of the `2` which is standard for Llama-2 based models. Inference APIs should handle this automatically by reading this repo's `config.json`, but be aware of this difference if you are doing token surgery.
21
+
22
+ ## Evals
23
+ `
24
+ llemma_7b_muinstruct_camelmath` compares favorably to other 7B parameter models on the [Hungarian Math Exam](https://huggingface.co/datasets/keirp/hungarian_national_hs_finals_exam/blob/main/README.md). It surpasses the few-shot performance of Llemma 7B whilst being the strongest Llama-2 7B based model.
25
+
26
+ | Model | Exam Score |
27
+ | ------------------------------------------------------------------------------ | ---------- |
28
+ | [Code Llama 7B](https://huggingface.co/codellama/CodeLlama-7b-hf) (few-shot) | 8\% |
29
+ | [MetaMath 7B](https://huggingface.co/meta-math/MetaMath-7B-V1.0) | 20\% |
30
+ | [MAmmoTH 7B](https://huggingface.co/TIGER-Lab/MAmmoTH-7B) | 17\% |
31
+ | [MAmmoTH Coder 7B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-7B) | 11\% |
32
+ | [Llemma 7B](https://huggingface.co/EleutherAI/llemma_7b) (few-shot) | 23\% |
33
+ | [Llemma_7B_muinstruct_camelmath] | 25\% |
34
+ | - | - |
35
+ | [Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1) (few-shot) | 22\% |
36
+ | [MetaMath Mistral 7B](https://huggingface.co/meta-math/MetaMath-Mistral-7B) | 29\% |
37
+ | [OpenChat 3.5](https://huggingface.co/openchat/openchat_3.5) | 37\% |
38
+
39
+