chenhugging
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -48,6 +48,9 @@ CUDA_VISIBLE_DEVICES=0 python src/train_bash.py --stage sft --do_train True --mo
|
|
48 |
|
49 |
### Performance
|
50 |
|
|
|
|
|
|
|
51 |
hf (pretrained=upstage/SOLAR-10.7B-v1.0,peft=chenhugging/solar-10b-ocn-v1,trust_remote_code=True,parallelize=True,load_in_4bit=True), gen_kwargs: (None), limit: 100.0, num_fewshot: None, batch_size: 1
|
52 |
| Tasks |Version|Filter|n-shot| Metric |Value| |Stderr|
|
53 |
|---------------------|-------|------|-----:|--------|----:|---|-----:|
|
|
|
48 |
|
49 |
### Performance
|
50 |
|
51 |
+
Test script:
|
52 |
+
lm_eval --model hf --model_args pretrained=upstage/SOLAR-10.7B-v1.0,peft=chenhugging/solar-10b-ocn-v1,trust_remote_code=True,parallelize=True,load_in_4bit=True --tasks ocn,aocnp,medmcqa,pubmedqa,mmlu_clinical_knowledge,mmlu_college_medicine,mmlu_professional_medicine --device cuda:0 --limit 100
|
53 |
+
|
54 |
hf (pretrained=upstage/SOLAR-10.7B-v1.0,peft=chenhugging/solar-10b-ocn-v1,trust_remote_code=True,parallelize=True,load_in_4bit=True), gen_kwargs: (None), limit: 100.0, num_fewshot: None, batch_size: 1
|
55 |
| Tasks |Version|Filter|n-shot| Metric |Value| |Stderr|
|
56 |
|---------------------|-------|------|-----:|--------|----:|---|-----:|
|