chenhaodev
/

solar-10b-ocn-v1

Generated from Trainer

Model card Files Files and versions Community

chenhugging commited on Feb 7, 2024

Commit

9a670b7

·

verified ·

1 Parent(s): dc900cf

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -48,6 +48,9 @@ CUDA_VISIBLE_DEVICES=0 python src/train_bash.py --stage sft --do_train True --mo
 ### Performance
 hf (pretrained=upstage/SOLAR-10.7B-v1.0,peft=chenhugging/solar-10b-ocn-v1,trust_remote_code=True,parallelize=True,load_in_4bit=True), gen_kwargs: (None), limit: 100.0, num_fewshot: None, batch_size: 1
 |        Tasks        |Version|Filter|n-shot| Metric |Value|   |Stderr|
 |---------------------|-------|------|-----:|--------|----:|---|-----:|

 ### Performance
+Test script:
+lm_eval --model hf --model_args pretrained=upstage/SOLAR-10.7B-v1.0,peft=chenhugging/solar-10b-ocn-v1,trust_remote_code=True,parallelize=True,load_in_4bit=True --tasks ocn,aocnp,medmcqa,pubmedqa,mmlu_clinical_knowledge,mmlu_college_medicine,mmlu_professional_medicine --device cuda:0 --limit 100
 hf (pretrained=upstage/SOLAR-10.7B-v1.0,peft=chenhugging/solar-10b-ocn-v1,trust_remote_code=True,parallelize=True,load_in_4bit=True), gen_kwargs: (None), limit: 100.0, num_fewshot: None, batch_size: 1
 |        Tasks        |Version|Filter|n-shot| Metric |Value|   |Stderr|
 |---------------------|-------|------|-----:|--------|----:|---|-----:|