---
license: apache-2.0
datasets:
- opencsg/smoltalk-chinese
language:
- zh
base_model:
- opencsg/csg-wukong-ablation-chinese-fineweb-edu
---
* Using ``opencsg/csg-wukong-2b-chinese-fineweb-edu`` as the base model, we fine-tune it on ``smoltalk-chinese`` for 2 epochs.
* Learning rate = 3e-4; global batch size = 32; LR scheduler = cosine.
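
To make the hyperparameters above concrete, here is a minimal sketch of how a cosine schedule with a peak learning rate of 3e-4 decays over training, and how the number of optimizer steps follows from a global batch size of 32 and 2 epochs. It is not the training code used for this model: the absence of warmup, the decay to exactly 0, the helper names `cosine_lr` and `total_optimizer_steps`, and the dataset size in the usage example are all assumptions made for illustration.

```python
import math

PEAK_LR = 3e-4          # from the model card
GLOBAL_BATCH_SIZE = 32  # from the model card
NUM_EPOCHS = 2          # from the model card


def cosine_lr(step: int, total_steps: int, peak_lr: float = PEAK_LR) -> float:
    """Cosine decay from peak_lr down to 0.

    Assumption: no warmup phase and a final learning rate of 0;
    the model card does not state either detail.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp the step into [0, total_steps] so out-of-range values are safe.
    progress = min(max(step, 0), total_steps) / total_steps
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


def total_optimizer_steps(num_examples: int,
                          global_batch_size: int = GLOBAL_BATCH_SIZE,
                          num_epochs: int = NUM_EPOCHS) -> int:
    """Optimizer updates over all epochs, counting a partial last batch."""
    steps_per_epoch = math.ceil(num_examples / global_batch_size)
    return steps_per_epoch * num_epochs


if __name__ == "__main__":
    # Hypothetical dataset size, chosen only to illustrate the arithmetic.
    steps = total_optimizer_steps(num_examples=64_000)
    for s in (0, steps // 2, steps):
        print(f"step {s}: lr = {cosine_lr(s, steps):.2e}")
```

Under these assumptions the learning rate starts at the peak, reaches half the peak at the midpoint of training, and ends at 0. A real run may add warmup or a non-zero floor, which would change the curve.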