File size: 332 Bytes
13b8ee3
 
3db201c
 
 
 
 
 
13b8ee3
ea3741e
 
1
2
3
4
5
6
7
8
9
10
11
---
license: apache-2.0
datasets:
- opencsg/smoltalk-chinese
language:
- zh
base_model:
- opencsg/csg-wukong-ablation-chinese-fineweb-edu
---
* Using ``opencsg/csg-wukong-2b-chinese-fineweb-edu`` as base model, we fine-tune it on ``smoltalk-chinese`` for 2 epoch
* learning rate = 3e-4 ; global batch size = 32 ; lr scheduler=cosine