winglian's picture
use pythia-12b, neox-20b is flaky
3961902
|
raw
history blame
206 Bytes
# Python 12B
- Single-GPU A100 only (?)
```shell
python scripts/finetune.py examples/pythia-12b/config.yml
```
⚠️ Multiple-GPU A100 - Doesn't seem to work with multi-gpu without causing OOM! ⚠️