winglian's picture
use pythia-12b, neox-20b is flaky
3961902
|
raw
history blame
206 Bytes

Python 12B

  • Single-GPU A100 only (?)
python scripts/finetune.py examples/pythia-12b/config.yml

⚠️ Multiple-GPU A100 - Doesn't seem to work with multi-gpu without causing OOM! ⚠️