Included gradient checkpointing
#1 by FJFehr - opened
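This is a minor change that adds support for gradient checkpointing. Checkpointing recomputes activations during the backward pass instead of storing them, trading some extra compute for lower memory use, which allows larger batch sizes when fine-tuning these models.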
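For readers unfamiliar with the technique, here is a minimal sketch of how gradient checkpointing is commonly wired into a PyTorch model. It is not the code from this change: `Block`, `TinyEncoder`, and `grads_with` are hypothetical names made up for illustration, and the `gradient_checkpointing` flag only mirrors a common convention. The one real API relied on is `torch.utils.checkpoint.checkpoint`.

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint


class Block(nn.Module):
    """Hypothetical stand-in for one transformer layer."""

    def __init__(self, dim: int) -> None:
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.ff(x)


class TinyEncoder(nn.Module):
    """Hypothetical model with an opt-in checkpointing flag."""

    def __init__(self, dim: int = 16, depth: int = 4) -> None:
        super().__init__()
        self.layers = nn.ModuleList([Block(dim) for _ in range(depth)])
        self.gradient_checkpointing = False  # assumed flag name

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            if self.gradient_checkpointing and self.training:
                # Do not keep this layer's intermediate activations;
                # they are recomputed when backward() reaches the layer.
                x = checkpoint(layer, x, use_reentrant=False)
            else:
                x = layer(x)
        return x


def grads_with(flag: bool) -> list:
    """Run one forward/backward pass and return the parameter gradients."""
    torch.manual_seed(0)  # same weights and inputs for both runs
    model = TinyEncoder()
    model.train()
    model.gradient_checkpointing = flag
    x = torch.randn(8, 16)
    model(x).sum().backward()
    return [p.grad.clone() for p in model.parameters()]


plain = grads_with(False)
ckpt = grads_with(True)
# Checkpointing changes memory use and compute, not the gradients.
same = all(torch.allclose(a, b, atol=1e-6) for a, b in zip(plain, ckpt))
```

If the model follows the `transformers` convention, checkpointing is usually switched on with `model.gradient_checkpointing_enable()` before training. Whether that applies here depends on how this change exposes the option.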