gjuggler's picture
End of training
cb1144e
raw
history blame
206 Bytes
{
"epoch": 9.99,
"total_flos": 5.40401463321831e+18,
"train_loss": 1.548523964753022,
"train_runtime": 6742.5459,
"train_samples_per_second": 31.917,
"train_steps_per_second": 0.11
}