File size: 663 Bytes
bf3a8fe cd892d5 98b3728 f21feab 73615f3 2d2dece 9ed6ea0 074b813 1ae73df f19b306 e724bfe 26cb848 57f21d2 8e61b9c 6f63be8 78d374b 31a0717 88e822b e76d4aa 75ed782 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 |
---
language: en
license: mit
library_name: pytorch
---
# Plainly Optimized Network
Dataset: BIGBENCH
Trainer Hyperparameters:
- `lr` = 5e-05
- `per_device_batch_size` = 1
- `gradient_accumulation_steps` = 4
- `weight_decay` = 1e-09
- `seed` = 42
|eval_loss|eval_accuracy|epoch|
|--|--|--|
|58.940|0.054|1.0|
|54.182|0.049|2.0|
|56.362|0.051|3.0|
|52.705|0.046|4.0|
|55.357|0.050|5.0|
|53.973|0.048|6.0|
|56.034|0.050|7.0|
|51.731|0.045|8.0|
|54.661|0.048|9.0|
|50.378|0.043|10.0|
|51.579|0.044|11.0|
|51.193|0.044|12.0|
|52.724|0.046|13.0|
|52.055|0.045|14.0|
|51.406|0.044|15.0|
|51.539|0.045|16.0|
|52.422|0.046|17.0|
|50.304|0.043|18.0|
|50.937|0.044|19.0|
|