fix some human error when defining training config
Browse files
README.md
CHANGED
@@ -219,16 +219,16 @@ These are the key hyperparameters used during training:
|
|
219 |
| Feature | Pretraining | Finetuning |
|
220 |
|-------------------------------|----------------------------|---------------------------------|
|
221 |
| **Hardware** | 2x H100 80GB PCIe | 2x A100 80GB PCIe |
|
222 |
-
| **Batch Size** |
|
223 |
| **Gradient Accumulation Steps** | 2 | 1 |
|
224 |
| **Noise Offset** | None | 0.0357 |
|
225 |
| **Epochs** | 10 | 10 |
|
226 |
-
| **UNet Learning Rate** |
|
227 |
-
| **Text Encoder Learning Rate** |
|
228 |
-
| **Optimizer** |
|
229 |
-
| **Optimizer Args** |
|
230 |
| **Scheduler** | Constant with Warmups | Constant with Warmups |
|
231 |
-
| **Warmup Steps** | 0.
|
232 |
|
233 |
## License
|
234 |
|
|
|
219 |
| Feature | Pretraining | Finetuning |
|
220 |
|-------------------------------|----------------------------|---------------------------------|
|
221 |
| **Hardware** | 2x H100 80GB PCIe | 2x A100 80GB PCIe |
|
222 |
+
| **Batch Size** | 32 | 48 |
|
223 |
| **Gradient Accumulation Steps** | 2 | 1 |
|
224 |
| **Noise Offset** | None | 0.0357 |
|
225 |
| **Epochs** | 10 | 10 |
|
226 |
+
| **UNet Learning Rate** | 5e-6 | 2e-6 |
|
227 |
+
| **Text Encoder Learning Rate** | 2.5e-6 | None |
|
228 |
+
| **Optimizer** | Adafactor | Adafactor |
|
229 |
+
| **Optimizer Args** | Scale Parameter: False, Relative Step: False, Warmup Init: False | Scale Parameter: False, Relative Step: False, Warmup Init: False |
|
230 |
| **Scheduler** | Constant with Warmups | Constant with Warmups |
|
231 |
+
| **Warmup Steps** | 0.05% | 0.05% |
|
232 |
|
233 |
## License
|
234 |
|