Linaqruf commited on
Commit
c140027
·
verified ·
1 Parent(s): 4c2b2dc

fix some human error when defining training config

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -219,16 +219,16 @@ These are the key hyperparameters used during training:
219
  | Feature | Pretraining | Finetuning |
220
  |-------------------------------|----------------------------|---------------------------------|
221
  | **Hardware** | 2x H100 80GB PCIe | 2x A100 80GB PCIe |
222
- | **Batch Size** | 64 | 48 |
223
  | **Gradient Accumulation Steps** | 2 | 1 |
224
  | **Noise Offset** | None | 0.0357 |
225
  | **Epochs** | 10 | 10 |
226
- | **UNet Learning Rate** | 7.5e-6 | 7.5e-6 |
227
- | **Text Encoder Learning Rate** | 3.75e-6 | None |
228
- | **Optimizer** | AdamW8bit | Adafactor |
229
- | **Optimizer Args** | Weight Decay: 0.1, Betas: (0.9, 0.99) | Scale Parameter: False, Relative Step: False, Warmup Init: False |
230
  | **Scheduler** | Constant with Warmups | Constant with Warmups |
231
- | **Warmup Steps** | 0.5% | 0.5% |
232
 
233
  ## License
234
 
 
219
  | Feature | Pretraining | Finetuning |
220
  |-------------------------------|----------------------------|---------------------------------|
221
  | **Hardware** | 2x H100 80GB PCIe | 2x A100 80GB PCIe |
222
+ | **Batch Size** | 32 | 48 |
223
  | **Gradient Accumulation Steps** | 2 | 1 |
224
  | **Noise Offset** | None | 0.0357 |
225
  | **Epochs** | 10 | 10 |
226
+ | **UNet Learning Rate** | 5e-6 | 2e-6 |
227
+ | **Text Encoder Learning Rate** | 2.5e-6 | None |
228
+ | **Optimizer** | Adafactor | Adafactor |
229
+ | **Optimizer Args** | Scale Parameter: False, Relative Step: False, Warmup Init: False | Scale Parameter: False, Relative Step: False, Warmup Init: False |
230
  | **Scheduler** | Constant with Warmups | Constant with Warmups |
231
+ | **Warmup Steps** | 0.05% | 0.05% |
232
 
233
  ## License
234