Update README.md
README.md CHANGED
@@ -17,6 +17,6 @@ inference:
 
 # ddpo-incompressibility
 
-This model was finetuned from [Stable Diffusion v1-
+This model was finetuned from [Stable Diffusion v1-4](https:/CompVis/stable-diffusion-v1-4) using [DDPO](https://arxiv.org/abs/2305.13301) and a reward function encouraging images that are _not_ JPEG-compressible. See [the project website](https://rl-diffusion.github.io/) for more details.
 
 The model was finetuned for 20 iterations with a batch size of 256 samples per iteration. During finetuning, it was prompted with all of the animals in the [Imagenet-1000](https://deeplearning.cms.waikato.ac.nz/user-guide/class-maps/IMAGENET/) categories (the first 398 categories), but exhibits some generalization to other prompts.
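The added README line describes a reward that favors images that are _not_ JPEG-compressible. One plausible reading is "score an image by its size after compression", so that detailed, noisy images score higher than flat ones. The sketch below illustrates only that idea and is not taken from the DDPO codebase: it swaps in `zlib` from the Python standard library as a stand-in for a JPEG encoder, and the function name `incompressibility_reward` and the 64x64 test buffers are hypothetical.

```python
import random
import zlib


def incompressibility_reward(pixels: bytes) -> float:
    """Return the compressed size of a raw pixel buffer in kilobytes.

    Assumption: zlib stands in for a JPEG encoder here, purely so the
    sketch runs with the standard library. Larger values mean the
    buffer is harder to compress.
    """
    return len(zlib.compress(pixels, 9)) / 1000.0


# Two hypothetical 64x64 RGB buffers: one flat gray, one uniform noise.
num_bytes = 64 * 64 * 3
flat_image = bytes([128]) * num_bytes

rng = random.Random(0)  # fixed seed so the example is reproducible
noisy_image = bytes(rng.getrandbits(8) for _ in range(num_bytes))

flat_reward = incompressibility_reward(flat_image)
noisy_reward = incompressibility_reward(noisy_image)

# The noisy buffer compresses far worse, so it earns the higher reward.
print(f"flat: {flat_reward:.3f} kB, noisy: {noisy_reward:.3f} kB")
```

Under this kind of reward, finetuning would be expected to push generations toward high-frequency detail, since smooth regions compress well and therefore score low.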