File size: 1,567 Bytes
8d75911 bff12ea 2876d0e 69f3a07 4849d80 23c5dc4 8d75911 bff12ea 4a19439 bff12ea 3ce0e3a bff12ea |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 |
---
license: creativeml-openrail-m
language:
- en
library_name: diffusers
pipeline_tag: text-to-image
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
inference:
parameters:
num_inference_steps: 50
guidance_scale: 5.0
eta: 1.0
widget:
- text: "a horse playing chess"
example_title: horse + chess
- text: "a lion washing dishes"
example_title: lion + dishes
- text: "a goat riding a bike"
example_title: goat + bike
---
# ddpo-alignment
This model was finetuned from [Stable Diffusion v1-4](https:/CompVis/stable-diffusion-v1-4) using [DDPO](https://arxiv.org/abs/2305.13301) and a reward function that uses [LLaVA](https://llava-vl.github.io/) to measure prompt-image alignment. See [the project website](https://rl-diffusion.github.io/) for more details.
The model was finetuned for 200 iterations with a batch size of 256 samples per iteration. During finetuning, we used prompts of the form: "_a(n) \<animal\> \<activity\>_". We selected the animal and activity from the following lists, so try those for the best results. However, we also observed limited generalization to other prompts.
Activities:
- washing dishes
- playing chess
- riding a bike
Animals:
- cat
- dog
- horse
- monkey
- rabbit
- zebra
- spider
- bird
- sheep
- deer
- cow
- goat
- lion
- tiger
- bear
- raccoon
- fox
- wolf
- lizard
- beetle
- ant
- butterfly
- fish
- shark
- whale
- dolphin
- squirrel
- mouse
- rat
- snake
- turtle
- frog
- chicken
- duck
- goose
- bee
- pig
- turkey
- fly
- llama
- camel
- bat
- gorilla
- hedgehog
- kangaroo |