TRL DDPO Model

This is a diffusion model that has been fine-tuned with reinforcement learning to guide the model outputs according to a value, function, or human feedback. The model can be used for image generation conditioned with text.

Downloads last month: 30

Inference Providers NEW

Text-to-Image

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.