hoan17/Dev_dataset_index3_Final_Pickapic_e500

Tags: Text-to-Image, Diffusers, Safetensors, StableDiffusionPipeline, trl, ddpo, reinforcement-learning, stable-diffusion


TRL DDPO Model

This is a Stable Diffusion model fine-tuned with reinforcement learning (DDPO, via the TRL library) so that its outputs are steered by a reward signal, such as a value function or human feedback. It generates images conditioned on a text prompt.
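The card does not include usage code, so the following is only a minimal sketch of how the model might be loaded. It assumes the repository contains a complete `StableDiffusionPipeline` (the page tags suggest this, but the card does not confirm it); if it holds only LoRA weights, a base Stable Diffusion checkpoint would have to be loaded first. The helper names `choose_device_and_dtype` and `generate`, the step count, and the seed are hypothetical choices made for illustration.

```python
# Repo id as shown on this page.
MODEL_ID = "hoan17/Dev_dataset_index3_Final_Pickapic_e500"


def choose_device_and_dtype(cuda_available: bool) -> tuple[str, str]:
    """Hypothetical helper: half precision on GPU, full precision on CPU."""
    return ("cuda", "float16") if cuda_available else ("cpu", "float32")


def generate(prompt: str, steps: int = 30, seed: int = 0):
    """Hypothetical helper: generate one image for a text prompt.

    Assumes the repo loads as a full StableDiffusionPipeline.
    """
    # Imported here so the heavy dependencies are only needed when generating.
    import torch
    from diffusers import StableDiffusionPipeline

    device, dtype_name = choose_device_and_dtype(torch.cuda.is_available())
    pipe = StableDiffusionPipeline.from_pretrained(
        MODEL_ID, torch_dtype=getattr(torch, dtype_name)
    ).to(device)

    # A seeded generator makes the sample reproducible on a given device.
    generator = torch.Generator(device=device).manual_seed(seed)
    result = pipe(prompt, num_inference_steps=steps, generator=generator)
    return result.images[0]  # a PIL image
```

Under these assumptions, `generate("a corgi wearing sunglasses").save("sample.png")` would download the weights on first use and write one image to disk.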

