---
license: creativeml-openrail-m
language:
- en
library_name: diffusers
pipeline_tag: text-to-image
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
inference:
  parameters:
    num_inference_steps: 50
    guidance_scale: 5.0
    eta: 1.0
---
# ddpo-aesthetic
This model was finetuned from [Stable Diffusion v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5) using [DDPO](https://arxiv.org/abs/2305.13301) and a reward function that favors JPEG-compressible images. See [the project website](https://rl-diffusion.github.io/) for more details.
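
A JPEG-compressibility reward can be sketched as the negative size of the image after JPEG encoding, so that smaller files score higher. The snippet below is only an illustration of that idea, not the training code: the function name `jpeg_compressibility_reward`, the `quality=95` setting, and the choice of kilobytes as the unit are all assumptions.

```python
import io

from PIL import Image


def jpeg_compressibility_reward(image: Image.Image, quality: int = 95) -> float:
    """Return the negative JPEG size in kB (hypothetical reward sketch).

    The quality setting and the kB unit are assumptions made for illustration.
    """
    buffer = io.BytesIO()
    # Encode the image as JPEG in memory instead of writing to disk.
    image.convert("RGB").save(buffer, format="JPEG", quality=quality)
    size_kb = len(buffer.getvalue()) / 1000.0
    # Smaller files are more compressible, so they receive a higher reward.
    return -size_kb
```

Under this sketch, a flat single-color image receives a reward close to zero, while a noisy image receives a much more negative one.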
The model was finetuned for 60 iterations with a batch size of 256 samples per iteration. During finetuning, prompts were drawn from the 398 animal classes in [ImageNet-1000](https://deeplearning.cms.waikato.ac.nz/user-guide/class-maps/IMAGENET/), but the model also generalizes well to other prompts.
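
The following is a minimal sketch of sampling from the model with `diffusers`, reusing the inference parameters from the metadata above (50 steps, guidance scale 5.0, `eta` 1.0). The repository id in `MODEL_ID` is an assumption and should be replaced with the actual location of this checkpoint. The scheduler is switched to DDIM because `eta` only has an effect with `DDIMScheduler`.

```python
# Sampling settings taken from the model card metadata.
SAMPLING_KWARGS = {
    "num_inference_steps": 50,
    "guidance_scale": 5.0,
    "eta": 1.0,  # only used by DDIMScheduler; other schedulers ignore it
}

# Assumed repository id; replace with the real location of this checkpoint.
MODEL_ID = "kvablack/ddpo-aesthetic"


def generate(prompt: str, model_id: str = MODEL_ID, seed: int = 0):
    """Generate one image for `prompt` and return it as a PIL image."""
    # Imported here so the constants above can be inspected without
    # the heavy dependencies installed.
    import torch
    from diffusers import DDIMScheduler, StableDiffusionPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32

    pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=dtype)
    # Swap in DDIM so that the eta parameter takes effect.
    pipe.scheduler = DDIMScheduler.from_config(pipe.scheduler.config)
    pipe = pipe.to(device)

    generator = torch.Generator(device=device).manual_seed(seed)
    result = pipe(prompt, generator=generator, **SAMPLING_KWARGS)
    return result.images[0]


if __name__ == "__main__":
    generate("a photo of a red fox").save("fox.png")
```

Running the script downloads the weights on first use and writes a single image to `fox.png`; the prompt is just an example in the spirit of the animal prompts used during finetuning.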