File size: 779 Bytes
27fc927 371b7a8 27fc927 32a4744 27fc927 3201b27 27fc927 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
license: apache-2.0
pipeline_tag: text-to-image
---
# Controlling Structure and Appearance for SSD-1B
The CtrlXStableDiffusionXLPipeline has been modified as it had way too many TODO lines. Removed the refiner phase.
Requires 8GB VRAM.
## Setup
```
pip install accelerate diffusers gradio torch safetensors transformers
```
## Inference
```python
python run_ctrlx.py --num_inference_steps 20 --guidance_scale 9.0 --model_offload --structure_image images/horse__point_cloud.jpg --appearance_image images/horse.jpg --prompt "a photo of a horse standing on grass" --structure_prompt "3D point cloud of a horse"
```
## Disclaimer
All code belongs to [Jordan Lin](https://github.com/genforce/ctrl-x), the models weight to [Segmind](https://huggingface.co/segmind/SSD-1B). |