---
license: apache-2.0
pipeline_tag: text-to-image
---

# Controlling Structure and Appearance for SSD-1B

The `CtrlXStableDiffusionXLPipeline` has been modified because the original contained too many TODO lines. The refiner phase has been removed.

Requires 8 GB of VRAM.

## Setup

```
pip install accelerate diffusers gradio torch safetensors transformers
```

## Inference

```bash
python run_ctrlx.py \
  --num_inference_steps 20 \
  --guidance_scale 9.0 \
  --model_offload \
  --structure_image images/horse__point_cloud.jpg \
  --appearance_image images/horse.jpg \
  --prompt "a photo of a horse standing on grass" \
  --structure_prompt "3D point cloud of a horse"
```
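
For scripted or batch runs, the same command can be assembled from Python. The snippet below is a minimal sketch, not part of this repository: the helper name `build_ctrlx_command` is hypothetical, it only uses the flags shown in the command above, and it assumes `--model_offload` is a plain on/off switch that can be left out.

```python
import shlex


def build_ctrlx_command(structure_image, appearance_image, prompt,
                        structure_prompt, steps=20, guidance_scale=9.0,
                        model_offload=True):
    """Build the argument list for one run_ctrlx.py invocation.

    Hypothetical helper: it mirrors the flags from the README command
    and makes no claims about other options the script may accept.
    """
    cmd = [
        "python", "run_ctrlx.py",
        "--num_inference_steps", str(steps),
        "--guidance_scale", str(guidance_scale),
        "--structure_image", structure_image,
        "--appearance_image", appearance_image,
        "--prompt", prompt,
        "--structure_prompt", structure_prompt,
    ]
    # Assumption: --model_offload is a boolean switch with no value.
    if model_offload:
        cmd.append("--model_offload")
    return cmd


cmd = build_ctrlx_command(
    structure_image="images/horse__point_cloud.jpg",
    appearance_image="images/horse.jpg",
    prompt="a photo of a horse standing on grass",
    structure_prompt="3D point cloud of a horse",
)
# Print a copy-pasteable shell command; quoting is handled by shlex.
print(shlex.join(cmd))
```

To execute the command rather than print it, pass the list to `subprocess.run(cmd, check=True)` from the repository root.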

## Disclaimer

All code belongs to [Jordan Lin](https://github.com/genforce/ctrl-x); the model weights belong to [Segmind](https://huggingface.co/segmind/SSD-1B).