---
datasets:
- cQueenccc/Vivian-Blip-Captions
language:
- en
pipeline_tag: text-to-image
---

# Disclaimer

This project was inspired by https://github.com/YaYaB/finetune-diffusion

# Model Card for Finetuning Stable Diffusion on Vivian Maier's photographs

The main goal is to fine-tune the Stable Diffusion model to generate images that reflect the distinctive photographic style of Vivian Maier.

I chose to use a Jupyter Notebook to make the fine-tuning process accessible and easy to understand, particularly for those new to the diffusion pipeline and the Hugging Face API.

# Requirements

To launch fine-tuning with a `batch_size` of 1, you need a GPU with at least 24 GB of VRAM (you can use gradient accumulation to simulate a larger batch size).
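
To illustrate why gradient accumulation can stand in for a larger batch, here is a minimal sketch in plain Python. The toy linear model, the sample values, and the helper names `grad_mse` and `accumulated_grad` are all assumptions made for illustration; they are not taken from the notebook, where the training framework's optimizer would do this work.

```python
# Toy model: predict y = w * x, trained with a mean-squared-error loss.
def grad_mse(w, batch):
    """Gradient of mean((w*x - y)^2) over `batch` with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def accumulated_grad(w, samples, accumulation_steps):
    """Accumulate gradients from micro-batches of size 1.

    Each micro-batch gradient is scaled by 1 / accumulation_steps so the
    accumulated value matches the gradient of one batch of that size.
    """
    total = 0.0
    for sample in samples[:accumulation_steps]:
        total += grad_mse(w, [sample]) / accumulation_steps
    return total


# Made-up (x, y) pairs, purely illustrative.
samples = [(1.0, 2.0), (2.0, 4.5), (3.0, 5.5), (4.0, 8.5)]
w = 0.5

full_batch = grad_mse(w, samples)              # one batch of size 4
accumulated = accumulated_grad(w, samples, 4)  # four batches of size 1

# The two gradients agree, so a single optimizer step after four
# accumulated micro-batches behaves like a step on a batch of 4.
print(abs(full_batch - accumulated) < 1e-9)
```

Only one sample is processed at a time, so peak memory stays close to that of a batch of 1, while the weight update is computed from several samples.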

Make sure you have enough disk space: the model uses ~11 GB.
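
A quick way to check the available space before downloading is sketched below, using only the Python standard library. The helper name `has_enough_disk_space` and the choice of the current directory as the download location are assumptions for illustration.

```python
import shutil

REQUIRED_GB = 11  # approximate model size stated above


def has_enough_disk_space(path=".", required_gb=REQUIRED_GB):
    """Return True if the filesystem containing `path` has at least `required_gb` free.

    The threshold uses 1024**3 bytes per GB, which is slightly conservative.
    """
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_gb * 1024**3


if has_enough_disk_space():
    print("Enough free disk space for the model.")
else:
    print(f"Less than {REQUIRED_GB} GB free; free up space before downloading.")
```

Pass a different `path` if the model cache lives on another drive.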

## Examples (at epoch 90)

![vv1.jpg](https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval/A%20woman%20walking%20down%20the%20street/A%20woman%20walking%20down%20the%20street_90_000000.png)

> A woman walking down the street

![vv2.jpg](https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval/a%20group%20of%20people%20getting%20on%20a%20bus/a%20group%20of%20people%20getting%20on%20a%20bus_90_000000.png)

> a group of people getting on a bus

![vv3.jpg](https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval/two%20men%20working%20on%20a%20construction%20site/two%20men%20working%20on%20a%20construction%20site_90_000000.png)

> two men working on a construction site
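
The three links above appear to follow a common pattern: the prompt as a folder name, then the prompt, the epoch, and a six-digit counter as the file name. The sketch below rebuilds such a URL from a prompt. The helper name `eval_image_url` and the reading of the trailing `000000` as a sample index are assumptions, and images for other prompts or epochs may not exist in the repository.

```python
from urllib.parse import quote

BASE_URL = "https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/resolve/main/eval"


def eval_image_url(prompt, epoch, index=0):
    """Build an evaluation image URL following the pattern of the links above.

    `index` is assumed to be a per-prompt sample counter (hypothetical).
    """
    folder = quote(prompt)  # percent-encode the prompt; spaces become %20
    return f"{BASE_URL}/{folder}/{folder}_{epoch}_{index:06d}.png"


print(eval_image_url("a group of people getting on a bus", 90))
```

For the prompt and epoch shown, this reproduces the second example link above.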

## Citation

If you use this model, please cite it as:

```
@misc{cqueenccc2023vivian,
  author = {cQueenccc},
  title = {Finetuning Stable Diffusion on Vivian Maier's photographs},
  year = {2023},
  howpublished = {\url{https://huggingface.co/cQueenccc/Fine-Tune-Diffusion-Vivian/}}
}
```