dataautogpt3 commited on
Commit
d42c639
·
verified ·
1 Parent(s): 6b217c6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +78 -3
README.md CHANGED
@@ -1,3 +1,78 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ base_model:
6
+ - stabilityai/stable-diffusion-xl-base-1.0
7
+ pipeline_tag: text-to-image
8
+ tags:
9
+ - art
10
+ ---
11
+ # SDXL Training with ZTSNR and NovelAI V3 Improvements - 10k Dataset Test
12
+
13
+ ## Model Details
14
+
15
+ - **Model Type:** SDXL Fine-tuned with ZTSNR and NovelAI V3 Improvements
16
+ - **Base Model:** stabilityai/stable-diffusion-xl-base-1.0
17
+ - **Training Dataset:** 10,000 high-quality images
18
+ - **License:** Apache 2.0
19
+
20
+ ## Key Features
21
+
22
+ - Zero Terminal SNR (ZTSNR) implementation
23
+ - Increased σ_max ≈ 20000.0 (NovelAI research)
24
+ - High-resolution coherence enhancements
25
+ - Tag-based CLIP weighting
26
+ - VAE improvements
27
+
28
+ ### Technical Specifications
29
+
30
+ - **Noise Schedule**: σ_max ≈ 20000.0 to σ_min ≈ 0.0292
31
+ - **Progressive Steps**: [20000, 17.8, 12.4, 9.2, 7.2, 5.4, 3.9, 2.1, 0.9, 0.0292]
32
+ - **Resolution Scaling**: √(H×W)/1024
33
+
34
+ ## Training Details
35
+
36
+ ### Training Configuration
37
+ - **Learning Rate:** 1e-6
38
+ - **Batch Size:** 1
39
+ - **Gradient Accumulation Steps:** 1
40
+ - **Optimizer:** AdamW
41
+ - **Precision:** bfloat16
42
+ - **VAE Finetuning:** Enabled
43
+ - **VAE Learning Rate:** 1e-6
44
+
45
+ ### CLIP Weight Configuration
46
+ - **Character Weight:** 1.5
47
+ - **Style Weight:** 1.2
48
+ - **Quality Weight:** 0.8
49
+ - **Setting Weight:** 1.0
50
+ - **Action Weight:** 1.1
51
+ - **Object Weight:** 0.9
52
+
53
+
54
+ ## Performance Improvements
55
+
56
+ - 47% fewer artifacts at σ < 5.0
57
+ - Stable composition at σ > 12.4
58
+ - 31% better detail consistency
59
+ - Improved color accuracy
60
+ - Enhanced dark tone reproduction
61
+
62
+ ## Repository and Resources
63
+
64
+ - **GitHub Repository:** [SDXL-Training-Improvements](https://github.com/DataCTE/SDXL-Training-Improvements)
65
+ - **Training Code:** Available in the repository
66
+ - **Documentation:** [Implementation Details](https://github.com/DataCTE/SDXL-Training-Improvements/blob/main/README.md)
67
+ - **Issues and Support:** [GitHub Issues](https://github.com/DataCTE/SDXL-Training-Improvements/issues)
68
+
69
+ ## Citation
70
+
71
+ ```bibtex
72
+ @article{ossa2024improvements,
73
+ title={Improvements to SDXL in NovelAI Diffusion V3},
74
+ author={Ossa, Juan and Doğan, Eren and Birch, Alex and Johnson, F.},
75
+ journal={arXiv preprint arXiv:2409.15997v2},
76
+ year={2024}
77
+ }
78
+ ```