jimmycarter commited on
Commit
a8f6dd6
·
verified ·
1 Parent(s): a62b5a7

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +248 -0
README.md ADDED
@@ -0,0 +1,248 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "black-forest-labs/FLUX.1-dev"
4
+ tags:
5
+ - flux
6
+ - flux-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - lora
11
+ - template:sd-lora
12
+ inference: true
13
+ widget:
14
+ - text: 'unconditional (blank prompt)'
15
+ parameters:
16
+ negative_prompt: 'blurry, cropped, ugly'
17
+ output:
18
+ url: ./assets/image_0_0.png
19
+ - text: 'In this scene from the animated series "Helluva Boss," Loona, the wolf-like receptionist of the Immediate Murder Professionals (I.M.P), is depicted leaning against a wall outside the office. She is casually engrossed in her phone, displaying her typical aloof and detached demeanor. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
20
+ parameters:
21
+ negative_prompt: 'blurry, cropped, ugly'
22
+ output:
23
+ url: ./assets/image_1_0.png
24
+ - text: 'Loona shrugs with an exasperated expression, her red eyes wide and frustrated, as she seemingly questions or challenges something said in the I.M.P office. Still from Helluva boss. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
25
+ parameters:
26
+ negative_prompt: 'blurry, cropped, ugly'
27
+ output:
28
+ url: ./assets/image_2_0.png
29
+ - text: 'A scene from the animated series "Helluva Boss," set in the office. Loona, the wolf-like receptionist with white fur, black-tipped ears, and red eyes, is seated on a couch, facing towards the viewer. Loona''s appearance is complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts. She holds a piece of paper that says,"Welcome to Losercity, jerks". In the background, the office has a striped wall pattern and visible damage on the ceiling, indicating a chaotic or rough environment. On the right side of the image, two imp characters appear to be engaged in conversation.'
30
+ parameters:
31
+ negative_prompt: 'blurry, cropped, ugly'
32
+ output:
33
+ url: ./assets/image_3_0.png
34
+ - text: 'Loona from Helluva Boss is dressed in an oversized taco costume, looking visibly irritated and embarrassed. Her red eyes convey her annoyance as she crosses her arms and glares to the side. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes'
35
+ parameters:
36
+ negative_prompt: 'blurry, cropped, ugly'
37
+ output:
38
+ url: ./assets/image_4_0.png
39
+ - text: 'Loona is standing next to Blitzo (Helluva boss)'
40
+ parameters:
41
+ negative_prompt: 'blurry, cropped, ugly'
42
+ output:
43
+ url: ./assets/image_5_0.png
44
+ - text: 'In this "Helluva Boss" scene, Loona, the wolf-like receptionist, stands in an elevator with a tense and irritated expression, her teeth bared in a snarl. Blitzø, the red demon with distinctive black and white horns, leans close and makes an adorable look, as if asking for a favor. The ornate elevator setting hints at a tense moment, possibly involving a challenging mission or conflict within the I.M.P team.'
45
+ parameters:
46
+ negative_prompt: 'blurry, cropped, ugly'
47
+ output:
48
+ url: ./assets/image_6_0.png
49
+ - text: 'a 2D simple drawing of a madeleine cake, with a green cloud drawn next to it'
50
+ parameters:
51
+ negative_prompt: 'blurry, cropped, ugly'
52
+ output:
53
+ url: ./assets/image_7_0.png
54
+ - text: 'a 3D captivating YouTube thumbnail depicting of a full detailed,it''s on a party real people like, on front there is a giant pulling a nose of a black African real like lady down to size of elephant nose,be creative and unique'
55
+ parameters:
56
+ negative_prompt: 'blurry, cropped, ugly'
57
+ output:
58
+ url: ./assets/image_8_0.png
59
+ - text: 'Whiskers the cat. Whiskers becomes a mentor to other animals.Impressed by Whiskers'' intelligence, other animals in the neighborhood seek his guidance. Whiskers sets up a virtual learning platform using AI technology, where animals can ask questions, receive personalized lessons, and acquire knowledge in various subjects. Whiskers becomes a mentor, helping others unlock their potential.'
60
+ parameters:
61
+ negative_prompt: 'blurry, cropped, ugly'
62
+ output:
63
+ url: ./assets/image_9_0.png
64
+ - text: 'As the stock market fluctuates, the investor remains calm and collected at their desk, surrounded by charts and graphs. Their tailored suit and polished briefcase are a symbol of their expertise and experience in the world of finance. '
65
+ parameters:
66
+ negative_prompt: 'blurry, cropped, ugly'
67
+ output:
68
+ url: ./assets/image_10_0.png
69
+ - text: 'loona from helluva boss is eating a donut'
70
+ parameters:
71
+ negative_prompt: 'blurry, cropped, ugly'
72
+ output:
73
+ url: ./assets/image_11_0.png
74
+ ---
75
+
76
+ # flux-training-losercity-next-lycoris6
77
+
78
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
79
+
80
+
81
+ The main validation prompt used during training was:
82
+
83
+
84
+
85
+ ```
86
+ loona from helluva boss is eating a donut
87
+ ```
88
+
89
+ ## Validation settings
90
+ - CFG: `3.5`
91
+ - CFG Rescale: `0.0`
92
+ - Steps: `15`
93
+ - Sampler: `None`
94
+ - Seed: `42`
95
+ - Resolution: `1024`
96
+
97
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
98
+
99
+ You can find some example images in the following gallery:
100
+
101
+
102
+ <Gallery />
103
+
104
+ The text encoder **was not** trained.
105
+ You may reuse the base model text encoder for inference.
106
+
107
+
108
+ ## Training settings
109
+
110
+ - Training epochs: 0
111
+ - Training steps: 100
112
+ - Learning rate: 0.0006
113
+ - Effective batch size: 16
114
+ - Micro-batch size: 2
115
+ - Gradient accumulation steps: 8
116
+ - Number of GPUs: 1
117
+ - Prediction type: flow-matching
118
+ - Rescaled betas zero SNR: False
119
+ - Optimizer: adamw_bf16
120
+ - Precision: bf16
121
+ - Quantised: Yes: fp8-quanto
122
+ - Xformers: Not used
123
+ - LyCORIS Config:
124
+ ```json
125
+ {
126
+ "algo": "lokr",
127
+ "multiplier": 1.0,
128
+ "linear_dim": 1000000,
129
+ "linear_alpha": 1,
130
+ "full_matrix": true,
131
+ "factor": 12,
132
+ "apply_preset": {
133
+ "target_module": [
134
+ "FluxTransformer2DModelWithMasking"
135
+ ],
136
+ "module_algo_map": {
137
+ "FeedForward": {
138
+ "factor": 6
139
+ }
140
+ }
141
+ }
142
+ }
143
+ ```
144
+
145
+ ## Datasets
146
+
147
+ ### default_dataset_arb
148
+ - Repeats: 9999
149
+ - Total number of images: 41
150
+ - Total number of aspect buckets: 1
151
+ - Resolution: 1.33 megapixels
152
+ - Cropped: False
153
+ - Crop style: None
154
+ - Crop aspect: None
155
+ ### default_dataset_arb2
156
+ - Repeats: 9999
157
+ - Total number of images: 2565
158
+ - Total number of aspect buckets: 1
159
+ - Resolution: 1.33 megapixels
160
+ - Cropped: False
161
+ - Crop style: None
162
+ - Crop aspect: None
163
+ ### default_dataset
164
+ - Repeats: 9999
165
+ - Total number of images: 42
166
+ - Total number of aspect buckets: 1
167
+ - Resolution: 1.048576 megapixels
168
+ - Cropped: True
169
+ - Crop style: center
170
+ - Crop aspect: square
171
+ ### default_dataset_512
172
+ - Repeats: 9999
173
+ - Total number of images: 42
174
+ - Total number of aspect buckets: 1
175
+ - Resolution: 0.262144 megapixels
176
+ - Cropped: True
177
+ - Crop style: center
178
+ - Crop aspect: square
179
+ ### default_dataset_640
180
+ - Repeats: 9999
181
+ - Total number of images: 42
182
+ - Total number of aspect buckets: 1
183
+ - Resolution: 0.4096 megapixels
184
+ - Cropped: True
185
+ - Crop style: center
186
+ - Crop aspect: square
187
+ ### default_dataset_768
188
+ - Repeats: 9999
189
+ - Total number of images: 42
190
+ - Total number of aspect buckets: 1
191
+ - Resolution: 0.589824 megapixels
192
+ - Cropped: True
193
+ - Crop style: center
194
+ - Crop aspect: square
195
+ ### default_dataset_896
196
+ - Repeats: 9999
197
+ - Total number of images: 42
198
+ - Total number of aspect buckets: 1
199
+ - Resolution: 0.802816 megapixels
200
+ - Cropped: True
201
+ - Crop style: center
202
+ - Crop aspect: square
203
+ ### default_dataset_uncaptioned
204
+ - Repeats: 9999
205
+ - Total number of images: 2565
206
+ - Total number of aspect buckets: 1
207
+ - Resolution: 1.048576 megapixels
208
+ - Cropped: True
209
+ - Crop style: center
210
+ - Crop aspect: square
211
+ ### default_dataset_uncaptioned_512
212
+ - Repeats: 9999
213
+ - Total number of images: 2565
214
+ - Total number of aspect buckets: 1
215
+ - Resolution: 0.262144 megapixels
216
+ - Cropped: True
217
+ - Crop style: center
218
+ - Crop aspect: square
219
+
220
+
221
+ ## Inference
222
+
223
+
224
+ ```python
225
+ import torch
226
+ from diffusers import DiffusionPipeline
227
+ from lycoris import create_lycoris_from_weights
228
+
229
+ model_id = 'black-forest-labs/FLUX.1-dev'
230
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
231
+ lora_scale = 1.0
232
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
233
+ wrapper.merge_to()
234
+
235
+ prompt = "loona from helluva boss is eating a donut"
236
+
237
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
238
+ image = pipeline(
239
+ prompt=prompt,
240
+ num_inference_steps=15,
241
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
242
+ width=1024,
243
+ height=1024,
244
+ guidance_scale=3.5,
245
+ ).images[0]
246
+ image.save("output.png", format="PNG")
247
+ ```
248
+