awacke1 commited on
Commit
8858935
Β·
verified Β·
1 Parent(s): 8d76fc6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +186 -87
README.md CHANGED
@@ -27,6 +27,106 @@ Detail Graph: https://github.com/AaronCWacker/Yggdrasil/blob/main/Mermaid/Multim
27
 
28
  5. ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/CfbenG3-7zxOxmuc5hRvJ.png)
29
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
30
  # Architecture to AI Space
31
 
32
  # πŸ€– AI Types and Capabilities Classification
@@ -172,102 +272,101 @@ Detail Graph: https://github.com/AaronCWacker/Yggdrasil/blob/main/Mermaid/Multim
172
  - `RLHF.Reinforce.Learn.With.Human.Feedback`
173
 
174
 
 
175
 
176
- # AI Pipeline Architecture Models for Multimodal Model Space Workflows
177
-
178
- Zero GPU and GPU Spaces of note:
179
-
180
- 1. https://huggingface.co/spaces/awacke1/midi-composer
181
-
182
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/n-PkaPq3PIkhWoca7S8cY.png)
183
-
184
- 2. https://huggingface.co/spaces/awacke1/OpenDalleV1.1-GPU-Demo
185
-
186
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/JXQp7Zms-cNEc7zsSnuN-.png)
187
-
188
- 3. https://huggingface.co/spaces/awacke1/AudioFileGenerationWithSDAudio
189
-
190
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/5Jzmziz_KCRBMEGrVJ22g.png)
191
-
192
- 4. https://huggingface.co/spaces/multimodalart/stable-video-diffusion
193
-
194
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/0xNstTp_AtGYNJaJRlHg1.png)
195
-
196
- 5. https://huggingface.co/spaces/mukaist/Midjourney
197
-
198
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/SFnE05CGFi7SYtLwTLit3.png)
199
-
200
- 6. https://huggingface.co/spaces/multimodalart/stable-cascade
201
-
202
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/S2atWZqgj6xnG6c8N8MWZ.png)
203
-
204
- 7. https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
205
-
206
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/FHPref69BgJmNxPO7uxT-.png)
207
-
208
- 8. https://huggingface.co/spaces/ehristoforu/dalle-3-xl-lora-v2
209
-
210
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/RLuL7SuQmVv53DisDCkXx.png)
211
-
212
- 9. https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-large
213
-
214
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/xjUKEsH6jcdMA3yGMfTI8.png)
215
-
216
- 10. https://huggingface.co/spaces/TencentARC/InstantMesh
217
-
218
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/8mH5XFoIOXjw4n4XhlXVb.png)
219
-
220
- 11. https://huggingface.co/spaces/playgroundai/playground-v2.5
221
 
222
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/YHIj0Ts14Gt1iSmAQicrw.png)
223
 
224
- 12. https://huggingface.co/spaces/jasperai/Flux.1-dev-Controlnet-Upscaler
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
225
 
226
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/kU6f8n_gpYsSOjxpq5XbV.png)
227
 
228
- 13. https://huggingface.co/spaces/akhaliq/anychat
 
 
 
 
229
 
230
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/IOpK_ueGakzkF9knGg3es.png)
 
 
 
231
 
 
 
 
 
232
 
233
 
234
-
235
- 15. https://huggingface.co/spaces/awacke1/stable-video-diffusion
236
- 16. https://huggingface.co/spaces/awacke1/GPT-4o-omni-text-audio-image-video
237
- 17. https://huggingface.co/spaces/awacke1/Arxiv-Paper-Search-And-QA-RAG-Pattern
238
- 18. https://huggingface.co/spaces/awacke1/AnthropicClaude3.5Sonnet-ACW
239
- 19. https://huggingface.co/spaces/awacke1/MusicGenStreamFacebook
240
- 20. https://huggingface.co/spaces/awacke1/AIKnowledgeTreeBuilder
241
- 21. https://huggingface.co/spaces/awacke1/RescuerOfStolenBikes
242
- 22. https://huggingface.co/spaces/awacke1/PDF-text-to-speech-Per-Page
243
- 23. https://huggingface.co/spaces/awacke1/AzureCosmosDBUI
244
- 24. https://huggingface.co/spaces/awacke1/MSGraphAPI
245
- 25. https://huggingface.co/spaces/awacke1/RealTimeAsyncASR
246
- 26. https://huggingface.co/spaces/awacke1/whisper-web
247
- 27. https://huggingface.co/spaces/awacke1/MarkdownMagicEditor.md
248
- 28. https://huggingface.co/spaces/awacke1/EZ-Voice-Clone-From-Long-Text
249
- 29. https://huggingface.co/spaces/awacke1/Edge-TTS-Text-to-Speech
250
- 30. https://huggingface.co/spaces/awacke1/StreamlitDotEdgeGraphViz-Images-SVG (Image button and rounded buttons)
251
- 31. https://huggingface.co/spaces/awacke1/DynamicMapCreator2
252
- 32. https://huggingface.co/spaces/awacke1/RealTimeAsyncASR
253
- 33. https://huggingface.co/spaces/awacke1/realtime-whisper-webgpu
254
- 34. https://huggingface.co/collections/Xenova/transformersjs-demos-64f9c4f49c099d93dbc611df
255
- 35. https://huggingface.co/spaces/awacke1/Models-Datasets-Spaces-Search-Hub
256
- 36. https://huggingface.co/spaces/awacke1/AutoMLUsingStreamlit-Plotly (CSV to Plotly Graph Objects)
257
- 37. https://huggingface.co/spaces/awacke1/Multiplayer-Eval-App-Upvote
258
- 38. https://huggingface.co/spaces/awacke1/AI-MovieMaker-Comedy
259
- 39. https://huggingface.co/spaces/awacke1/Multiplayer-Image-Evaluate-Upvote
260
- 40. https://huggingface.co/spaces/awacke1/Multiplayer-USMLE-Story-Voting-Evals
261
- 41. https://huggingface.co/spaces/awacke1/Multiplayer-RLHF-Evals
262
- 42. https://huggingface.co/spaces/awacke1/RLHF.Reinforce.Learn.With.Human.Feedback
263
- 43. https://huggingface.co/spaces/awacke1/MultiuserFarmSimulator
264
- 44. https://huggingface.co/spaces/awacke1/SocketIO-Multiplayer-DrawingGame
265
- 45. https://huggingface.co/spaces/awacke1/Multiplayer-Self-Play-Sankey-Simulator
266
- 46. https://huggingface.co/spaces/awacke1/Multiplayer-Action-Battle
267
- 47. https://huggingface.co/spaces/awacke1/Multiplayer-Image-Character-Terrain
268
- 48. https://huggingface.co/spaces/awacke1/MultiplayerMapMovement
269
- 49. https://huggingface.co/spaces/awacke1/MultiplayerDiceGameFromGrok
270
- 50. https://huggingface.co/spaces/awacke1/AaronWackerMusic
271
 
272
 
273
 
 
27
 
28
  5. ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/CfbenG3-7zxOxmuc5hRvJ.png)
29
 
30
+
31
+
32
+
33
+ # AI Pipeline Architecture Models for Multimodal Model Space Workflows
34
+
35
+ Zero GPU and GPU Spaces of note:
36
+
37
+ 1. https://huggingface.co/spaces/awacke1/midi-composer
38
+
39
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/n-PkaPq3PIkhWoca7S8cY.png)
40
+
41
+ 2. https://huggingface.co/spaces/awacke1/OpenDalleV1.1-GPU-Demo
42
+
43
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/JXQp7Zms-cNEc7zsSnuN-.png)
44
+
45
+ 3. https://huggingface.co/spaces/awacke1/AudioFileGenerationWithSDAudio
46
+
47
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/5Jzmziz_KCRBMEGrVJ22g.png)
48
+
49
+ 4. https://huggingface.co/spaces/multimodalart/stable-video-diffusion
50
+
51
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/0xNstTp_AtGYNJaJRlHg1.png)
52
+
53
+ 5. https://huggingface.co/spaces/mukaist/Midjourney
54
+
55
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/SFnE05CGFi7SYtLwTLit3.png)
56
+
57
+ 6. https://huggingface.co/spaces/multimodalart/stable-cascade
58
+
59
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/S2atWZqgj6xnG6c8N8MWZ.png)
60
+
61
+ 7. https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
62
+
63
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/FHPref69BgJmNxPO7uxT-.png)
64
+
65
+ 8. https://huggingface.co/spaces/ehristoforu/dalle-3-xl-lora-v2
66
+
67
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/RLuL7SuQmVv53DisDCkXx.png)
68
+
69
+ 9. https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-large
70
+
71
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/xjUKEsH6jcdMA3yGMfTI8.png)
72
+
73
+ 10. https://huggingface.co/spaces/TencentARC/InstantMesh
74
+
75
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/8mH5XFoIOXjw4n4XhlXVb.png)
76
+
77
+ 11. https://huggingface.co/spaces/playgroundai/playground-v2.5
78
+
79
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/YHIj0Ts14Gt1iSmAQicrw.png)
80
+
81
+ 12. https://huggingface.co/spaces/jasperai/Flux.1-dev-Controlnet-Upscaler
82
+
83
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/kU6f8n_gpYsSOjxpq5XbV.png)
84
+
85
+ 13. https://huggingface.co/spaces/akhaliq/anychat
86
+
87
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/620630b603825909dcbeba35/IOpK_ueGakzkF9knGg3es.png)
88
+
89
+
90
+
91
+
92
+ 15. https://huggingface.co/spaces/awacke1/stable-video-diffusion
93
+ 16. https://huggingface.co/spaces/awacke1/GPT-4o-omni-text-audio-image-video
94
+ 17. https://huggingface.co/spaces/awacke1/Arxiv-Paper-Search-And-QA-RAG-Pattern
95
+ 18. https://huggingface.co/spaces/awacke1/AnthropicClaude3.5Sonnet-ACW
96
+ 19. https://huggingface.co/spaces/awacke1/MusicGenStreamFacebook
97
+ 20. https://huggingface.co/spaces/awacke1/AIKnowledgeTreeBuilder
98
+ 21. https://huggingface.co/spaces/awacke1/RescuerOfStolenBikes
99
+ 22. https://huggingface.co/spaces/awacke1/PDF-text-to-speech-Per-Page
100
+ 23. https://huggingface.co/spaces/awacke1/AzureCosmosDBUI
101
+ 24. https://huggingface.co/spaces/awacke1/MSGraphAPI
102
+ 25. https://huggingface.co/spaces/awacke1/RealTimeAsyncASR
103
+ 26. https://huggingface.co/spaces/awacke1/whisper-web
104
+ 27. https://huggingface.co/spaces/awacke1/MarkdownMagicEditor.md
105
+ 28. https://huggingface.co/spaces/awacke1/EZ-Voice-Clone-From-Long-Text
106
+ 29. https://huggingface.co/spaces/awacke1/Edge-TTS-Text-to-Speech
107
+ 30. https://huggingface.co/spaces/awacke1/StreamlitDotEdgeGraphViz-Images-SVG (Image button and rounded buttons)
108
+ 31. https://huggingface.co/spaces/awacke1/DynamicMapCreator2
109
+ 32. https://huggingface.co/spaces/awacke1/RealTimeAsyncASR
110
+ 33. https://huggingface.co/spaces/awacke1/realtime-whisper-webgpu
111
+ 34. https://huggingface.co/collections/Xenova/transformersjs-demos-64f9c4f49c099d93dbc611df
112
+ 35. https://huggingface.co/spaces/awacke1/Models-Datasets-Spaces-Search-Hub
113
+ 36. https://huggingface.co/spaces/awacke1/AutoMLUsingStreamlit-Plotly (CSV to Plotly Graph Objects)
114
+ 37. https://huggingface.co/spaces/awacke1/Multiplayer-Eval-App-Upvote
115
+ 38. https://huggingface.co/spaces/awacke1/AI-MovieMaker-Comedy
116
+ 39. https://huggingface.co/spaces/awacke1/Multiplayer-Image-Evaluate-Upvote
117
+ 40. https://huggingface.co/spaces/awacke1/Multiplayer-USMLE-Story-Voting-Evals
118
+ 41. https://huggingface.co/spaces/awacke1/Multiplayer-RLHF-Evals
119
+ 42. https://huggingface.co/spaces/awacke1/RLHF.Reinforce.Learn.With.Human.Feedback
120
+ 43. https://huggingface.co/spaces/awacke1/MultiuserFarmSimulator
121
+ 44. https://huggingface.co/spaces/awacke1/SocketIO-Multiplayer-DrawingGame
122
+ 45. https://huggingface.co/spaces/awacke1/Multiplayer-Self-Play-Sankey-Simulator
123
+ 46. https://huggingface.co/spaces/awacke1/Multiplayer-Action-Battle
124
+ 47. https://huggingface.co/spaces/awacke1/Multiplayer-Image-Character-Terrain
125
+ 48. https://huggingface.co/spaces/awacke1/MultiplayerMapMovement
126
+ 49. https://huggingface.co/spaces/awacke1/MultiplayerDiceGameFromGrok
127
+ 50. https://huggingface.co/spaces/awacke1/AaronWackerMusic
128
+
129
+
130
  # Architecture to AI Space
131
 
132
  # πŸ€– AI Types and Capabilities Classification
 
272
  - `RLHF.Reinforce.Learn.With.Human.Feedback`
273
 
274
 
275
+ # AI Architecture Synopsis for GPU Spaces:
276
 
277
+ Below is a grouped synopsis of the provided dataset, highlighting common model families, tasks, and modalities. We’ve identified 10 thematic clusters, collapsed similar entries, and tallied their approximate counts. Each section includes representative examples and a brief summary:
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
278
 
279
+ ---
280
 
281
+ ### 1. **Llama-Based Language Models** πŸ¦™πŸ“š
282
+ - **Count:** ~10+
283
+ - **Examples:**
284
+ - *Llama 2 7B/13B Chat*, *Llama 3.2V 11B Cot*, *Llava Llama-3 8B*, *Llama-Vision-11B*
285
+ - **Note:**
286
+ Models derived from or combined with LLaMA, often used for chat, reasoning, and multimodal tasks.
287
+
288
+ ### 2. **FLUX Models & Tools** βš‘πŸ–Ό
289
+ - **Count:** ~30+
290
+ - **Examples:**
291
+ - *FLUXllama*, *FLUX.1 [dev]*, *FLUX 8Steps LoRA*, *FLUX Prompt Generator*, *FLUX.1 Redux Dev*
292
+ - **Note:**
293
+ A large ecosystem of FLUX-based image generation tools, LoRAs (Low-Rank Adapters), and control variations, emphasizing fast, high-quality image outputs and model refinements.
294
+
295
+ ### 3. **Stable Diffusion & Derivatives** πŸŽ¨πŸƒ
296
+ - **Count:** ~20+
297
+ - **Examples:**
298
+ - *Stable Diffusion 3.5 Large*, *Stable Diffusion 3 Medium*, *Stable Audio Open*, *StableDelight*, *Stable Design*
299
+ - **Note:**
300
+ Numerous text-to-image and image-to-image tools built on Stable Diffusion technology, including audio and video extensions. Often offer rapid generation and style customization.
301
+
302
+ ### 4. **TTS & Audio Generation/Processing** πŸŽ€πŸ”Š
303
+ - **Count:** ~25+
304
+ - **Examples:**
305
+ - *F5-TTS*, *Spanish F5*, *Sound AI SFX*, *Multi Parler-TTS*, *Kokoro TTS*, *Voice Clone*
306
+ - **Note:**
307
+ Text-to-speech models, voice cloning, audio upscaling, and SFX generation demos. Many support zero-shot voice cloning and multilingual options.
308
+
309
+ ### 5. **Video Generation & Editing** πŸŽ₯⚑
310
+ - **Count:** ~20+
311
+ - **Examples:**
312
+ - *Stable Video Diffusion*, *MMAudio β€” generating synchronized audio from video/text*, *Instant Video*, *Video Background Removal*, *Animagine XL 3.1*
313
+ - **Note:**
314
+ Tools for creating or editing video content from text prompts, applying style transfers, removing backgrounds, and generating synchronized audio-visual experiences.
315
+
316
+ ### 6. **Image Editing, Inpainting & Upscaling** πŸ–ŒοΈπŸ”
317
+ - **Count:** ~25+
318
+ - **Examples:**
319
+ - *Diffusers Image Outpaint*, *Background Removal*, *Finegrain Image Enhancer*, *IC Light*, *InstantMesh*, *GiniGen Canvas*, *PhotoMaker V2*
320
+ - **Note:**
321
+ A broad set of image refinement tools: removing backgrounds, outpainting, upscaling, enhancing clarity, inpainting, and object-specific edits.
322
+
323
+ ### 7. **Document Parsing, OCR & Retrieval** πŸ“„πŸ”Ž
324
+ - **Count:** ~15+
325
+ - **Examples:**
326
+ - *Document Parser*, *PDF to Markdown*, *OmniParser demo*, *DocLayout YOLO*, *Newborn Article Impact Predict*, *RAG-Chatbot*
327
+ - **Note:**
328
+ Models dedicated to extracting information from documents, performing OCR, summarizing or querying text, and turning PDFs into searchable or structured formats.
329
+
330
+ ### 8. **Vision-Language & Multimodal Models** πŸ§©πŸ‘
331
+ - **Count:** ~30+
332
+ - **Examples:**
333
+ - *Qwen2-VL-7B*, *Llava Video*, *BLIP2*, *Florence 2*, *OpenGPT 4o*, *MMAudio*, *Paligemma2 Vqav2*
334
+ - **Note:**
335
+ Models that combine text, images, audio, and sometimes video to answer questions, perform captioning, generate rich content, and handle complex multimodal tasks.
336
+
337
+ ### 9. **3D Generation & Geometry Processing** 🌐🦾
338
+ - **Count:** ~15+
339
+ - **Examples:**
340
+ - *Make It Animatable*, *MeshAnything*, *3D-GRAND*, *Unique3D*, *Shap-E*, *Instant Text-to-3D Mesh Demo*
341
+ - **Note:**
342
+ Tools for converting images or text into 3D meshes, animating characters, and generating or refining 3D scenes. Often build on diffusion or 3D native generation methods.
343
+
344
+ ### 10. **Developer & Coding Tools** πŸ‘¨πŸ»β€πŸ’»πŸ”§
345
+ - **Count:** ~10+
346
+ - **Examples:**
347
+ - *OpenCoder 8B Instruct*, *Code Review debug*, *Screenshot to HTML*, *Fake Data Generator (JSONL)*, *Beam Search Visualizer*
348
+ - **Note:**
349
+ A set of utilities for code generation, debugging, embeddings, dataset creation, and general developer-centric workflows.
350
 
351
+ ---
352
 
353
+ ### Overall Synopsis
354
+ - **Common Models:**
355
+ - **LLaMA-based:** Widely used for language and multimodal capabilities.
356
+ - **Stable Diffusion & FLUX:** Predominant for image, audio, and video generation tasks.
357
+ - **TTS & Voice Conversion:** Multiple frameworks offering multilingual and zero-shot cloning.
358
 
359
+ - **Common Terms:**
360
+ - **LoRA (Low-Rank Adaptation), ControlNet, Inpainting, Upscaling:** Frequent technical keywords pointing to image refinements and personalization.
361
+ - **RAG (Retrieval Augmented Generation), OCR, Document Parser:** Indicative of text/data extraction and retrieval functionalities.
362
+ - **Multimodal, Vision-Language, 3D Generation:** Emphasizes integrating multiple data types (text, image, audio, video, 3D).
363
 
364
+ - **Common Modalities:**
365
+ - **Text-to-Image/Video/Audio** conversions prevalent throughout.
366
+ - **Image & Video Editing, Restoration, and Manipulation** capabilities are abundant.
367
+ - **Document, OCR, and Data Processing** tasks show a strong presence.
368
 
369
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
370
 
371
 
372