Image - a netzkontrast Collection

netzkontrast 's Collections

music

LLMs

Speech

Lora

Video

Image

Image

updated 14 days ago

Customizing Text-to-Image Models with a Single Image Pair

Paper • 2405.01536 • Published May 2, 2024 • 20
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

Paper • 2404.03913 • Published Apr 5, 2024
LCM-Lookahead for Encoder-based Text-to-Image Personalization

Paper • 2404.03620 • Published Apr 4, 2024 • 1
Customizing Text-to-Image Diffusion with Camera Viewpoint Control

Paper • 2404.12333 • Published Apr 18, 2024 • 1
fka/awesome-chatgpt-prompts

Viewer • Updated 29 days ago • 203 • 8.97k • 7.3k
MohamedRashad/midjourney-detailed-prompts

Viewer • Updated Apr 24, 2024 • 3.05k • 81 • 51
jtatman/stable-diffusion-prompts-uncensored

Viewer • Updated Jan 4, 2024 • 852k • 55 • 15
Gustavosta/Stable-Diffusion-Prompts

Viewer • Updated Sep 18, 2022 • 81.9k • 3.38k • 460
succinctly/midjourney-prompts

Viewer • Updated Jul 22, 2022 • 246k • 143 • 96
succinctly/text2image-prompt-generator

Text Generation • Updated Aug 20, 2022 • 31.4k • 296
alespalla/chatbot_instruction_prompts

Viewer • Updated Oct 16, 2024 • 323k • 474 • 47
MohamedRashad/easy_imageinwords

Viewer • Updated May 13, 2024 • 2.4k • 53 • 3
vivym/midjourney-prompts

Viewer • Updated Nov 15, 2023 • 7.13M • 111 • 41
jtatman/stable-diffusion-prompts-stats-full-uncensored

Viewer • Updated Nov 8, 2024 • 897k • 156 • 60
Gustavosta/MagicPrompt-Dalle

Text Generation • Updated Mar 17, 2023 • 1.33k • 48
Running on Zero

718

😻

Omost
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 86
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs

Paper • 2408.11813 • Published Aug 21, 2024 • 12
TokenPacker: Efficient Visual Projector for Multimodal LLM

Paper • 2407.02392 • Published Jul 2, 2024 • 21
PALP: Prompt Aligned Personalization of Text-to-Image Models

Paper • 2401.06105 • Published Jan 11, 2024 • 49
Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 71
Training-Free Consistent Text-to-Image Generation

Paper • 2402.03286 • Published Feb 5, 2024 • 66
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications

Paper • 2408.03703 • Published Aug 7, 2024
AutoPresent: Designing Structured Visuals from Scratch

Paper • 2501.00912 • Published Jan 1 • 8
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 27 days ago • 48
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Paper • 2501.05452 • Published 25 days ago • 15
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors

Paper • 2501.08225 • Published 20 days ago • 18
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation

Paper • 2501.09503 • Published 19 days ago • 13