Image-to-Image
Diffusers
English
flux-ip-adapter-v2 / README.md
stazizov's picture
Update README.md
5bb75f9 verified
|
raw
history blame
2.78 kB
metadata
license: apache-2.0
datasets:
  - CaptionEmporium/coyo-hd-11m-llavanext
  - CortexLM/midjourney-v6
language:
  - en
base_model:
  - black-forest-labs/FLUX.1-dev
new_version: XLabs-AI/flux-ip-adapter
pipeline_tag: image-to-image
library_name: diffusers

Banner Picture 1 Mona Anime Workflow 1

This repository provides a IP-Adapter checkpoint for FLUX.1-dev model by Black Forest Labs

See our github for comfy ui workflows.

Models

The IP adapter is trained on a resolution of 512x512 for 150k steps and 1024x1024 for 350k steps while maintaining the aspect ratio. We release v2 version - which can be used directly in ComfyUI!

Please, see our ComfyUI custom nodes installation guide

Examples

See examples of our models results below.
Also, some generation results with input images are provided in "Files and versions"

Inference

To try our models, you have 2 options:

  1. Use main.py from our official repo
  2. Use our custom nodes for ComfyUI and test it with provided workflows (check out folder /workflows)

Instruction for ComfyUI

  1. Go to ComfyUI/custom_nodes
  2. Clone x-flux-comfyui, path should be ComfyUI/custom_nodes/x-flux-comfyui/*, where * is all the files in this repo
  3. Go to ComfyUI/custom_nodes/x-flux-comfyui/ and run python setup.py
  4. Update x-flux-comfy with git pull or reinstall it.
  5. Download Clip-L model.safetensors from OpenAI VIT CLIP large, and put it to ComfyUI/models/clip_vision/*.
  6. Download our IPAdapter from huggingface, and put it to ComfyUI/models/xlabs/ipadapters/*.
  7. Use Flux Load IPAdapter and Apply Flux IPAdapter nodes, choose right CLIP model and enjoy your genereations.
  8. You can find example workflow in folder workflows in this repo.

If you get bad results, try to set true_gs=2

Limitations

The IP Adapter is currently in beta. We do not guarantee that you will get a good result right away, it may take more attempts to get a result. Example Picture 2 Example Picture 1

License

Our weights fall under the FLUX.1 [dev] Non-Commercial License