Spaces:
Running
on
Zero
Running
on
Zero
<!--Copyright 2023 The HuggingFace Team. All rights reserved. | |
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with | |
the License. You may obtain a copy of the License at | |
http://www.apache.org/licenses/LICENSE-2.0 | |
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on | |
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the | |
specific language governing permissions and limitations under the License. | |
--> | |
# How to build a community pipeline | |
*Note*: this page was built from the GitHub Issue on Community Pipelines [#841](https://github.com/huggingface/diffusers/issues/841). | |
Let's make an example! | |
Say you want to define a pipeline that just does a single forward pass to a U-Net and then calls a scheduler only once (Note, this doesn't make any sense from a scientific point of view, but only represents an example of how things work under the hood). | |
Cool! So you open your favorite IDE and start creating your pipeline π». | |
First, what model weights and configurations do we need? | |
We have a U-Net and a scheduler, so our pipeline should take a U-Net and a scheduler as an argument. | |
Also, as stated above, you'd like to be able to load weights and the scheduler config for Hub and share your code with others, so we'll inherit from `DiffusionPipeline`: | |
```python | |
from diffusers import DiffusionPipeline | |
import torch | |
class UnetSchedulerOneForwardPipeline(DiffusionPipeline): | |
def __init__(self, unet, scheduler): | |
super().__init__() | |
``` | |
Now, we must save the `unet` and `scheduler` in a config file so that you can save your pipeline with `save_pretrained`. | |
Therefore, make sure you add every component that is save-able to the `register_modules` function: | |
```python | |
from diffusers import DiffusionPipeline | |
import torch | |
class UnetSchedulerOneForwardPipeline(DiffusionPipeline): | |
def __init__(self, unet, scheduler): | |
super().__init__() | |
self.register_modules(unet=unet, scheduler=scheduler) | |
``` | |
Cool, the init is done! π₯ Now, let's go into the forward pass, which we recommend defining as `__call__` . Here you're given all the creative freedom there is. For our amazing "one-step" pipeline, we simply create a random image and call the unet once and the scheduler once: | |
```python | |
from diffusers import DiffusionPipeline | |
import torch | |
class UnetSchedulerOneForwardPipeline(DiffusionPipeline): | |
def __init__(self, unet, scheduler): | |
super().__init__() | |
self.register_modules(unet=unet, scheduler=scheduler) | |
def __call__(self): | |
image = torch.randn( | |
(1, self.unet.in_channels, self.unet.sample_size, self.unet.sample_size), | |
) | |
timestep = 1 | |
model_output = self.unet(image, timestep).sample | |
scheduler_output = self.scheduler.step(model_output, timestep, image).prev_sample | |
return scheduler_output | |
``` | |
Cool, that's it! π You can now run this pipeline by passing a `unet` and a `scheduler` to the init: | |
```python | |
from diffusers import DDPMScheduler, Unet2DModel | |
scheduler = DDPMScheduler() | |
unet = UNet2DModel() | |
pipeline = UnetSchedulerOneForwardPipeline(unet=unet, scheduler=scheduler) | |
output = pipeline() | |
``` | |
But what's even better is that you can load pre-existing weights into the pipeline if they match exactly your pipeline structure. This is e.g. the case for [https://huggingface.co/google/ddpm-cifar10-32](https://huggingface.co/google/ddpm-cifar10-32) so that we can do the following: | |
```python | |
pipeline = UnetSchedulerOneForwardPipeline.from_pretrained("google/ddpm-cifar10-32") | |
output = pipeline() | |
``` | |
We want to share this amazing pipeline with the community, so we would open a PR request to add the following code under `one_step_unet.py` to [https://github.com/huggingface/diffusers/tree/main/examples/community](https://github.com/huggingface/diffusers/tree/main/examples/community) . | |
```python | |
from diffusers import DiffusionPipeline | |
import torch | |
class UnetSchedulerOneForwardPipeline(DiffusionPipeline): | |
def __init__(self, unet, scheduler): | |
super().__init__() | |
self.register_modules(unet=unet, scheduler=scheduler) | |
def __call__(self): | |
image = torch.randn( | |
(1, self.unet.in_channels, self.unet.sample_size, self.unet.sample_size), | |
) | |
timestep = 1 | |
model_output = self.unet(image, timestep).sample | |
scheduler_output = self.scheduler.step(model_output, timestep, image).prev_sample | |
return scheduler_output | |
``` | |
Our amazing pipeline got merged here: [#840](https://github.com/huggingface/diffusers/pull/840). | |
Now everybody that has `diffusers >= 0.4.0` installed can use our pipeline magically πͺ as follows: | |
```python | |
from diffusers import DiffusionPipeline | |
pipe = DiffusionPipeline.from_pretrained("google/ddpm-cifar10-32", custom_pipeline="one_step_unet") | |
pipe() | |
``` | |
Another way to upload your custom_pipeline, besides sending a PR, is uploading the code that contains it to the Hugging Face Hub, [as exemplified here](https://huggingface.co/docs/diffusers/using-diffusers/custom_pipeline_overview#loading-custom-pipelines-from-the-hub). | |
**Try it out now - it works!** | |
In general, you will want to create much more sophisticated pipelines, so we recommend looking at existing pipelines here: [https://github.com/huggingface/diffusers/tree/main/examples/community](https://github.com/huggingface/diffusers/tree/main/examples/community). | |
IMPORTANT: | |
You can use whatever package you want in your community pipeline file - as long as the user has it installed, everything will work fine. Make sure you have one and only one pipeline class that inherits from `DiffusionPipeline` as this will be automatically detected. | |
## How do community pipelines work? | |
A community pipeline is a class that has to inherit from ['DiffusionPipeline']: | |
and that has been added to `examples/community` [files](https://github.com/huggingface/diffusers/tree/main/examples/community). | |
The community can load the pipeline code via the custom_pipeline argument from DiffusionPipeline. See docs [here](https://huggingface.co/docs/diffusers/api/diffusion_pipeline#diffusers.DiffusionPipeline.from_pretrained.custom_pipeline): | |
This means: | |
The model weights and configs of the pipeline should be loaded from the `pretrained_model_name_or_path` [argument](https://huggingface.co/docs/diffusers/api/diffusion_pipeline#diffusers.DiffusionPipeline.from_pretrained.pretrained_model_name_or_path): | |
whereas the code that powers the community pipeline is defined in a file added in [`examples/community`](https://github.com/huggingface/diffusers/tree/main/examples/community). | |
Now, it might very well be that only some of your pipeline components weights can be downloaded from an official repo. | |
The other components should then be passed directly to init as is the case for the ClIP guidance notebook [here](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/CLIP_Guided_Stable_diffusion_with_diffusers.ipynb#scrollTo=z9Kglma6hjki). | |
The magic behind all of this is that we load the code directly from GitHub. You can check it out in more detail if you follow the functionality defined here: | |
```python | |
# 2. Load the pipeline class, if using custom module then load it from the hub | |
# if we load from explicit class, let's use it | |
if custom_pipeline is not None: | |
pipeline_class = get_class_from_dynamic_module( | |
custom_pipeline, module_file=CUSTOM_PIPELINE_FILE_NAME, cache_dir=custom_pipeline | |
) | |
elif cls != DiffusionPipeline: | |
pipeline_class = cls | |
else: | |
diffusers_module = importlib.import_module(cls.__module__.split(".")[0]) | |
pipeline_class = getattr(diffusers_module, config_dict["_class_name"]) | |
``` | |
This is why a community pipeline merged to GitHub will be directly available to all `diffusers` packages. | |