Papers
arxiv:2411.09703

MagicQuill: An Intelligent Interactive Image Editing System

Published on Nov 14
ยท Submitted by akhaliq on Nov 15
#2 Paper of the day
Authors:
Yue Yu ,
,
,

Abstract

Image editing involves a variety of complex tasks and requires efficient and precise manipulation techniques. In this paper, we present MagicQuill, an integrated image editing system that enables swift actualization of creative ideas. Our system features a streamlined yet functionally robust interface, allowing for the articulation of editing operations (e.g., inserting elements, erasing objects, altering color) with minimal input. These interactions are monitored by a multimodal large language model (MLLM) to anticipate editing intentions in real time, bypassing the need for explicit prompt entry. Finally, we apply a powerful diffusion prior, enhanced by a carefully learned two-branch plug-in module, to process editing requests with precise control. Experimental results demonstrate the effectiveness of MagicQuill in achieving high-quality image edits. Please visit https://magic-quill.github.io to try out our system.

Community

Paper submitter
Paper author

TLDR: MagicQuill is an intelligent and interactive system achieving precise image editing.

Key Features: ๐Ÿ˜Ž User-friendly interface / ๐Ÿค– AI-powered suggestions / ๐ŸŽจ Precise local editing

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2411.09703 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2411.09703 in a Space README.md to link it from this page.

Collections including this paper 16