Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation Paper • 1909.11065 • Published Sep 24, 2019
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Paper • 2311.18829 • Published Nov 30, 2023 • 1
Exploring Predicate Visual Context in Detecting of Human-Object Interactions Paper • 2308.06202 • Published Aug 11, 2023
Mask-Attention-Free Transformer for 3D Instance Segmentation Paper • 2309.01692 • Published Sep 4, 2023
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning Paper • 2210.01035 • Published Oct 3, 2022
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering Paper • 2406.10208 • Published Jun 14, 2024 • 22
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators Paper • 2408.05710 • Published Aug 11, 2024 • 2
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing Paper • 2403.14487 • Published Mar 21, 2024 • 1
ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models Paper • 2311.18834 • Published Nov 30, 2023
Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision Paper • 2106.01226 • Published Jun 2, 2021
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published Feb 25 • 36
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation Paper • 2503.20672 • Published 13 days ago • 13
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation Paper • 2503.20672 • Published 13 days ago • 13
FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation Paper • 2406.08392 • Published Jun 12, 2024 • 21