microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 2 hours ago • 7.35k • 507
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published 3 days ago • 29 • 3
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering Paper • 2406.10208 • Published Jun 14, 2024 • 22 • 2
FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation Paper • 2406.08392 • Published Jun 12, 2024 • 20
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Paper • 2406.04314 • Published Jun 6, 2024 • 28
playgroundai/playground-v2.5-1024px-aesthetic Text-to-Image • Updated Mar 15, 2024 • 281k • 708
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Paper • 2403.09622 • Published Mar 14, 2024 • 17