sdtana's picture

sdtana

sdtana

·

roxani_17

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

upvoted a paper 28 days ago

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

upvoted a paper about 1 month ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

View all activity

Organizations

sdtana's activity

upvoted a paper 12 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 14 days ago • 27

upvoted a paper 28 days ago

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published about 1 month ago • 43

upvoted 3 papers about 1 month ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published Feb 6 • 35

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Paper • 2502.01105 • Published Feb 3 • 20

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Paper • 2502.01639 • Published Feb 3 • 25