PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published 13 days ago • 17
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Paper • 2501.11858 • Published 20 days ago • 5
Efficient-Large-Model/Sana_600M_1024px_diffusers Text-to-Image • Updated about 1 month ago • 27 • 2
Efficient-Large-Model/Sana_1600M_512px_MultiLing_diffusers Text-to-Image • Updated about 1 month ago • 1
Efficient-Large-Model/Sana_1600M_4Kpx_BF16_diffusers Text-to-Image • Updated about 1 month ago • 7
Efficient-Large-Model/Sana_1600M_2Kpx_BF16_diffusers Text-to-Image • Updated about 1 month ago • 7
Efficient-Large-Model/Sana_1600M_1024px_diffusers Text-to-Image • Updated about 1 month ago • 34 • 15
Efficient-Large-Model/Sana_1600M_1024px_MultiLing_diffusers Text-to-Image • Updated about 1 month ago • 9 • 2
Efficient-Large-Model/Sana_1600M_512px_MultiLing Text-to-Image • Updated about 1 month ago • 55 • 15
Efficient-Large-Model/Sana_1600M_4Kpx_BF16 Text-to-Image • Updated about 1 month ago • 1.3k • 27
Efficient-Large-Model/Sana_1600M_2Kpx_BF16 Text-to-Image • Updated about 1 month ago • 1.77k • 11
Efficient-Large-Model/Sana_1600M_1024px_BF16 Text-to-Image • Updated about 1 month ago • 631 • 11