PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published 12 days ago • 15
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? Paper • 2412.18495 • Published 18 days ago • 8
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation Paper • 2411.08380 • Published Nov 13, 2024 • 25
aalonso-developer/vit-base-patch16-224-in21k-clothing-classifier Image Classification • Updated May 21, 2023 • 47 • 9
rvv-karma/Human-Action-Recognition-VIT-Base-patch16-224 Image Classification • Updated Dec 10, 2023 • 463 • 9
Article Fine-Tune ViT for Image Classification with 🤗 Transformers Feb 11, 2022 • 30