microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 11 days ago • 620k • 1.31k
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 213
ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition Audio Classification • Updated Oct 24, 2024 • 64.6k • 218