The MossFormer2_SS_16K model weights for 16 kHz speech separation in ClearerVoice-Studio repo.
This model is trained on large scale datasets inclduing open-sourced and private data.
It separates mixed-speaker speeches into individual speaker's speech.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.