Virtual try-on for clothes on a person
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Audio-based Lip Sync for Talking Head Video Editing
Audio Conditioned LipSync with Latent Diffusion Models
Generate music from text and melody descriptions