Transcribe audio in realtime with Whisper
Generate customized portraits using ID images and prompts
Audio Conditioned LipSync with Latent Diffusion Models
Virtual Doctor Assistant.