Audio-Conditioned LipSync with Latent Diffusion Models
Convert text to speech with adjustable settings