44
Diception Demo
π₯
A Generalist Diffusion Model for Vision Perception
Generate structured video chapters from transcripts
Replace faces in videos with images
Optical Flow Propagation on an Image
Framer: Interactive Frame Interpolation
Generate speaker-specific clips from a video
High-fidelity Text-To-Speech
Transcribe audio from microphone, file, or YouTube link
Media understanding