EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper โข 2402.17485 โข Published Feb 27 โข 190
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper โข 2402.16153 โข Published Feb 25 โข 56
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration Paper โข 2306.09093 โข Published Jun 15, 2023 โข 15