Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Mingwang Xu¹*, Hui Li¹*, Qingkun Su¹*, Hanlin Shang¹, Liwei Zhang¹, Ce Liu³, Jingdong Wang², Yao Yao⁴, Siyu Zhu¹
¹Fudan University  ²Baidu Inc  ³ETH Zurich  ⁴Nanjing University


Social Risks and Mitigations

Audio-driven portrait image animation poses social risks, most notably that realistic animated portraits could be misused to create deepfakes. Mitigating this risk requires clear ethical guidelines and responsible-use practices. Using individuals' images and voices also raises privacy and consent concerns, which call for transparent data usage policies, informed consent, and the safeguarding of privacy rights. By identifying these risks and implementing mitigations, this research aims to support the responsible and ethical development of the technology.

