File size: 257 Bytes
a73d03f 9ab7dfd a73d03f b867139 9ab7dfd |
1 2 3 4 5 6 7 8 |
---
license: mit
tags:
- video-to-audio
---
This repository contains the model described in [Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis](https://huggingface.co/papers/2412.15322).
Code: https://github.com/hkchengrex/MMAudio. |