Update README.md
README.md
CHANGED
@@ -16,7 +16,7 @@ tags:
 These are model weights originally provided by the authors of the paper [Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction](https://arxiv.org/pdf/2201.02184.pdf).
 
 <figure>
-<img src="https://huggingface.co/vumichien/AV-HuBERT/
+<img src="https://huggingface.co/vumichien/AV-HuBERT/resolve/main/HuBert.png" alt="Audio-visual HuBERT">
 <figcaption>Audio-visual HuBERT
 </figcaption>
 </figure>