scb10x
/

llama-3-typhoon-v1.5-8b-audio-preview

Text Generation

feature-extraction

Model card Files Files and versions Community

potsawee commited on Nov 20, 2024

Commit

346bf2e

•

1 Parent(s): 570bf15

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ pipeline_tag: text-generation
 **llama-3-typhoon-v1.5-8b-audio-preview** is a 🇹🇭 Thai *audio-language* model. It supports both text and audio input modalities natively while the output is text. This version (August 2024) is our first audio-language model as a part of our multimodal effort, and it is a research *preview* version. The base language model is our [llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct).
-More details can be found in our [release blog](https://blog.opentyphoon.ai/typhoon-audio-preview-release-6fbb3f938287) and [technical report](https://arxiv.org/abs/2409.10999). *To acknowledge Meta's effort in creating the foundation model and to comply with the license, we explicitly include "llama-3" in the model name.
 ## Model Description
@@ -66,7 +66,7 @@ print(response)
 - streamer (`TextIteratorStreamer`, *optional*, defaults to `None`) -- this allows streaming output
 ## Evaluation Results
-More information is provided in our [release blog](https://blog.opentyphoon.ai/typhoon-audio-preview-release-6fbb3f938287).
 | Model                       | ASR-en (WER↓)      | ASR-th (WER↓) | En2Th (BLEU↑) | X2Th (BLEU↑) | Th2En (BLEU↑) |
 |:----------------------------|:-------------------|:--------------|:--------------|:-------------|:--------------|
 | SALMONN-13B                 | 5.79      | 98.07         | 0.07         | 0.10        | 14.97        |

 **llama-3-typhoon-v1.5-8b-audio-preview** is a 🇹🇭 Thai *audio-language* model. It supports both text and audio input modalities natively while the output is text. This version (August 2024) is our first audio-language model as a part of our multimodal effort, and it is a research *preview* version. The base language model is our [llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct).
+More details can be found in our [technical report](https://arxiv.org/abs/2409.10999). *To acknowledge Meta's effort in creating the foundation model and to comply with the license, we explicitly include "llama-3" in the model name.
 ## Model Description
 - streamer (`TextIteratorStreamer`, *optional*, defaults to `None`) -- this allows streaming output
 ## Evaluation Results
+More information is provided in our [technical report](https://arxiv.org/abs/2409.10999).
 | Model                       | ASR-en (WER↓)      | ASR-th (WER↓) | En2Th (BLEU↑) | X2Th (BLEU↑) | Th2En (BLEU↑) |
 |:----------------------------|:-------------------|:--------------|:--------------|:-------------|:--------------|
 | SALMONN-13B                 | 5.79      | 98.07         | 0.07         | 0.10        | 14.97        |