Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ pipeline_tag: text-generation
|
|
11 |
|
12 |
**llama-3-typhoon-v1.5-8b-audio-preview** is a 🇹🇭 Thai *audio-language* model. It supports both text and audio input modalities natively while the output is text. This version (August 2024) is our first audio-language model as a part of our multimodal effort, and it is a research *preview* version. The base language model is our [llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct).
|
13 |
|
14 |
-
More details can be found in our [
|
15 |
|
16 |
## Model Description
|
17 |
|
@@ -66,7 +66,7 @@ print(response)
|
|
66 |
- streamer (`TextIteratorStreamer`, *optional*, defaults to `None`) -- this allows streaming output
|
67 |
|
68 |
## Evaluation Results
|
69 |
-
More information is provided in our [
|
70 |
| Model | ASR-en (WER↓) | ASR-th (WER↓) | En2Th (BLEU↑) | X2Th (BLEU↑) | Th2En (BLEU↑) |
|
71 |
|:----------------------------|:-------------------|:--------------|:--------------|:-------------|:--------------|
|
72 |
| SALMONN-13B | 5.79 | 98.07 | 0.07 | 0.10 | 14.97 |
|
|
|
11 |
|
12 |
**llama-3-typhoon-v1.5-8b-audio-preview** is a 🇹🇭 Thai *audio-language* model. It supports both text and audio input modalities natively while the output is text. This version (August 2024) is our first audio-language model as a part of our multimodal effort, and it is a research *preview* version. The base language model is our [llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct).
|
13 |
|
14 |
+
More details can be found in our [technical report](https://arxiv.org/abs/2409.10999). *To acknowledge Meta's effort in creating the foundation model and to comply with the license, we explicitly include "llama-3" in the model name.
|
15 |
|
16 |
## Model Description
|
17 |
|
|
|
66 |
- streamer (`TextIteratorStreamer`, *optional*, defaults to `None`) -- this allows streaming output
|
67 |
|
68 |
## Evaluation Results
|
69 |
+
More information is provided in our [technical report](https://arxiv.org/abs/2409.10999).
|
70 |
| Model | ASR-en (WER↓) | ASR-th (WER↓) | En2Th (BLEU↑) | X2Th (BLEU↑) | Th2En (BLEU↑) |
|
71 |
|:----------------------------|:-------------------|:--------------|:--------------|:-------------|:--------------|
|
72 |
| SALMONN-13B | 5.79 | 98.07 | 0.07 | 0.10 | 14.97 |
|