**Llama3.1-Typhoon2-8B-instruct** is an instruct Thai 🇹🇭 large language model with 8 billion parameters, built on Llama3.1-8B.
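
Since the model keeps the standard Llama 3.1 chat format, it can be run through the usual Hugging Face transformers workflow. The snippet below is a minimal sketch, not an official quickstart: the repository id is an assumption based on the model name, and the bf16/`device_map="auto"` settings are just reasonable defaults.

```python
# Minimal inference sketch using Hugging Face transformers.
# The repository id below is an assumption based on the model name; adjust if needed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "scb10x/llama3.1-typhoon2-8b-instruct"  # assumed repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # bf16 keeps the 8B weights at roughly 16 GB
    device_map="auto",
)

# The tokenizer ships with the Llama 3.1 chat template, so prompts can be
# built with apply_chat_template rather than hand-formatted special tokens.
messages = [{"role": "user", "content": "แนะนำตัวหน่อย"}]  # "Introduce yourself"
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```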
## Performance

**Instruction-Following & Function Call Performance**

<div align="center">
<img src="https://storage.googleapis.com/typhoon-public/assets/typhoon2-text/llama7b_general.png" alt="Typhoon2 Llama 8B General Performance" width="100%" style="margin-left:auto; margin-right:auto; display:block;"/>
</div>

**Specific Domain Performance (Math & Coding)**

<div align="center">
<img src="https://storage.googleapis.com/typhoon-public/assets/typhoon2-text/llama7b_specific.png" alt="Typhoon2 Llama 8B Specific Domain Performance" width="100%" style="margin-left:auto; margin-right:auto; display:block;"/>
</div>

**Long Context Performance**

<div align="center">
<img src="https://storage.googleapis.com/typhoon-public/assets/typhoon2-text/llama7b_long.jpg" alt="Typhoon2 Llama 8B Long Context Performance" width="100%" style="margin-left:auto; margin-right:auto; display:block;"/>
</div>

**Detailed Performance**

| Model | IFEval - TH | IFEval - EN | MT-Bench TH | MT-Bench EN | Thai Code-Switching (t=0.7) | Thai Code-Switching (t=1.0) | FunctionCall-TH | FunctionCall-EN | GSM8K-TH | GSM8K-EN | MATH-TH | MATH-EN | HumanEval-TH | HumanEval-EN | MBPP-TH | MBPP-EN |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| **Llama3.1 8B Instruct** | 58.04% | **77.64%** | 5.109 | **8.118** | 93% | 11.2% | 36.92% | 66.06% | 45.18% | 62.4% | 24.42% | 48% | 51.8% | 67.7% | **64.6%** | **66.9%** |
| **Typhoon2 Llama3 8B Instruct** | **72.60%** | 76.43% | **5.7417** | 7.584 | **98.8%** | **98%** | **75.12%** | **79.08%** | **71.72%** | **81.0%** | **38.48%** | **49.04%** | **58.5%** | **68.9%** | 60.8% | 63.0% |

For the release post, please see our [blog](...).

*To acknowledge Meta's effort in creating the foundation model and to comply with the license, we explicitly include "llama-3.1" in the model name.