Update README.md
Browse files
README.md
CHANGED
@@ -47,6 +47,16 @@ We evaluated this model on the [VLMU benchmark](https://vmlu.ai/) and achieved a
|
|
47 |
|
48 |
Function Calling Benchmark: https://huggingface.co/datasets/phamhai/Vietnamese-Function-Calling-Test
|
49 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
|
51 |
<h2> Run the model </h2>
|
52 |
|
|
|
47 |
|
48 |
Function Calling Benchmark: https://huggingface.co/datasets/phamhai/Vietnamese-Function-Calling-Test
|
49 |
|
50 |
+
| Model | Model size | Function name Acc (%) | Exact Match Acc (%)
|
51 |
+
| ------------ | ------------------ | ---------- | --------- |
|
52 |
+
| [phamhai/Llama-3.2-3B-Instruct-Frog](https://huggingface.co/phamhai/Llama-3.2-3B-Instruct-Frog) | ~3B | 95.79 | 47.78 |
|
53 |
+
| [Gemini-1.5-Pro](https://deepmind.google/technologies/gemini/pro/) | --- | 96.96 | 55.16 |
|
54 |
+
| [Gemini-1.5-Flash](https://deepmind.google/technologies/gemini/flash/) | --- | 97.10 | 51.64 |
|
55 |
+
| [Gemini-1.5-Flash-8B](https://deepmind.google/technologies/gemini/flash/) | --- | 97.38 | 64.75 |
|
56 |
+
| [gpt-4o-2024-08-06](https://platform.openai.com/docs/models#gpt-4o) | --- | 94.38 | 52.88 |
|
57 |
+
| [arcee-ai/Arcee-VyLinh](https://huggingface.co/arcee-ai/Arcee-VyLinh) | ~3B | | |
|
58 |
+
|
59 |
+
<p align="left"> Table 1. [Vietnamese Function Calling Benchmark](https://huggingface.co/datasets/phamhai/Vietnamese-Function-Calling-Test) </p>
|
60 |
|
61 |
<h2> Run the model </h2>
|
62 |
|