phamhai commited on
Commit
559daae
1 Parent(s): 724a9fb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md CHANGED
@@ -47,6 +47,16 @@ We evaluated this model on the [VLMU benchmark](https://vmlu.ai/) and achieved a
47
 
48
  Function Calling Benchmark: https://huggingface.co/datasets/phamhai/Vietnamese-Function-Calling-Test
49
 
 
 
 
 
 
 
 
 
 
 
50
 
51
  <h2> Run the model </h2>
52
 
 
47
 
48
  Function Calling Benchmark: https://huggingface.co/datasets/phamhai/Vietnamese-Function-Calling-Test
49
 
50
+ | Model | Model size | Function name Acc (%) | Exact Match Acc (%)
51
+ | ------------ | ------------------ | ---------- | --------- |
52
+ | [phamhai/Llama-3.2-3B-Instruct-Frog](https://huggingface.co/phamhai/Llama-3.2-3B-Instruct-Frog) | ~3B | 95.79 | 47.78 |
53
+ | [Gemini-1.5-Pro](https://deepmind.google/technologies/gemini/pro/) | --- | 96.96 | 55.16 |
54
+ | [Gemini-1.5-Flash](https://deepmind.google/technologies/gemini/flash/) | --- | 97.10 | 51.64 |
55
+ | [Gemini-1.5-Flash-8B](https://deepmind.google/technologies/gemini/flash/) | --- | 97.38 | 64.75 |
56
+ | [gpt-4o-2024-08-06](https://platform.openai.com/docs/models#gpt-4o) | --- | 94.38 | 52.88 |
57
+ | [arcee-ai/Arcee-VyLinh](https://huggingface.co/arcee-ai/Arcee-VyLinh) | ~3B | | |
58
+
59
+ <p align="left"> Table 1. [Vietnamese Function Calling Benchmark](https://huggingface.co/datasets/phamhai/Vietnamese-Function-Calling-Test) </p>
60
 
61
  <h2> Run the model </h2>
62