Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,11 @@ tags:
|
|
9 |
licence: license
|
10 |
---
|
11 |
|
|
|
|
|
|
|
|
|
|
|
12 |
# Model Card for self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5
|
13 |
|
14 |
This model is a fine-tuned version of [RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4](https://huggingface.co/RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4).
|
|
|
9 |
licence: license
|
10 |
---
|
11 |
|
12 |
+
Accuracy@0: 0.468
|
13 |
+
Accuracy@1: 0.43
|
14 |
+
Correct->Incorrect: 0.054
|
15 |
+
Incorrect->Correct: 0.016
|
16 |
+
|
17 |
# Model Card for self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5
|
18 |
|
19 |
This model is a fine-tuned version of [RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4](https://huggingface.co/RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4).
|