eval
Browse files
README.md
CHANGED
@@ -34,7 +34,9 @@ New models are released for everyone once our team's training and validation pro
|
|
34 |
|
35 |
## Evaluation
|
36 |
|
37 |
-
|
|
|
|
|
38 |
|
39 |
## Prompting Guide
|
40 |
Shining Valiant XS uses the same prompt format as Llama 2 Chat - feel free to use your existing prompts and scripts!
|
|
|
34 |
|
35 |
## Evaluation
|
36 |
|
37 |
+
| Model | Avg | ARC | HS | MMLU | TQA | WG | GSM |
|
38 |
+
|-----------------------|--------|-------|-------|--------|-------|-------|-------|
|
39 |
+
| **Shining Valiant XS 1.1** | 62.48 | 64.42 | 83.58 | 60.37 | 55.00 | 76.80 | 34.72 |
|
40 |
|
41 |
## Prompting Guide
|
42 |
Shining Valiant XS uses the same prompt format as Llama 2 Chat - feel free to use your existing prompts and scripts!
|