Update README.md
README.md
@@ -59,24 +59,33 @@ datasets:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-# verysmol_llama-
+# verysmol_llama-v11-KIx2
+
+## Model description
 
 This model is a fine-tuned version of [pszemraj/verysmol_llama-v10-rw3m_dd](https://huggingface.co/pszemraj/verysmol_llama-v10-rw3m_dd) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.8876
 - Accuracy: 0.4502
 
-## Model description
-
-More information needed
 
-##
+## evals
 
-
+`hf-causal-experimental (pretrained=pszemraj/verysmol_llama-v11-KIx2,revision=main,trust_remote_code=True,dtype='float'), limit: None, provide_description: False, num_fewshot: 0, batch_size: 16`
 
-
+| Task |Version| Metric | Value | |Stderr|
+|--------------|------:|--------|-------:|---|-----:|
+|arc_easy | 0|acc | 0.4024|± |0.0101|
+| | |acc_norm| 0.3788|± |0.0100|
+|boolq | 1|acc | 0.6199|± |0.0085|
+|lambada_openai| 0|ppl |111.9939|± |4.6906|
+| | |acc | 0.2354|± |0.0059|
+|openbookqa | 0|acc | 0.1440|± |0.0157|
+| | |acc_norm| 0.2760|± |0.0200|
+|piqa | 0|acc | 0.5713|± |0.0115|
+| | |acc_norm| 0.5664|± |0.0116|
+|winogrande | 0|acc | 0.5201|± |0.0140|
 
-More information needed
 
 ## Training procedure
 
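
A minimal sketch for reading the numbers in the added README. It assumes the reported `Loss` is a mean per-token cross-entropy in nats (the usual Hugging Face Trainer convention) and that each `Stderr` can be used in a normal approximation. The helper names `perplexity` and `confidence_interval` are illustrative and are not part of the commit or of the evaluation harness.

```python
import math

# Values copied from the README added in this commit.
EVAL_LOSS = 2.8876
RESULTS = {
    ("arc_easy", "acc"): (0.4024, 0.0101),
    ("boolq", "acc"): (0.6199, 0.0085),
    ("winogrande", "acc"): (0.5201, 0.0140),
}


def perplexity(loss: float) -> float:
    """Convert a loss to perplexity.

    Assumption: `loss` is a mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


def confidence_interval(value: float, stderr: float, z: float = 1.96):
    """Normal-approximation interval: value +/- z * stderr (z=1.96 is ~95%)."""
    return (value - z * stderr, value + z * stderr)


if __name__ == "__main__":
    print(f"eval perplexity: {perplexity(EVAL_LOSS):.2f}")
    for (task, metric), (value, stderr) in RESULTS.items():
        low, high = confidence_interval(value, stderr)
        print(f"{task:<12} {metric}: {value:.4f}  [{low:.4f}, {high:.4f}]")
```

Under these assumptions the evaluation loss corresponds to a perplexity of roughly 18. It is measured on the training run's own evaluation set, so it is not comparable to the `lambada_openai` perplexity in the table. The `winogrande` interval includes 0.5, the chance level for that two-option task.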