BEE-spoke-data
/

verysmol_llama-v11-KIx2

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions

pszemraj commited on Oct 20, 2023

Commit

fc9bdd9

·

1 Parent(s): 4a8cba0

Update README.md

Files changed (1) hide show

README.md +17 -8

README.md CHANGED Viewed

@@ -59,24 +59,33 @@ datasets:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# verysmol_llama-v10-rw3m_dd-knowledge-inoc-concat-v1-vN
 This model is a fine-tuned version of [pszemraj/verysmol_llama-v10-rw3m_dd](https://huggingface.co/pszemraj/verysmol_llama-v10-rw3m_dd) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.8876
 - Accuracy: 0.4502
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# verysmol_llama-v11-KIx2
+## Model description
 This model is a fine-tuned version of [pszemraj/verysmol_llama-v10-rw3m_dd](https://huggingface.co/pszemraj/verysmol_llama-v10-rw3m_dd) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.8876
 - Accuracy: 0.4502
+## evals
+`hf-causal-experimental (pretrained=pszemraj/verysmol_llama-v11-KIx2,revision=main,trust_remote_code=True,dtype='float'), limit: None, provide_description: False, num_fewshot: 0, batch_size: 16`
+|     Task     |Version| Metric | Value  |   |Stderr|
+|--------------|------:|--------|-------:|---|-----:|
+|arc_easy      |      0|acc     |  0.4024|±  |0.0101|
+|              |       |acc_norm|  0.3788|±  |0.0100|
+|boolq         |      1|acc     |  0.6199|±  |0.0085|
+|lambada_openai|      0|ppl     |111.9939|±  |4.6906|
+|              |       |acc     |  0.2354|±  |0.0059|
+|openbookqa    |      0|acc     |  0.1440|±  |0.0157|
+|              |       |acc_norm|  0.2760|±  |0.0200|
+|piqa          |      0|acc     |  0.5713|±  |0.0115|
+|              |       |acc_norm|  0.5664|±  |0.0116|
+|winogrande    |      0|acc     |  0.5201|±  |0.0140|
 ## Training procedure