pszemraj
/

tFINE-850m-24x24-v0.5-instruct-L1

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Oct 11, 2024

Commit

c6cfc9d

·

verified ·

1 Parent(s): 3b6f2da

Update README.md

Files changed (1) hide show

README.md +20 -18

README.md CHANGED Viewed

@@ -28,17 +28,26 @@ It achieves the following results on the evaluation set:
 - Gen Len: 441.475
 - Num Input Tokens Seen: 435513684
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -73,10 +82,3 @@ The following hyperparameters were used during training:
 | 1.2577        | 0.8874 | 11000 | 1.1752          | 39.3539 | 23.0123 | 31.9005 | 37.4941   | 424.445 | 386471860         |
 | 1.193         | 0.9680 | 12000 | 1.1526          | 40.1804 | 23.1008 | 32.3484 | 38.2103   | 422.225 | 421585440         |
-### Framework versions
-- Transformers 4.45.1
-- Pytorch 2.4.1+cu124
-- Datasets 3.0.1
-- Tokenizers 0.20.0

 - Gen Len: 441.475
 - Num Input Tokens Seen: 435513684
+## Quick eval
+Quick eval for:	`pszemraj/tFINE-850m-24x24-v0.5-instruct-L1`
+hf (pretrained=pszemraj/tFINE-850m-24x24-v0.5-instruct-L1,trust_remote_code=True,dtype=bfloat16,trust_remote_code=True), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: 8
+|    Tasks    |Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
+|-------------|------:|----------------|-----:|-----------|---|-----:|---|------|
+|boolq        |      2|none            |     0|acc        |↑  |0.5661|±  |0.0087|
+|openbookqa   |      1|none            |     0|acc        |↑  |0.1540|±  |0.0162|
+|             |       |none            |     0|acc_norm   |↑  |0.2960|±  |0.0204|
+|piqa         |      1|none            |     0|acc        |↑  |0.6094|±  |0.0114|
+|             |       |none            |     0|acc_norm   |↑  |0.5952|±  |0.0115|
+|social_iqa   |      0|none            |     0|acc        |↑  |0.3900|±  |0.0110|
+|tinyArc      |      0|none            |    25|acc_norm   |↑  |0.2903|±  |   N/A|
+|tinyGSM8k    |      0|flexible-extract|     5|exact_match|↑  |0.0471|±  |   N/A|
+|             |       |strict-match    |     5|exact_match|↑  |0.0339|±  |   N/A|
+|tinyHellaswag|      0|none            |    10|acc_norm   |↑  |0.2490|±  |   N/A|
+|tinyMMLU     |      0|none            |     0|acc_norm   |↑  |0.3021|±  |   N/A|
+|winogrande   |      1|none            |     0|acc        |↑  |0.4925|±  |0.0141|
 ## Training procedure
 | 1.2577        | 0.8874 | 11000 | 1.1752          | 39.3539 | 23.0123 | 31.9005 | 37.4941   | 424.445 | 386471860         |
 | 1.193         | 0.9680 | 12000 | 1.1526          | 40.1804 | 23.1008 | 32.3484 | 38.2103   | 422.225 | 421585440         |