ericflo commited on
Commit
efd2bfe
1 Parent(s): 86b5e8a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -1
README.md CHANGED
@@ -80,7 +80,16 @@ While this model was trained on a mix of high-quality datasets, it may still exh
80
 
81
  ## Evaluation
82
 
83
- The model has not yet been evaluated - evaluation metrics will be added as they become available.
 
 
 
 
 
 
 
 
 
84
 
85
  ## Technical Specifications
86
 
 
80
 
81
  ## Evaluation
82
 
83
+ | Tasks |Version| Filter |n-shot| Metric | |Value | |Stderr|
84
+ |-----------------|-------|----------------|-----:|-----------|---|-----:|---|------|
85
+ |tinyBenchmarks | N/A| | | | | | | |
86
+ | - tinyArc | 0|none | 25|acc_norm |↑ |0.6056|± | N/A|
87
+ | - tinyGSM8k | 0|flexible-extract| 5|exact_match|↑ |0.4793|± | N/A|
88
+ | | |strict-match | 5|exact_match|↑ |0.4793|± | N/A|
89
+ | - tinyHellaswag | 0|none | 10|acc_norm |↑ |0.8261|± | N/A|
90
+ | - tinyMMLU | 0|none | 0|acc_norm |↑ |0.6358|± | N/A|
91
+ | - tinyTruthfulQA| 0|none | 0|acc |↑ |0.5098|± | N/A|
92
+ | - tinyWinogrande| 0|none | 5|acc_norm |↑ |0.7447|± | N/A|
93
 
94
  ## Technical Specifications
95