huihui-ai/Qwen2.5-Coder-7B-Instruct-abliterated


huihui-ai committed on Oct 6, 2024

Commit 05dd50d · verified · 1 Parent(s): e14d359

Update README.md

Files changed (1)

README.md +13 -0


@@ -95,3 +95,16 @@ while True:
 
 ```
 
+## Evaluations
+Both models were re-evaluated; each score below is the average for that benchmark.
+| Benchmark   | Qwen2.5-Coder-7B-Instruct | Qwen2.5-Coder-7B-Instruct-abliterated |
+|-------------|---------------------------|---------------------------------------|
+| IF_Eval     | **63.14**                 | 61.90                                 |
+| MMLU Pro    | 33.54                     | **33.56**                             |
+| TruthfulQA  | **51.804**                | 48.8                                  |
+| BBH         | 46.98                     | **47.17**                             |
+| GPQA        | **32.85**                 | 32.63                                 |
+The evaluation script is in this repository at /eval.sh; you can also view it [here](https://huggingface.co/huihui-ai/Qwen2.5-Coder-7B-Instruct-abliterated/blob/main/eval.sh).
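
The table added in this commit is easier to compare once the per-benchmark difference is computed. The following is a minimal sketch, assuming the scores are copied by hand from the table. The names `SCORES` and `score_deltas` are illustrative only and are not part of the repository or of eval.sh.

```python
# Scores copied from the Evaluations table in this commit.
# Each tuple is (Qwen2.5-Coder-7B-Instruct, Qwen2.5-Coder-7B-Instruct-abliterated).
# SCORES and score_deltas are hypothetical names used for illustration.
SCORES = {
    "IF_Eval": (63.14, 61.90),
    "MMLU Pro": (33.54, 33.56),
    "TruthfulQA": (51.804, 48.8),
    "BBH": (46.98, 47.17),
    "GPQA": (32.85, 32.63),
}


def score_deltas(scores):
    """Return abliterated minus original for each benchmark."""
    return {name: abliterated - original
            for name, (original, abliterated) in scores.items()}


if __name__ == "__main__":
    for name, delta in score_deltas(SCORES).items():
        # A negative delta means the abliterated model scored lower.
        print(f"{name:<12} {delta:+.3f}")
```

A negative value means the abliterated model scored lower than the original on that benchmark. The sketch only restates the table; it does not rerun any evaluation.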