abacusai
/

Smaug-2-72B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ArkaAbacus commited on Apr 11

Commit

f31412c

•

1 Parent(s): 2064815

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -27,6 +27,14 @@ We ran MT-Bench with the Qwen conversation template.
 | Qwen1.5-72B-Chat | 8.59 | 8.08   | 8.34    |
 | Smaug-2-72B      | 8.86 | 8.20   | 8.53
 ## Model Details

 | Qwen1.5-72B-Chat | 8.59 | 8.08   | 8.34    |
 | Smaug-2-72B      | 8.86 | 8.20   | 8.53
+#### HumanEval
+We ran HumanEval with pass@1 with the Qwen conversation template. Smaug-2 outperforms Qwen1.5-72B-Chat by approximately 10%:
+| Model | pass@1 (%) |
+| ------| ---------- |
+| Qwen1.5-72B-Chat | 56.7 |
+| Smaug-2-72B      | 66.5 |
 ## Model Details