Update README.md
Browse files
README.md
CHANGED
@@ -82,7 +82,7 @@ If you want to deploy the server as an online service, you can use `--api-keys s
|
|
82 |
|
83 |
| Model | Size | Context | Weights | Serving |
|
84 |
|-------------------|------|---------|------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------|
|
85 |
-
| OpenChat
|
86 |
|
87 |
<details>
|
88 |
<summary>Example request (click to expand)</summary>
|
@@ -217,12 +217,13 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
|
|
217 |
|
218 |
🔥 OpenChat-3.5 0106 (7B) now outperforms Grok-0 (33B) on **all 4 benchmarks** and Grok-1 (???B) on average and **3/4 benchmarks**.
|
219 |
|
220 |
-
|
|
221 |
-
|
222 |
-
| OpenChat
|
223 |
-
| OpenChat
|
224 |
-
|
|
225 |
-
| Grok-
|
|
|
226 |
|
227 |
*: Grok results are reported by [X.AI](https://x.ai/).
|
228 |
|
|
|
82 |
|
83 |
| Model | Size | Context | Weights | Serving |
|
84 |
|-------------------|------|---------|------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------|
|
85 |
+
| OpenChat-3.5-0106 | 7B | 8192 | [Huggingface](https://huggingface.co/openchat/openchat-3.5-0106) | `python -m ochat.serving.openai_api_server --model openchat/openchat-3.5-0106 --engine-use-ray --worker-use-ray` |
|
86 |
|
87 |
<details>
|
88 |
<summary>Example request (click to expand)</summary>
|
|
|
217 |
|
218 |
🔥 OpenChat-3.5 0106 (7B) now outperforms Grok-0 (33B) on **all 4 benchmarks** and Grok-1 (???B) on average and **3/4 benchmarks**.
|
219 |
|
220 |
+
| | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
|
221 |
+
|-----------------------|-------------|---------|----------|--------|-----------|----------|----------|
|
222 |
+
| **OpenChat-3.5-0106** | Apache-2.0 | **7B** | **70.0** | 65.8 | **71.3** | **29.3** | **77.4** |
|
223 |
+
| OpenChat-3.5-1210 | Apache-2.0 | **7B** | 60.1 | 65.3 | 68.9 | 28.9 | 77.3 |
|
224 |
+
| OpenChat-3.5 | Apache-2.0 | **7B** | 56.4 | 64.3 | 55.5 | 28.6 | 77.3 |
|
225 |
+
| Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
|
226 |
+
| Grok-1 | Proprietary | ???B | 55.8 | **73** | 63.2 | 23.9 | 62.9 |
|
227 |
|
228 |
*: Grok results are reported by [X.AI](https://x.ai/).
|
229 |
|