openchat/openchat-3.5-0106

@@ -82,7 +82,7 @@ If you want to deploy the server as an online service, you can use `--api-keys s
 | Model             | Size | Context | Weights                                                          | Serving                                                                                                          |
 |-------------------|------|---------|------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------|
-| OpenChat 3.5 0106 | 7B   | 8192    | [Huggingface](https://huggingface.co/openchat/openchat-3.5-0106) | `python -m ochat.serving.openai_api_server --model openchat/openchat-3.5-0106 --engine-use-ray --worker-use-ray` |
+| OpenChat-3.5-0106 | 7B   | 8192    | [Huggingface](https://huggingface.co/openchat/openchat-3.5-0106) | `python -m ochat.serving.openai_api_server --model openchat/openchat-3.5-0106 --engine-use-ray --worker-use-ray` |
 <details>
   <summary>Example request (click to expand)</summary>
@@ -217,12 +217,13 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 🔥 OpenChat-3.5 0106 (7B) now outperforms Grok-0 (33B) on **all 4 benchmarks** and Grok-1 (???B) on average and **3/4 benchmarks**.
-|                   | License     | # Param | Average  | MMLU | HumanEval | MATH     | GSM8k    |
-|-------------------|-------------|---------|----------|------|-----------|----------|----------|
-| OpenChat 3.5 1210 | Apache-2.0  | **7B**  | **60.1** | 65.3 | **68.9**  | **28.9** | **77.3** |
-| OpenChat 3.5      | Apache-2.0  | **7B**  | 56.4     | 64.3 | 55.5      | 28.6     | **77.3** |
-| Grok-0            | Proprietary | 33B     | 44.5     | 65.7 | 39.7      | 15.7     | 56.8     |
-| Grok-1            | Proprietary | ???B    | 55.8     | 73   | 63.2      | 23.9     | 62.9     |
+|                       | License     | # Param | Average  | MMLU   | HumanEval | MATH     | GSM8k    |
+|-----------------------|-------------|---------|----------|--------|-----------|----------|----------|
+| **OpenChat-3.5-0106** | Apache-2.0  | **7B**  | **61.0** | 65.8   | **71.3**  | **29.3** | **77.4** |
+| OpenChat-3.5-1210     | Apache-2.0  | **7B**  | 60.1     | 65.3   | 68.9      | 28.9     | 77.3     |
+| OpenChat-3.5          | Apache-2.0  | **7B**  | 56.4     | 64.3   | 55.5      | 28.6     | 77.3     |
+| Grok-0                | Proprietary | 33B     | 44.5     | 65.7   | 39.7      | 15.7     | 56.8     |
+| Grok-1                | Proprietary | ???B    | 55.8     | **73** | 63.2      | 23.9     | 62.9     |
 *: Grok results are reported by [X.AI](https://x.ai/).