Spaces: openchat / README

alpayariyak committed on Dec 18, 2023

Commit 9603be0 • 1 Parent(s): b9a70f8

Update README.md

Files changed (1)

README.md +32 -45

@@ -71,36 +71,25 @@ pinned: false
 <hr>
-<p align="center" style="margin-top: 0px; font-size: 1.2em; background-color: #3c72db; padding: 0.5em; border-radius: 0.5em; color: white; font-weight: bold;">
-  <a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: white;">
-    <span style="font-size: 1.4em; font-family: 'Helvetica'; letter-spacing: 0.2em">OPENCHAT</span>
-    <span style="font-size: 1.4em; font-family: 'Helvetica'; background-color: white; padding: 0.2em; border-radius: 0.3em; color: #3c72db;"> 3.5 </span>
-    <br>
-    <span>
-    First 7B Model that Achieves ChatGPT-Level Performance
-    <br>#1 Open-Source Model on MT-bench scoring 7.81, outperforming 70B models
     </span>
   </a>
-<!-- <a href="https://huggingface.co/openchat/openchat_3.5">
-  <button class="common-button">Model Repo</button>
-</a>
-<a href="https://openchat.team">
-  <button class="common-button">OpenChatUI Demo</button>
-</a>
-<a href="https://huggingface.co/spaces/openchat/openchat_3.5">
-  <button class="common-button">HuggingFace Space</button>
-</a>
-<a href="https://arxiv.org/pdf/2309.11235.pdf">
-  <button class="common-button">Paper</button>
-</a>
- -->
-</p>
-  <div align="center" style="justify-content: center; align-items: center; "'>
-  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/3.5-benchmarks.png?raw=true" style="width: 100%;  border-radius: 0.5em">
-  </div>
-</p>
 <h1 style="vertical-align: middle;">
     <img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
@@ -120,26 +109,24 @@ pinned: false
 # 📊 Benchmarks
-| Model              | # Params | Average  | MT-Bench     | AGIEval  | BBH MC   | TruthfulQA    | MMLU         | HumanEval       | BBH CoT     | GSM8K        |
-|--------------------|----------|----------|--------------|----------|----------|---------------|--------------|-----------------|-------------|--------------|
-| OpenChat-3.5       | **7B**   | **61.6** | 7.81         | **47.4** | **47.6** | **59.1**      | 64.3         | **55.5**        | 63.5        | **77.3**     |
-| ChatGPT (March)*   | ?        | 61.5     | **7.94**     | 47.1     | **47.6** | 57.7          | **67.3**     | 48.1            | **70.1**    | 74.9         |
-|                    |          |          |              |          |          |               |              |                 |             |              |
-| OpenHermes 2.5     | 7B       | 59.3     | 7.54         | 46.5     | 49.4     | 57.5          | 63.8         | 48.2            | 59.9        | 73.5         |
-| OpenOrca Mistral   | 7B       | 52.7     | 6.86         | 42.9     | 49.4     | 45.9          | 59.3         | 38.4            | 58.1        | 59.1         |
-| Zephyr-β^          | 7B       | 34.6     | 7.34         | 39.0     | 40.6     | 40.8          | 39.8         | 22.0            | 16.0        | 5.1          |
-| Mistral**          | 7B       | -        | 6.84         | 38.0     | 39.0     | -             | 60.1         | 30.5            | -           | 52.2         |
-| Open-source SOTA** | 13B-70B  | 61.4     | 7.71         | 41.7     | 49.7     | 62.3          | 63.7         | 73.2            | 41.4        | 82.3         |
-|                    |          |          | WizardLM 70B | Orca 13B | Orca 13B | Platypus2 70B | WizardLM 70B | WizardCoder 34B | Flan-T5 11B | MetaMath 70B |
 ## 𝕏 Comparison with [X.AI Grok](https://x.ai/)
-|              | License     | # Param | Average  | MMLU | HumanEval | MATH     | GSM8k    |
-|--------------|-------------|---------|----------|------|-----------|----------|----------|
-| OpenChat 3.5 | Apache-2.0  | 7B      | **56.4** | 64.3 | 55.5      | **28.6** | **77.3** |
-| Grok-0       | Proprietary | 33B     | 44.5     | 65.7 | 39.7      | 15.7     | 56.8     |
-| Grok-1       | Proprietary | ?       | 55.8     | 73   | 63.2      | 23.9     | 62.9     |
 # 💌Contact

 <hr>
+<div style="background-color: white; padding: 0.7em; border-radius: 0.5em; color: black; display: flex; flex-direction: column; justify-content: center; text-align: center; ont-size: 0.5em;">
+  <a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: black;">
+    <span style="font-size: 1.7em; font-family: 'Helvetica'; letter-spacing: 0.1em; font-weight: bold; color: black;">OPENCHAT</span><span style="font-size: 1.8em; font-family: 'Helvetica'; color: #3c72db; ">3.5</span>
+        <span style="font-size: 0.7em;  font-family: 'Helvetica'; color:  white; vertical-align: top;  background-color:red;  border-radius: 6em; padding: 0.066em 0.4em; letter-spacing: 0.1em; font-weight: bold;">1210</span>
+    <span style="font-size: 0.85em; font-family: 'Helvetica'; color: black;">
+      <br> 🏆 The Overall Best Performing Open Source 7B Model 🏆
+    <br> 🤖 Outperforms <span style="font-weight: bold;">ChatGPT</span> (March) and <span style="font-weight: bold;">Grok-1</span> 🤖
+      <br> 🚀<span style="font-size: 1em; font-family: 'Helvetica'; color: black; font-weight: bold;">15</span>-point improvement in Coding over <span style="font-size: 0.9em;
+      font-family: 'Helvetica'; color: black; font-weight: bold;">OpenChat-3.5🚀</span>
+      <br><br><span style="font-size: 1em; font-family: 'Helvetica'; color: #3c72db; font-weight: bold;">New Features</span>
+      <br> 💡 2 Modes: Coding + Generalist, Mathematical Reasoning 💡
+      <br> 🧑‍⚖️ Experimental support for Evaluator and Feedback capabilities 🧑‍⚖️
     </span>
   </a>
+</div>
+<div style="display: flex; justify-content: center; align-items: center">
+  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/1210bench.png?raw=true" style="width: 100%; border-radius: 1em">
+</div>
 <h1 style="vertical-align: middle;">
     <img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
 # 📊 Benchmarks
+| Model              | # Params | Average  | MT-Bench     | HumanEval       | BBH MC   | AGIEval  | TruthfulQA    | MMLU         | GSM8K        | BBH CoT     |
+|--------------------|----------|----------|--------------|-----------------|----------|----------|---------------|--------------|--------------|-------------|
+| OpenChat-3.5-1210  | **7B**   | **63.8** | 7.76         | **68.9**        | **49.5** | **48.0** | **61.8**      | 65.3         | **77.3**     | 61.8        |
+| OpenChat-3.5       | **7B**   | 61.6     | 7.81         | 55.5            | 47.6     | 47.4     | 59.1          | 64.3         | **77.3**     | 63.5        |
+| ChatGPT (March)*   | ?        | 61.5     | **7.94**     | 48.1            | 47.6     | 47.1     | 57.7          | **67.3**     | 74.9         | **70.1**    |
+|                    |          |          |              |                 |          |          |               |              |              |             |
+| OpenHermes 2.5     | 7B       | 59.3     | 7.54         | 48.2            | 49.4     | 46.5     | 57.5          | 63.8         | 73.5         | 59.9        |
+| OpenOrca Mistral   | 7B       | 52.7     | 6.86         | 38.4            | 49.4     | 42.9     | 45.9          | 59.3         | 59.1         | 58.1        |
+| Zephyr-β^          | 7B       | 34.6     | 7.34         | 22.0            | 40.6     | 39.0     | 40.8          | 39.8         | 5.1          | 16.0        |
+| Mistral            | 7B       | -        | 6.84         | 30.5            | 39.0     | 38.0     | -             | 60.1         | 52.2         | -           |
 ## 𝕏 Comparison with [X.AI Grok](https://x.ai/)
+|                   | License     | # Param | Average  | MMLU | HumanEval | MATH     | GSM8k    |
+|-------------------|-------------|---------|----------|------|-----------|----------|----------|
+| OpenChat 3.5 1210 | Apache-2.0  | **7B**  | **60.1** | 65.3 | **68.9**  | **28.9** | **77.3** |
+| OpenChat 3.5      | Apache-2.0  | **7B**  | 56.4     | 64.3 | 55.5      | 28.6     | **77.3** |
+| Grok-0            | Proprietary | 33B     | 44.5     | 65.7 | 39.7      | 15.7     | 56.8     |
+| Grok-1            | Proprietary | ???B    | 55.8     | 73   | 63.2      | 23.9     | 62.9     |
 # 💌Contact
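
Reviewer note: the diff does not say how the `Average` column is computed. The sketch below tests one hypothesis, which is an assumption and not something stated in the README: that `Average` is the plain mean of the per-benchmark scores, with MT-Bench (a 0-10 scale) multiplied by 10 first in the main table. The helper name `mean_score` and the data layout are illustrative only. The numbers are copied from the added (`+`) rows of the two tables.

```python
from decimal import Decimal, ROUND_HALF_UP


def mean_score(scores, mt_bench=None):
    """Mean of benchmark scores, rounded to one decimal place.

    Assumption (not stated in the README): MT-Bench, reported on a 0-10
    scale, is multiplied by 10 before averaging with the other scores.
    """
    values = [Decimal(s) for s in scores]
    if mt_bench is not None:
        values.append(Decimal(mt_bench) * 10)
    mean = sum(values) / len(values)
    return mean.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


# Main table rows: (reported Average, MT-Bench,
# [HumanEval, BBH MC, AGIEval, TruthfulQA, MMLU, GSM8K, BBH CoT])
MAIN_TABLE = {
    "OpenChat-3.5-1210": ("63.8", "7.76",
                          ["68.9", "49.5", "48.0", "61.8", "65.3", "77.3", "61.8"]),
    "OpenChat-3.5": ("61.6", "7.81",
                     ["55.5", "47.6", "47.4", "59.1", "64.3", "77.3", "63.5"]),
    "ChatGPT (March)": ("61.5", "7.94",
                        ["48.1", "47.6", "47.1", "57.7", "67.3", "74.9", "70.1"]),
}

# Grok comparison rows: (reported Average, [MMLU, HumanEval, MATH, GSM8k])
GROK_TABLE = {
    "OpenChat 3.5 1210": ("60.1", ["65.3", "68.9", "28.9", "77.3"]),
    "OpenChat 3.5": ("56.4", ["64.3", "55.5", "28.6", "77.3"]),
    "Grok-0": ("44.5", ["65.7", "39.7", "15.7", "56.8"]),
    "Grok-1": ("55.8", ["73", "63.2", "23.9", "62.9"]),
}

for name, (reported, mt_bench, scores) in MAIN_TABLE.items():
    print(name, "computed:", mean_score(scores, mt_bench), "reported:", reported)

for name, (reported, scores) in GROK_TABLE.items():
    print(name, "computed:", mean_score(scores), "reported:", reported)
```

`Decimal` is used instead of floats so that a tie such as Grok-1's 55.75 rounds predictably. If the computed and reported values agree for every row, that is consistent with the hypothesis but does not confirm how the authors actually derived the column.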