---
title: README
emoji: 🚀
colorFrom: blue
colorTo: blue
sdk: static
pinned: false
---

<style>
  a {
    text-decoration: none; /* Removes the default underline */
    color: inherit; /* Inherits the color from the parent element */
  }

  a:hover, a:focus {
    text-decoration: underline; /* Adds underline on hover or focus for better accessibility */
  }

  .link-text {
    display: inline-block; 
    vertical-align: middle; 
    font-size: 1.26em; 
    margin-right: 10px; 
    margin-left: 0px; 
    margin-top: 0px; 
    margin-bottom: 0px;
  }
</style>

<style>
  .common-button {
    background-color: white;
    color: #3c72db;
    border: 2px solid #3c72db;
    padding: 10px 25px;
    border-radius: 0.3em;
    margin: 2px;
    font-size: 1em;
    text-decoration: none;
  }
</style>

<link href='https://fonts.googleapis.com/css?family=Source+Sans+Pro' rel='stylesheet' type='text/css'>

<div align="center">
        <a>
    <img src="https://github.com/imoneoi/openchat/raw/master/assets/logo_new.png" style="width: 55%">
  </a>
    <h1 style="font-size: 1.30em; ">Advancing Open-source Language Models with Mixed-Quality Data</h1>
</div>

<p align="center" style="margin-top: 0px;">
  <a href="https://openchat.team">
    <img src="https://github.com/alpayariyak/openchat/blob/master/logo_new-removebg-preview.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
    <span class="link-text">OpenChat Online Demo</span>
  </a> |
  <a href="https://github.com/imoneoi/openchat">
    <img src="https://camo.githubusercontent.com/4133dc1cd4511d4a292b84ce10e52e4ed92569fb2a8165381c9c47be5edc2796/68747470733a2f2f6564656e742e6769746875622e696f2f537570657254696e7949636f6e732f696d616765732f706e672f6769746875622e706e67" alt="GitHub Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
    <span class="link-text">GitHub</span>
  </a> |
  <a href="https://arxiv.org/pdf/2309.11235.pdf">
    <img src="https://github.com/alpayariyak/openchat/blob/master/arxiv-logomark-small-square-border.png?raw=true" alt="ArXiv Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
    <span class="link-text">Paper</span>
  </a> |
  <a href="https://discord.gg/pQjnXvNKHY">
    <img src="https://cloud.githubusercontent.com/assets/6291467/26705903/96c2d66e-477c-11e7-9f4e-f3c0efe96c9a.png" alt="Discord Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
    <span class="link-text">Discord</span>
  </a>
</p>


<hr>
<p align="center" style="margin-top: 0px; font-size: 1.2em; background-color: #3c72db; padding: 0.5em; border-radius: 0.5em; color: white; font-weight: bold;">
  <a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: white;">
    <span style="font-size: 1.4em; font-family: 'Helvetica'; letter-spacing: 0.2em">OPENCHAT</span> 
    <span style="font-size: 1.4em; font-family: 'Helvetica'; background-color: white; padding: 0.2em; border-radius: 0.3em; color: #3c72db;"> 3.5 </span>
    <br>
    <span>
    First 7B Model to Achieve ChatGPT-level Performance
    <br>#1 Open-Source Model on MT-bench scoring 7.81, outperforming 70B models
    </span>
  </a>
  <br>
<a href="https://huggingface.co/openchat/openchat_3.5">
  <button class="common-button">Model Repo</button>
</a>
<a href="https://openchat.team">
  <button class="common-button">OpenChatUI Demo</button>
</a>
<a href="https://huggingface.co/spaces/openchat/openchat_3.5">
  <button class="common-button">HuggingFace Space</button>
</a>
<a href="https://arxiv.org/pdf/2309.11235.pdf">
  <button class="common-button">Paper</button>
</a>

</p>


  <div align="center" style="justify-content: center; align-items: center; ">
  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/Untitled%20design-17.png?raw=true" style="width: 100%;  border-radius: 0.5em">
  </div>
</p>

<h1 style="vertical-align: middle;">
    <img src="https://github.com/alpayariyak/openchat/blob/master/logo_new-removebg-preview.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
</h1>

- OpenChat is an innovative library of **open-source language models**, fine-tuned with [**C-RLFT**](https://arxiv.org/pdf/2309.11235.pdf) - a strategy inspired by offline reinforcement learning.
- Our models learn from mixed-quality data without preference labels, delivering exceptional performance on par with `ChatGPT`, even with a `7B` model which can be run on a **consumer GPU (e.g. RTX 3090)**.
- Despite our simple approach, we are committed to developing a high-performance, commercially viable, open-source large language model, and we continue to make significant strides toward this vision.

# 📰 News

- [2023/11/01] We released the [OpenChat-3.5-7B](https://huggingface.co/openchat/openchat_3.5) model, surpassing ChatGPT on various benchmarks 🔥.

- [2023/09/21] We released our paper [OpenChat: Advancing Open-source Language Models with Mixed-Quality Data](https://arxiv.org/pdf/2309.11235.pdf).

# 📊 Benchmarks

| Model              | # Params | Average  | MT-Bench     | AGIEval  | BBH MC   | TruthfulQA    | MMLU         | HumanEval       | BBH CoT     | GSM8K        |
|--------------------|----------|----------|--------------|----------|----------|---------------|--------------|-----------------|-------------|--------------|
| OpenChat-3.5       | **7B**   | **61.6** | 7.81         | **47.4** | **47.6** | **59.1**      | 64.3         | **55.5**        | 63.5        | **77.3**     |
| ChatGPT (March)*   | ?        | 61.5     | **7.94**     | 47.1     | **47.6** | 57.7          | **67.3**     | 48.1            | **70.1**    | 74.9         |
|                    |          |          |              |          |          |               |              |                 |             |              |
| OpenHermes 2.5     | 7B       | 59.3     | 7.54         | 46.5     | 49.4     | 57.5          | 63.8         | 48.2            | 59.9        | 73.5         |
| OpenOrca Mistral   | 7B       | 52.7     | 6.86         | 42.9     | 49.4     | 45.9          | 59.3         | 38.4            | 58.1        | 59.1         |
| Zephyr-β^          | 7B       | 34.6     | 7.34         | 39.0     | 40.6     | 40.8          | 39.8         | 22.0            | 16.0        | 5.1          |
| Mistral**          | 7B       | -        | 6.84         | 38.0     | 39.0     | -             | 60.1         | 30.5            | -           | 52.2         |
| Open-source SOTA** | 13B-70B  | 61.4     | 7.71         | 41.7     | 49.7     | 62.3          | 63.7         | 73.2            | 41.4        | 82.3         |
|                    |          |          | WizardLM 70B | Orca 13B | Orca 13B | Platypus2 70B | WizardLM 70B | WizardCoder 34B | Flan-T5 11B | MetaMath 70B |


## 𝕏 Comparison with [X.AI Grok](https://x.ai/)

|              | License     | # Param | Average  | MMLU | HumanEval | MATH     | GSM8k    |
|--------------|-------------|---------|----------|------|-----------|----------|----------|
| OpenChat 3.5 | Apache-2.0  | 7B      | **56.4** | 64.3 | 55.5      | **28.6** | **77.3** |
| Grok-0       | Proprietary | 33B     | 44.5     | 65.7 | 39.7      | 15.7     | 56.8     |
| Grok-1       | Proprietary | ?       | 55.8     | 73   | 63.2      | 23.9     | 62.9     |
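
The Average column above appears to be the unweighted mean of the four benchmark columns (MMLU, HumanEval, MATH, GSM8k). That is an inference from the numbers rather than something stated here, so the following is only a minimal sketch for checking it. The `SCORES` dictionary and `mean_score` helper are illustrative names, and half-up rounding to one decimal place is an assumption.

```python
from decimal import Decimal, ROUND_HALF_UP

# Scores copied from the comparison table, in the order:
# MMLU, HumanEval, MATH, GSM8k.
SCORES = {
    "OpenChat 3.5": ["64.3", "55.5", "28.6", "77.3"],
    "Grok-0": ["65.7", "39.7", "15.7", "56.8"],
    "Grok-1": ["73", "63.2", "23.9", "62.9"],
}


def mean_score(values):
    """Unweighted mean, rounded half-up to one decimal place (assumed)."""
    total = sum(Decimal(v) for v in values)
    mean = total / len(values)
    return mean.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


# Decimal arithmetic avoids binary floating-point surprises on exact ties.
averages = {name: mean_score(values) for name, values in SCORES.items()}

for name, avg in averages.items():
    print(f"{name}: {avg}")
```

Under these assumptions the computed means match the Average column for all three rows.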

# 💌 Contact

We are a student team from Tsinghua University working on OpenChat, a project that requires additional computing power or LLM API keys for further development. If you are interested in our project and would like to offer support, please feel free to reach out to us:

* Wang Guan [imonenext at gmail dot com]
* Cheng Sijie [csj23 at mails dot tsinghua dot edu dot cn]

We look forward to hearing from you and collaborating on this exciting project!