---
title: README
emoji: π
colorFrom: blue
colorTo: blue
sdk: static
pinned: false
---
<style>
a {
text-decoration: none; /* Removes the default underline */
color: inherit; /* Inherits the color from the parent element */
}
a:hover, a:focus {
text-decoration: underline; /* Adds underline on hover or focus for better accessibility */
}
.link-text {
display: inline-block;
vertical-align: middle;
font-size: 1.26em;
margin-right: 10px;
margin-left: 0px;
margin-top: 0px;
margin-bottom: 0px;
}
</style>
<style>
.common-button {
background-color: white;
color: #3c72db;
border: 2px solid #3c72db;
padding: 10px 25px;
border-radius: 0.3em;
margin: 2px;
font-size: 1em;
text-decoration: none;
}
</style>
<link href='https://fonts.googleapis.com/css?family=Source+Sans+Pro' rel='stylesheet' type='text/css'>
<div align="center">
<a>
<img src="https://github.com/imoneoi/openchat/raw/master/assets/logo_new.png" style="width: 55%">
</a>
<h1 style="font-size: 1.30em; ">Advancing Open-source Language Models with Mixed-Quality Data</h1>
</div>
<p align="center" style="margin-top: 0px;">
<a href="https://openchat.team">
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">Online Demo</span>
</a> |
<a href="https://github.com/imoneoi/openchat">
<img src="https://camo.githubusercontent.com/582429992c94328783a1509030dfd344c5845fb94be4a7b85fcf8e70b686e1b1/68747470733a2f2f6564656e742e6769746875622e696f2f537570657254696e7949636f6e732f696d616765732f706e672f6769746875622e706e67" alt="GitHub Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">GitHub</span>
</a> |
<a href="https://arxiv.org/pdf/2309.11235.pdf">
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/arxiv-logomark-small-square-border.png?raw=true" alt="ArXiv Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">Paper</span>
</a> |
<a href="https://discord.gg/pQjnXvNKHY">
<img src="https://cloud.githubusercontent.com/assets/6291467/26705903/96c2d66e-477c-11e7-9f4e-f3c0efe96c9a.png" alt="Discord Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">Discord</span>
</a>
</p>
OpenChat is dedicated to advancing and releasing **open-source language models**, fine-tuned with our [**C-RLFT**](https://arxiv.org/pdf/2309.11235.pdf) technique, which is inspired by offline reinforcement learning. Our models learn from mixed-quality data without preference labels and deliver performance on par with `ChatGPT`. We were the first to surpass it with a `7B`-parameter LLM, a size that can run on a **consumer GPU (e.g. RTX 3090)**.
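As a rough illustration of the consumer-GPU point above, the sketch below estimates how much memory the weights of a 7B-parameter model need at different precisions. The numbers are assumptions for illustration, not figures from the OpenChat team: the parameter count is taken as exactly 7e9, the RTX 3090 is taken to have 24 GiB of VRAM, and only the weights are counted (no KV cache, activations, or framework overhead). The helper name `weight_memory_gib` is hypothetical.

```python
# Back-of-the-envelope check: do the weights of a 7B-parameter model fit on a 24 GiB GPU?
# Assumptions (illustrative, not from the OpenChat authors): exactly 7e9 parameters,
# 24 GiB of VRAM, and weights only (KV cache, activations, and overhead are ignored).

GIB = 2**30  # bytes per GiB


def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed to hold the weights alone, in GiB."""
    return num_params * bytes_per_param / GIB


NUM_PARAMS = 7e9  # "7B" taken at face value
VRAM_GIB = 24     # assumed RTX 3090 memory

fp16_gib = weight_memory_gib(NUM_PARAMS, 2)  # 16-bit weights (fp16 / bf16)
fp32_gib = weight_memory_gib(NUM_PARAMS, 4)  # 32-bit weights

print(f"16-bit weights: {fp16_gib:.1f} GiB, fits in {VRAM_GIB} GiB: {fp16_gib < VRAM_GIB}")
print(f"32-bit weights: {fp32_gib:.1f} GiB, fits in {VRAM_GIB} GiB: {fp32_gib < VRAM_GIB}")
```

Under these assumptions, 16-bit weights take roughly 13 GiB, which leaves headroom on a 24 GiB card for the KV cache and activations, while 32-bit weights alone would exceed it. Actual usage depends on context length, batch size, and the serving stack.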
<!--
# 📰 News
- [2024/03/15] Nexusflow releases [Starling-Beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta), an RLHF-tune of openchat-3.5-1106, which is currently the highest ranking Open Source LLM on LMSys Arena not originating from a company, **beating all others at only 7B**.
- [2024/03/08] Released [OpenChat-3.5-0106-Gemma](https://huggingface.co/openchat/openchat-3.5-0106-gemma), the highest performing Gemma fine-tune.
- [2024/01/07] Released [OpenChat-3.5-0106](https://huggingface.co/openchat/openchat-3.5-0106), trained with a new data pipeline - **the strongest 7B LLM in the world**.
- Ranked as the top 7B LLM on LMSys Arena.
- Ranked on LMSys Arena as the top open source LLM not originating from a company.
- [2023/12/10] Released [OpenChat-3.5-1210](https://huggingface.co/openchat/openchat-3.5-1210), with a 15-point improvement in coding.
- [2023/11/01] Released [OpenChat-3.5-7B](https://huggingface.co/openchat/openchat_3.5), surpassing ChatGPT on various benchmarks 🔥.
- [2023/09/21] Released our paper [OpenChat: Advancing Open-source Language Models with Mixed-Quality Data](https://arxiv.org/pdf/2309.11235.pdf).
# Benchmarks
| Model | # Params | Average | MT-Bench | HumanEval | BBH MC | AGIEval | TruthfulQA | MMLU | GSM8K | BBH CoT |
|--------------------|----------|----------|--------------|-----------------|----------|----------|---------------|--------------|--------------|-------------|
| OpenChat-3.5-0106 | **7B** | **64.5** | 7.8 | **71.3** | 51.5 | 49.1 | 61.0 | **65.8** | 77.4 | 62.2 |
| OpenChat-3.5-0106-Gemma | **7B** | 64.4 | 7.83 | 67.7 | **52.7** | **50.2** | 55.4 | 65.7 | **81.5** | 63.7 |
| OpenChat-3.5-1210 | **7B** | 63.8 | 7.76 | 68.9 | 49.5 | 48.0 | **61.8** | 65.3 | 77.3 | 61.8 |
| OpenChat-3.5 | **7B** | 61.6 | 7.81 | 55.5 | 47.6 | 47.4 | 59.1 | 64.3 | 77.3 | 63.5 |
| ChatGPT (March)* | ? | 61.5 | **7.94** | 48.1 | 47.6 | 47.1 | 57.7 | 67.3 | 74.9 | **70.1** |
| | | | | | | | | | | |
| OpenHermes 2.5 | 7B | 59.3 | 7.54 | 48.2 | 49.4 | 46.5 | 57.5 | 63.8 | 73.5 | 59.9 |
| OpenOrca Mistral | 7B | 52.7 | 6.86 | 38.4 | 49.4 | 42.9 | 45.9 | 59.3 | 59.1 | 58.1 |
| Zephyr-β^ | 7B | 34.6 | 7.34 | 22.0 | 40.6 | 39.0 | 40.8 | 39.8 | 5.1 | 16.0 |
| Mistral | 7B | - | 6.84 | 30.5 | 39.0 | 38.0 | - | 60.1 | 52.2 | - |
## Comparison with [X.AI Grok](https://x.ai/)
| | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
|-------------------|-------------|---------|----------|------|-----------|----------|----------|
| **OpenChat-3.5-0106** | Apache-2.0 | **7B** | **61.0** | 65.8 | **71.3** | **29.3** | **77.4** |
| OpenChat 3.5 1210 | Apache-2.0 | **7B** | 60.1 | 65.3 | 68.9 | 28.9 | 77.3 |
| OpenChat 3.5 | Apache-2.0 | **7B** | 56.4 | 64.3 | 55.5 | 28.6 | 77.3 |
| Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
| Grok-1 | Proprietary | ???B | 55.8 | **73** | 63.2 | 23.9 | 62.9 |
-->
# Contact
**Project Lead:**
- Guan Wang [[email protected]]
- [Alpay Ariyak](https://github.com/alpayariyak) [[email protected]]
**Sponsors:**
- [Sen Song](https://scholar.google.com/citations?user=cYgtRP4AAAAJ) (Tsinghua University)
- [Yang Liu](https://nlp.csai.tsinghua.edu.cn/~ly/) (Tsinghua University)
- [01.AI Company](https://www.lingyiwanwu.com/en)
- [RunPod](https://www.runpod.io/)