File size: 8,222 Bytes
2e39f41 fd2fdaa 2e39f41 8c069a4 ed22ba7 9136c02 ba5dffd 9136c02 ba5dffd 9136c02 f6dcda2 0f52dd9 53b6be6 f613971 044620d f613971 044620d 53b6be6 ed22ba7 76ec8f2 a9d27f0 9136c02 76ec8f2 a9d27f0 ed22ba7 76ec8f2 a9d27f0 ed22ba7 76ec8f2 a9d27f0 ed22ba7 8c069a4 76ec8f2 f6dcda2 0f52dd9 044620d 32eb045 9136c02 32eb045 46252b9 0f52dd9 e0106af fd2fdaa 22edf13 fd2fdaa afcb675 fd2fdaa afcb675 fd2fdaa afcb675 fd2fdaa 7f01910 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 |
---
title: README
emoji: π
colorFrom: blue
colorTo: blue
sdk: static
pinned: false
---
<style>
a {
text-decoration: none; /* Removes the default underline */
color: inherit; /* Inherits the color from the parent element */
}
a:hover, a:focus {
text-decoration: underline; /* Adds underline on hover or focus for better accessibility */
}
.link-text {
display: inline-block;
vertical-align: middle;
font-size: 1.26em;
margin-right: 10px;
margin-left: 0px;
margin-top: 0px;
margin-bottom: 0px;
}
</style>
<style>
.common-button {
background-color: white;
color: #3c72db;
border: 2px solid #3c72db;
padding: 10px 25px;
border-radius: 0.3em;
margin: 2px;
font-size: 1em;
text-decoration: none;
}
</style>
<link href='https://fonts.googleapis.com/css?family=Source+Sans+Pro' rel='stylesheet' type='text/css'>
<div align="center">
<a>
<img src="https://github.com/imoneoi/openchat/raw/master/assets/logo_new.png" style="width: 55%">
</a>
<h1 style="font-size: 1.30em; ">Advancing Open-source Language Models with Mixed-Quality Data</h1>
</div>
<p align="center" style="margin-top: 0px;">
<a href="https://openchat.team">
<img src="https://github.com/alpayariyak/openchat/blob/master/logo_new-removebg-preview.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">OpenChat Online Demo</span>
</a> |
<a href="https://github.com/imoneoi/openchat">
<img src="https://camo.githubusercontent.com/4133dc1cd4511d4a292b84ce10e52e4ed92569fb2a8165381c9c47be5edc2796/68747470733a2f2f6564656e742e6769746875622e696f2f537570657254696e7949636f6e732f696d616765732f706e672f6769746875622e706e67" alt="GitHub Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">GitHub</span>
</a> |
<a href="https://arxiv.org/pdf/2309.11235.pdf">
<img src="https://github.com/alpayariyak/openchat/blob/master/arxiv-logomark-small-square-border.png?raw=true" alt="ArXiv Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">Paper</span>
</a> |
<a href="https://discord.gg/pQjnXvNKHY">
<img src="https://cloud.githubusercontent.com/assets/6291467/26705903/96c2d66e-477c-11e7-9f4e-f3c0efe96c9a.png" alt="Discord Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 10px; margin-top: 0px; margin-bottom: 0px;"/>
<span class="link-text">Discord</span>
</a>
</p>
<hr>
<p align="center" style="margin-top: 0px; font-size: 1.2em; background-color: #3c72db; padding: 0.5em; border-radius: 0.5em; color: white; font-weight: bold;">
<a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: white;">
<span style="font-size: 1.4em; font-family: 'Helvetica'; letter-spacing: 0.2em">OPENCHAT</span>
<span style="font-size: 1.4em; font-family: 'Helvetica'; background-color: white; padding: 0.2em; border-radius: 0.3em; color: #3c72db;"> 3.5 </span>
<br>
<span>
First 7B Model to Achieve ChatGPT-level Performance
<br>#1 Open-Source Model on MT-bench scoring 7.81, outperforming 70B models
</span>
</a>
<br>
<a href="https://huggingface.co/openchat/openchat_3.5">
<button class="common-button">Model Repo</button>
</a>
<a href="https://openchat.team">
<button class="common-button">OpenChatUI Demo</button>
</a>
<a href="https://huggingface.co/spaces/openchat/openchat_3.5">
<button class="common-button">HuggingFace Space</button>
</a>
<a href="https://arxiv.org/pdf/2309.11235.pdf">
<button class="common-button">Paper</button>
</a>
</p>
<div align="center" style="justify-content: center; align-items: center; "'>
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/Untitled%20design-17.png?raw=true" style="width: 100%; border-radius: 0.5em">
</div>
</p>
<h1 style="vertical-align: middle;">
<img src="https://github.com/alpayariyak/openchat/blob/master/logo_new-removebg-preview.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
</h1>
- OpenChat is an innovative library of **open-source language models**, fine-tuned with [**C-RLFT**](https://arxiv.org/pdf/2309.11235.pdf) - a strategy inspired by offline reinforcement learning.
- Our models learn from mixed-quality data without preference labels, delivering exceptional performance on par with `ChatGPT`, even with a `7B` model which can be run on a **consumer GPU (e.g. RTX 3090)**.
- Despite our simple approach, we are committed to developing a high-performance, commercially viable, open-source large language model, and we continue to make significant strides toward this vision.
# π° News
- [2023/11/01] We released the [OpenChat-3.5-7B](https://huggingface.co/openchat/openchat_3.5) model, surpassing ChatGPT on various benchmarks π₯.
- [2023/09/21] We released our paper [OpenChat: Advancing Open-source Language Models with Mixed-Quality Data](https://arxiv.org/pdf/2309.11235.pdf).
# π Benchmarks
| Model | # Params | Average | MT-Bench | AGIEval | BBH MC | TruthfulQA | MMLU | HumanEval | BBH CoT | GSM8K |
|--------------------|----------|----------|--------------|----------|----------|---------------|--------------|-----------------|-------------|--------------|
| OpenChat-3.5 | **7B** | **61.6** | 7.81 | **47.4** | **47.6** | **59.1** | 64.3 | **55.5** | 63.5 | **77.3** |
| ChatGPT (March)* | ? | 61.5 | **7.94** | 47.1 | **47.6** | 57.7 | **67.3** | 48.1 | **70.1** | 74.9 |
| | | | | | | | | | | |
| OpenHermes 2.5 | 7B | 59.3 | 7.54 | 46.5 | 49.4 | 57.5 | 63.8 | 48.2 | 59.9 | 73.5 |
| OpenOrca Mistral | 7B | 52.7 | 6.86 | 42.9 | 49.4 | 45.9 | 59.3 | 38.4 | 58.1 | 59.1 |
| Zephyr-Ξ²^ | 7B | 34.6 | 7.34 | 39.0 | 40.6 | 40.8 | 39.8 | 22.0 | 16.0 | 5.1 |
| Mistral** | 7B | - | 6.84 | 38.0 | 39.0 | - | 60.1 | 30.5 | - | 52.2 |
| Open-source SOTA** | 13B-70B | 61.4 | 7.71 | 41.7 | 49.7 | 62.3 | 63.7 | 73.2 | 41.4 | 82.3 |
| | | | WizardLM 70B | Orca 13B | Orca 13B | Platypus2 70B | WizardLM 70B | WizardCoder 34B | Flan-T5 11B | MetaMath 70B |
## π Comparison with [X.AI Grok](https://x.ai/)
| | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
|--------------|-------------|---------|----------|------|-----------|----------|----------|
| OpenChat 3.5 | Apache-2.0 | 7B | **56.4** | 64.3 | 55.5 | **28.6** | **77.3** |
| Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
| Grok-1 | Proprietary | ? | 55.8 | 73 | 63.2 | 23.9 | 62.9 |
# πContact
We are a student team Tsinghua University, working on OpenChat, a project that requires additional computing power or LLMs API keys for further development. If you are interested in our project and would like to offer support, please feel free to reach out to us:
* Wang Guan [imonenext at gmail dot com]
* Cheng Sijie [csj23 at mails dot tsinghua dot edu dot cn]
We look forward to hearing you and collaborating on this exciting project! |