Update README.md
README.md
@@ -11,13 +11,13 @@ license: mit
 **Chuxin-1.6B-1M**是基于Chuxin-1.6B-base模型在1M窗口下训练后的结果,大海捞针实验显示其具有非常强的上下文检索能力。


-如果您想了解更多关于Chuxin-1.6B开源模型的细节,我们建议您参阅我们的[技术报告](https://
+如果您想了解更多关于Chuxin-1.6B开源模型的细节,我们建议您参阅我们的[技术报告](https://arxiv.org/pdf/2405.04828)

 **Chuxin-1.6B-Base** is a model with 1.6 billion parameters. Chuxin-1.6B is built entirely on open-source data. After being trained with large-scale data, Chuxin has very competitive capabilities in various downstream tasks.

 **Chuxin-1.6B-1M** is the result of training the Chuxin-1.6B-base model with a 1M windows. Experiments such as searching for a needle in a haystack demonstrate its strong contextual retrieval abilities.

-If you would like to learn more about the Chuxin-1.6B open-source model, we suggest you refer to our [technical report](https://
+If you would like to learn more about the Chuxin-1.6B open-source model, we suggest you refer to our [technical report](https://arxiv.org/pdf/2405.04828).
 <br>

 ## 快速使用(Quickstart)
@@ -96,7 +96,7 @@ If you find our work helpful, feel free to give us a cite.
 @article{chuxin,
 title={CHUXIN: 1.6B TECHNICAL REPORT},
 author={Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu},
-journal={arXiv preprint arXiv:
+journal={arXiv preprint arXiv:2405.04828},
 year={2024}
 }
 ```