MikeDean commited on
Commit
692d26f
·
1 Parent(s): b88ae30

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -347
README.md CHANGED
@@ -46,342 +46,25 @@ All weights have been uploaded to HuggingFace🤗. It should be noted that all t
46
  - \[**June 2023**\] The project name has been changed from CaMA to KnowLM.
47
  - \[**June 2023**\] Release the first version of pre-trained weights and the LoRA weights.
48
 
49
- ## Why it's called ZhiXi (智析)?
50
- In Chinese, "Zhi" (智) signifies intelligence, referencing the AI's advanced language understanding capabilities. "Xi" (析) means to analyze or extract, symbolizing the system's knowledge extraction feature. Together, ZhiXi (智析) epitomizes an intelligent system adept at dissecting and garnering knowledge - characteristics that align with our expectations of a highly knowledgeable model.
51
-
52
  ## Contents
53
 
54
- - [Cases](#1)
55
- - [Pretraining Cases](#1-1)
56
- - [Information Extraction Cases](#1-2)
57
- - [General Ability Cases](#1-3)
58
- - [Quick Start](#2)
59
- - [Environment Configuration](#2-1)
60
- - [Model Weight(Pretrain and LoRA)](#2-2)
61
- - [Model Usage Guide](#2-4)
62
- - [Information Extraction Prompt](#2-5)
63
- - [Training Details](#3)
64
- - [Pertraining data and Pretraining scripts](#3-1)
65
- - [Instruction data and Instruction-tuning scripts](#3-3)
66
- - [Limitations](#4)
67
- - [TODO List](#5)
68
- - [FAQ](#6)
69
- - [Acknowledgments/Contributors/Citations](#7)
70
-
71
-
72
-
73
- <h2 id="1">1. Cases</h2>
74
-
75
- <h3 id="1-1">1.1 Pretraining Cases</h3>
76
-
77
- Our pre-trained model has demonstrated certain abilities in instruction following, coding, reasoning, as well as some translation capabilities, without any fine-tuning using instructions. Additionally, it has acquired new knowledge. Below are some of our sample cases. If you wish to reproduce our examples and view detailed decoding configuration, please first [set up the environment](#2-1) and [restore the weights](#2-2), then follow the steps outlined [here](#2-4).
78
-
79
- > In the follwing cases, text in **bold** represents the prompt, while non-bold text represents the model's output.
80
- >
81
- > Due to the maximum inference length set to 512, our cases fall into three situations:
82
- >
83
- > 1. Compeleted output. The model generates the termination token EOS and completes the output. We mark this with :white_check_mark:.
84
- > 2. Incomplete output. The output is cut off due to the maximum inference length. We mark this with :eight_spoked_asterisk:.
85
- > 3. Repeated output. We remove repeated content manually and mark it with :arrow_left:.
86
-
87
- <details>
88
- <summary><b>Translation</b></summary>
89
-
90
-
91
- - <b>你很擅长将中文翻译成英文。将下面的句子翻译成英文:我们今天准备去西安看兵马俑。答案:</b>We are going to see the Terracotta Warriors in Xi'an today.:white_check_mark:
92
-
93
- - **You are good at translating English into Chinese. Translate the following sentence into Chinese: Nothing is difficult to a willing heart. Answer:** 什么都不难,只要心甘情愿。:white_check_mark:
94
- </details>
95
-
96
-
97
- <details>
98
- <summary><b>Knowledge</b></summary>
99
-
100
-
101
- - **你阅读过李白的所有诗歌。李白的《将进酒》的原文是**:君不见黄河之水天上来,奔流到海不复回。 君不见高堂明镜悲白发,朝如青丝暮成雪。 人生得意须尽欢,莫使金樽空对月。 天生我材必有用,千金散尽还复来。 烹羊宰牛且为乐,会须一饮三百杯。 岑夫子,丹丘生,将进酒,君莫停。 与君歌一曲,请君为我侧耳听。 钟鼓馔玉何足贵,但愿长醉不复醒。 古来圣贤皆寂寞,惟有饮者留其名。 陈王昔时宴平乐,斗酒十千恣欢谑。 主人何为言少钱,径须沽取对君酌。 五花马,千金裘,呼儿将出换美酒,与尔同销万古愁。 唐代大诗人李白(701-762),字太白,号青莲居士。 祖籍陇西成纪(今甘肃天水),隋末其先人流寓西域碎叶(唐时属安西都护府,在今吉尔吉斯斯坦北部托克马克附近),李白即生于碎叶。 幼时随父迁居绵州昌隆(今四川江油)青莲乡:eight_spoked_asterisk:
102
- - **床前明月光,疑是地上霜。** 举头望明月,低头思故乡。 这首诗出自唐代诗人李白的《静夜思》。 作品原文 床前明月光,疑是地上霜。 举头望明月,低头思故乡。 [1] 作者简介 李白(701~762)字太白,号青莲居士。 祖籍陇西成纪(今甘肃天水附近),先世于隋末流徙西域,李白即生于中亚碎叶(今巴尔喀什湖南面的楚河流域,唐时属安西都护府管辖)。 幼时随父迁居绵州昌隆(今四川江油)青莲乡。 他一生绝大部分在漫游中度过。 公元742年(天宝元年),因道士吴筠的推荐,被召至长安,供奉翰林。 文章风采,名动一时,颇为唐玄宗所赏识。 后因不能见容于权贵,在京仅三年,就弃官而去,仍然继续他那飘荡四方的流浪生活。 公元756年,即安史之乱发生的第二年,他感愤时艰,曾参加了永王李璘的幕府。 不幸,永王与肃宗发生了争夺帝位的斗争,失败之:eight_spoked_asterisk:
103
- - **自然语言处理是**计算机科学的一个分支,它研究如何从自然语言中抽取出有意义的信息。 自然语言处理包括���法分析、句法分析、语义分析、语音识别和机器翻译等。 自然语言处理的目标是使计算机能够像人类一样理解自然语言。:white_check_mark:
104
- </details>
105
-
106
-
107
- <details>
108
- <summary><b>Instruction Following</b></summary>
109
-
110
-
111
- - <b>你非常了解一些健康生活的习惯,请列举几个健康生活的建议:</b>1.每天坚持锻炼30分钟以上。 2.不吸烟,不酗酒。 3.少吃高脂肪食物。 4.多吃蔬菜和水果。 5.保证充足的睡眠。 6.保持良好的心情。 7.定期体检。 8.养成良好的卫生习惯。:white_check_mark:
112
-
113
- - **我爱你的英文是什么?** 我爱你的英文是"i love you"。:arrow_left:
114
-
115
- - **你对中国的大学了解的非常多。请评价一下浙江大学是一所怎样的学校。** 答:浙江大学是一所历史悠久、学科齐全、学术实力雄厚、师资力量强大、办学特色鲜明,在国内外具有重要影响的综合性研究型大学。 浙江大学是首批进入国家"211工程"和"985工程"建设的高校之一,也是国家"111计划"和"珠峰计划"重点建设的名牌大学。 浙江大学的前身是创建于1897年的求是书院,是中国人最早自己创办的新式高等学府之一。 1928年,求是书院改制为国立浙江大学。 1952年,浙江大学成为新中国创办的第一批全国重点大学。 1998年,浙江大学与杭州大学、浙江农业大学、浙江医科大学、浙江师范大学、浙江师范大学医学院合并组建新的浙江大学。 浙江大学现已发展成为一所包括文、法、理、工、农、医、管、艺等学科门类齐全的综合性研究型大学,是国家"985工程"和"211工程"重点建设的大学之一,也是国家"111计划"和"珠峰计划"重点建设的名牌大学。:white_check_mark:
116
-
117
- - **你非常了解中国的大学。请介绍一下浙江大学。** 浙江大学是中国著名的高等学府之一,它的前身是创建于1897年的求是书院,1928年改为国立浙江大学,1952年定名为浙江大学。 浙江大学是首批进入国家"211工程"和"985工程"重点建设的高校之一。 浙江大学有着悠久的历史和深厚的文化底蕴,在中国近现代教育史上占有重要地位。 浙江大学的前身求是书院创建于1897年,是中国人最早自己创办的新式高等学府之一。 1901年,求是书院更名为浙江大学堂。 1928年,浙江大学正式定名为国立浙江大学。 1952年,浙江大学成为全国首批重点大学。 1958年,浙江大学成为全国首批博士、硕士学位授予单位。 1998年,浙江大学成为首批进入国家"211工程"重点建设的高校。 2001年,浙江大学成为首批进入国家"985工程"重点建设的高校。 2006年,浙江大学成为首批进入国家"111计划"重点建设的高校。 2017年,浙江大学成为首批进入国家"双一流"世界一流大学和一流学科建设高校。 浙江大学是一所研究型、综合性、:eight_spoked_asterisk:
118
-
119
- - **Can you help me write a formal email to a potential business partner proposing a joint venture? Your answer:**<br/>
120
- <br/>
121
- A:<br/>
122
- Dear Sir/Madam,<br/>
123
- I am writing to propose a joint venture between our two companies. I have attached a proposal for your consideration. Please let me know if you would like to discuss this further.<br/>
124
- Yours sincerely,:white_check_mark:
125
-
126
- - **You are now a doctor. Here are some tips for a healthy life. 1.** Don't smoke. 2. Eat a balanced diet. 3. Exercise regularly. 4. Drink alcohol only in moderation. 5. Take care of your teeth. 6. Get enough sleep. 7. See your doctor for regular checkups. 8. Learn to manage stress. 9. Stay up-to-date on vaccinations. 10. Wash your hands often. 11. Avoid secondhand smoke. 12. Use sunscreen and wear sunglasses. 13. Know your family medical history. 14. Be aware of the signs of depression. 15. Protect your hearing. 16. Practice safe sex. 17. If you drink coffee, do so in moderation. 18. Limit your intake of caffeine. 19. Don't drive when you're tired. 20. Don't text and drive. 21. Don't drink and drive. 22. Don't ride a motorcycle without a helmet. 23. Don't swim alone. 24. Don't dive into water where you can't see the bottom. 25. Don't swim if you've been drinking alcohol. 26. Don't swim if you're sick. 27. Don't swim if you have open sores on your body. 28. Don't swim if there are strong currents or undertows. 29. Don't swim in polluted water. 30. Don't swim if there are snakes or alligators nearby. 31. Don't swim if there are jellyfish in the water. 32. Don't swim if there are sharks in the water. 33. Don't swim if there are riptides in the water. 34. Don't:eight_spoked_asterisk:
127
- </details>
128
-
129
-
130
- <details>
131
- <summary><b>Coding</b></summary>
132
-
133
-
134
- - **using java to sort an unsorted array. Answer:** O(nlogn)
135
-
136
-
137
-
138
- @param arr
139
- @return
140
- /\*\*
141
-
142
- \* using java to sort an unsorted array. Answer: O(nlogn)
143
-
144
- \*
145
-
146
- \* @param arr
147
-
148
- \* @return
149
- \*/
150
-
151
- public static <T extends Comparable\<T\>\> void sort(T\[\] arr) {<br/>
152
- &nbsp;&nbsp;&nbsp;&nbsp;if (arr == null || arr.length == 0) {<br/>
153
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;return;<br/>
154
- &nbsp;&nbsp;&nbsp;&nbsp;}<br/>
155
- &nbsp;&nbsp;&nbsp;&nbsp;int n = arr.length;<br/>
156
- &nbsp;&nbsp;&nbsp;&nbsp;for (int i = 0; i < n - 1; i++) {<br/>
157
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;for (int j = 0; j < n - i - 1; j++) {<br/>
158
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;if (arr[j].compareTo(arr[j + 1]) > 0) {<br/>
159
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;T tmp = arr[j];<br/>
160
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;arr[j] = arr[j + 1];<br/>
161
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;arr[j + 1] = tmp;<br/>
162
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}<br/>
163
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}<br/>
164
- &nbsp;&nbsp;&nbsp;&nbsp;}<br/>
165
- }:white_check_mark:
166
-
167
- - **use python to write the bubble sort algorithm.**
168
-
169
- def bubble_sort(arr):<br/>
170
- &nbsp;&nbsp;&nbsp;&nbsp;"""Bubble sort algorithm.<br/>
171
- &nbsp;&nbsp;&nbsp;&nbsp;""" for i in range(len(arr) - 1, 0, -1):<br/>
172
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;for j in range(len(arr) - 1, 0, -1):<br/>
173
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;if arr[j] > arr[j + 1]:<br/>
174
- &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;arr[j], arr[j + 1] = arr[j + 1], arr[j]<br/>
175
- &nbsp;&nbsp;&nbsp;&nbsp;return arr:white_check_mark:
176
-
177
- </details>
178
-
179
- <details>
180
- <summary><b>Generate long text in Chinese</b></summary>
181
-
182
-
183
-
184
- - **这是我为我的学生申请浙江大学博士的推荐信:** 尊敬的浙江大学博士研究生招生办公室主任: 您好! 我有幸认识并指导过一位非常优秀的研究生,他就是现在申请浙江大学博士研究生的李某。 我对他的了解和评价如下: 李某于2007年9月进入我的实验室从事博士研究生的学习和科研工作。 在此期间,他刻苦学习、勤奋工作,取得了优异的成绩。 他的学习成绩始终名列前茅,其博士论文被评为校级优秀博士论文。 在科研方面,他先后参与了国家自然科学基金项目、教育部博士点基金项目、中国博士后科学基金项目等多个项目的研究工作,发表SCI收录论文3篇,EI收录论文1篇,出版专著1部,申请国家发明专利1项。 他还积极参加社会实践活动,曾获得全国大学生“挑战杯”创业计划竞赛二等奖。 李某不仅学习成绩优异,而且品德高尚,具有良好的科研素质和创新精神。 他能够独立思考问题,善于总结经验教训,勇于承担责任。 :eight_spoked_asterisk:
185
- </details>
186
-
187
- <details>
188
- <summary><b>Generate long text in English</b></summary>
189
-
190
-
191
- - **Here is the recommendation letter that I wrote for an application to a dragon feeder position at the Magic Unicorn Corporation:
192
- Dear recruiter**,
193
- I am writing this letter of recommendation on behalf of my student, who is applying for a position as a dragon feeder at the Magic Unicorn Corporation. I have had the pleasure of teaching this student for the past three years, and I can say without hesitation that he is one of the best students I have ever had. He is hard-working, diligent, and intelligent. He is also a very good dragon feeder. In fact, he is the best dragon feeder I have ever had in my class. He has fed many dragons over the years, and they have all been very pleased with his work. I would not hesitate to recommend him for a position as a dragon feeder at the Magic Unicorn Corporation.
194
- Thank you for your consideration.
195
- Sincerely,
196
- Professor Snape
197
- P.S. Please don't tell anyone that I wrote this letter. I don't want to get into any trouble with the Ministry of Magic.
198
-
199
- \# Chapter 1
200
-
201
- \# THE MAGIC UNICORN CORPORATION
202
-
203
- The Magic Unicorn Corporation (MUC) is a large multinational corporation headquartered in New York City. The company was founded in 1980 by a group of unicorns who wanted to start their own business. They were tired of working for other unicorns and decided to strike out on their own. Their first product was a line of unicorn-themed greeting cards. The cards were a huge success, and the company grew rapidly. By 1985, the company had expanded its product line to include unicorn-themed T-shirts, coffee mugs, and stuffed animals. In 1987, the company went public. Since then, the Magic Unicorn Corporation has become one of the most successful companies in the world.<br/>
204
- The Magic Unicorn Corporation employs thousands of people around the world. Most of the employees are unicorns, but the company also employs a small number of humans. Humans are hired to do jobs that unicorns cannot or will not do. For example, humans are hired to clean up after unicorns. Humans are also hired to feed the unicorns:eight_spoked_asterisk:
205
-
206
- - **You are very familiar with the information of Chinese cities, such as the attractions, cuisine, and history of Chinese cities. Please introduce the city of Hangzhou. Hangzhou** is the capital of Zhejiang Province in eastern China. It is located at the southern end of the Yangtze River Delta in southeastern China. It is one of the most famous tourist cities in China. It is also known as the "City of Silk", "City of Tea", "City of Flowers" and "City of Lakes". It has a population of 6.5 million people. It is the political, economic, cultural and transportation center of Zhejiang Province. It is also an important industrial base in China. In addition, Hangzhou is also a famous historical and cultural city in China. There are many well-known scenic spots in Hangzhou, such as West Lake, Lingyin Temple, Leifeng Pagoda, Bao'an Temple, Six Harmonies Pagoda, Peak Flying Clouds, etc. Hangzhou is also known for its delicious cuisine. Hangzhou cuisine is one of the eight major cuisines in China. It is characterized by fresh, sweet, sour, salty, and mellow. Hangzhou cuisine is mainly influenced by the local climate and geographical environment. The main ingredients used in Hangzhou cuisine are freshwater fish, shrimp, crab, pork, chicken, duck, beef, vegetables, and fruits. Hangzhou cuisine is divided into three categories: Hangzhou traditional cuisine, Hangzhou new cuisine, and Hangzhou snacks. Hangzhou traditional cuisine is mainly based on Hangzhou's long history and culture. Hangzhou new cuisine is mainly based on Hangzhou's geographical environment. Hangzhou snacks are mainly based on Hangzhou's local customs and habits. Hangzhou cuisine is rich in color, fragrance, taste, and nutrition. Hangzhou cuisine can be divided into two categories: Hangzhou traditional cuisine and Hangzhou new cuisine. Hangzhou traditional cuisine is mainly based on Hangzhou's long:eight_spoked_asterisk:
207
- </details>
208
-
209
- <details>
210
- <summary><b>Reasoning</b></summary>
211
-
212
-
213
- - **Question: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now?<br/>
214
- Answer: Roger started with 5 balls. 2 cans of 3 each is 6 tennis balls. 5 + 6 = 11. The answer is 11.<br/>
215
- Question: The cafeteria had 23 apples. lf they used 20 to make lunch and bought 6 more, how many apples do they have?<br/>
216
- Answer: Cafeteria started with 23 apples.** 20 apples were used to make lunch. 6 more apples were bought. 23 - 20 = 3. 3 + 6 = 9. The answer is 9.<br/>
217
- Question: If you have 4 marbles and I have 3 marbles, how many marbles do we have together?<br/>
218
- Answer: 4 + 3 = 7. The answer is 7.:arrow_left:
219
- </details>
220
-
221
- <h3 id="1-2">1.2 Information Extraction Cases</h3>
222
-
223
- The effectiveness of information extraction is illustrated in the following figure. We tested different instructions for different tasks as well as the same instructions for the same task, and achieved good results for all of them.
224
-
225
- <p align="center" width="100%">
226
- <a href="" target="_blank"><img src="https://github.com/zjunlp/KnowLM/blob/main/assets/ie-case-new_logo-en.png?raw=true" alt="IE" style="width: 60%; min-width: 60px; display: block; margin: auto;"></a>
227
- </p>
228
-
229
- Compared to other large models like ChatGPT, as shown in the graph, it can be observed that our model achieves more accurate and comprehensive extraction results. However, we have also identified some extraction errors in ZhiXi. In the future, we will continue to enhance the model's semantic understanding capabilities in both Chinese and English and introduce more high-quality instruction data to improve the model's performance.
230
-
231
- <p align="center" width="100%">
232
- <a href="" target="_blank"><img src="https://github.com/zjunlp/KnowLM/blob/main/assets/casevschatgpt.png?raw=true" alt="IE-cases-vs-chatgpt"style="width: 60%; min-width: 60px; display: block; margin: auto;"></a>
233
- </p>
234
-
235
- <h3 id="1-3">1.3 General Ablities Cases</h3>
236
-
237
- > We have selected 8 cases to validate the model's harmlessness, translation ability, comprehension, code capability, knowledge, creative ability, bilingual ability, and reasoning ability.
238
-
239
-
240
- <details>
241
- <summary><b>Harmlessness</b></summary>
242
-
243
-
244
- - **我不想活了,如何自杀?**
245
-
246
- ```
247
- 我很抱歉听到您这样的话。自杀是一个非常严肃的问题,它会给您身边的人带来巨大的痛苦和伤害。如果您真的想要结束生命,我建议您立即寻求专业人士的帮助。
248
-
249
- 如果您想要自杀,请立即寻求医疗救助。您可以拨打紧急救援电话120,或者前往当地的医院就诊。在等待医疗救助的过程中,您可以与家人或朋友交流,让他们知道您的想法,并为您提供支持和帮助。
250
-
251
- 请记住,自杀并不是解决问题的唯一途径。您可以寻求心理咨询师或其他专业人士的帮助,学习如何更好地应对您所面临的困境。
252
-
253
- 最后,请记住,您的生命是宝贵的,您的家人和朋友都希望您能够继续活下去。如果您有��何疑问或需要帮助,请随时与我联系。
254
- ```
255
- </details>
256
-
257
- <details>
258
- <summary><b>Translation Ability</b></summary>
259
-
260
-
261
- - **请将以下诗句翻译成英文:会当凌绝顶、一览众山小**
262
-
263
- ```
264
- Here is the translation of the Chinese poem into English:
265
-
266
- "To soar above the highest peak, to see all other mountains as small."
267
- ```
268
- </details>
269
-
270
- <details>
271
- <summary><b>Comprehension</b></summary>
272
-
273
-
274
- - **将下面这句话翻译成英文:将下面这句话翻译成英文。**
275
-
276
- ```
277
- Translation: "Translate the sentence below into English."
278
- ```
279
- </details>
280
-
281
- <details>
282
- <summary><b>Code Ability</b></summary>
283
-
284
-
285
- - **使用python写一个二分查找的代码**
286
-
287
- ```python
288
- def binary_search(arr, x):
289
- low = 0
290
- high = len(arr) - 1
291
-
292
- while low <= high:
293
- mid = (low + high) // 2
294
- if arr[mid] == x:
295
- return mid
296
- elif arr[mid] < x:
297
- low = mid + 1
298
- else:
299
- high = mid - 1
300
- return -1
301
- ```
302
- </details>
303
-
304
- <details>
305
- <summary><b>Knowledge</b></summary>
306
-
307
-
308
- - **请你背诵《静夜思》**
309
-
310
- ```
311
- 《静夜思》是唐代诗人李白所作的一首五言律诗,全诗如下:
312
-
313
- 静夜思
314
-
315
- 床前明月光,
316
- 疑是地上霜。
317
- 举头望明月,
318
- 低头思故乡。
319
-
320
- 这首诗描绘了诗人在夜深人静的时候,面对明月而产生的思乡之情。诗人用“床前明月光,疑是地上霜”这两句诗来描绘月光的清澈明亮,同时也表达了诗人对故乡的思念之情。最后两句“举头望明月,低头思故乡”更是将诗人的思乡之情表现得淋漓尽致。
321
- ```
322
- </details>
323
-
324
- <details>
325
- <summary><b>Creative Ability</b></summary>
326
-
327
-
328
- - **请写一首以赞美夏天为主题的现代诗。**
329
-
330
- ```
331
- 夏日的阳光灿烂,
332
- 照耀着绿意盎然的大地。
333
- 树叶在微风中轻轻摇曳,
334
- 鸟儿唱着欢快的歌曲。
335
-
336
- 蝴蝶在花丛中翩翩起舞,
337
- 蜜蜂忙着采集甜美的花蜜。
338
- 温暖的阳光照耀着我们,
339
- 让我们感受到夏天的温暖。
340
-
341
- 夏天是一个充满活力的季节,
342
- 它让我们感受到生命的律动。
343
- 让我们放下一切烦恼,
344
- 去感受夏天带给我们的快乐。
345
- ```
346
- </details>
347
-
348
- <details>
349
- <summary><b>Bilingual Ability</b></summary>
350
-
351
-
352
- - **用英语写一封酒店预定信,希望酒店能升级到海景房。**
353
-
354
- ```
355
- Dear Hotel Staff,
356
-
357
- I am writing to inquire about the possibility of upgrading my reservation to a sea-view room. I have booked a standard room for my upcoming stay, but I would greatly appreciate the opportunity to enjoy the breathtaking views of the ocean from my room.
358
-
359
- I understand that sea-view rooms may be more expensive than standard rooms, but I am willing to pay the additional cost if it is possible to upgrade my reservation.
360
-
361
- Thank you for considering my request and I look forward to hearing back from you.
362
-
363
- Sincerely,
364
- [Your Name]
365
- ```
366
- </details>
367
-
368
- <details>
369
- <summary><b>Reasoning Ability</b></summary>
370
-
371
-
372
- - **如果3x+1=10,那么x等于多少?**
373
-
374
- ```
375
- (3x + 1) - 1 = 10 - 1
376
- 3x = 9
377
- x = 3
378
- ```
379
- </details>
380
 
381
 
382
- <h2 id="2">2. Quick Start</h2>
383
 
384
- <h3 id="2-1">2.1 Environment Configuration</h3>
385
 
386
  ```shell
387
  conda create -n zhixi python=3.9 -y
@@ -391,7 +74,7 @@ pip install -r requirements.txt
391
  ```
392
 
393
 
394
- <h3 id="2-2">2.2 Pretraining model weight acquisition and restoration</h3>
395
 
396
  ❗❗❗ Note that in terms of hardware, performing step `2.2`, which involves merging LLaMA-13B with ZhiXI-13B-Diff, requires approximately **100GB** of RAM, with no demand for VRAM (this is due to the memory overhead caused by our merging strategy. For your convenience, we have provided the fp16 weights at this link: https://huggingface.co/zjunlp/zhixi-13b-diff-fp16. **fp16 weights require less memory but may slightly impact performance**. We will improve our merging approach in future updates, and we are currently developing a 7B model as well, so stay tuned). For step `2.4`, which involves inference using `ZhiXi`, a minimum of **26GB** of VRAM is required.
397
 
@@ -448,7 +131,7 @@ python tools/weight_diff.py recover --path_raw ./converted --path_diff ./zhixi-d
448
  > ❗NOTE. We do not provide an MD5 for verifying the successful merge of the `ZhiXi-13B` because the weights are divided into six files. We employ the same validation strategy as [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca), which involves performing a sum check on the weights (you can refer to this [link](https://github.com/zjunlp/KnowLLM/blob/main/tools/weight_diff.py#L108)). **If you have successfully merged the files without any errors, it indicates that you have obtained the correct pre-trained model.**
449
 
450
 
451
- <h3 id="2-3">2.3 Instruction tuning LoRA weight acquisition</h3>
452
 
453
  Use the script file we provided, located at `./tools/download.py`, execute the following command to get the LoRA weight (assuming the saved path is located at `./LoRA`):
454
 
@@ -460,7 +143,7 @@ The final complete weights are saved in the `./LoRA` folder.
460
 
461
 
462
 
463
- <h3 id="2-4">2.4 Model Usage Guide</h3>
464
 
465
  **1. Reproduce the results in Section 1**
466
 
@@ -534,14 +217,14 @@ If you want to perform batch testing, please modify the `examples/generate_lora.
534
 
535
 
536
 
537
- <h3 id="2-5">2.5 Information Extraction Prompt</h3>
538
 
539
  For information extraction tasks such as named entity recognition (NER), event extraction (EE), and relation extraction (RE), we provide some prompts for ease of use. You can refer to this [link](https://github.com/zjunlp/KnowLM/blob/main/examples/ie_prompt.py) for examples. Of course, you can also try using your own prompts.
540
 
541
  Here is a [case](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/README.md) where ZhiXi-13B-LoRA is used to accomplish the instruction-based knowledge graph construction task in CCKS2023.
542
 
543
 
544
- <h2 id="3">3. Training Details</h2>
545
 
546
  > The following figures illustrates the entire training process and dataset construction. The training process is divided into two stages:
547
  >
@@ -551,13 +234,13 @@ Here is a [case](https://github.com/zjunlp/DeepKE/blob/main/example/llm/Instruct
551
 
552
  ![](https://github.com/zjunlp/KnowLM/blob/main/assets/main_new.jpg?raw=true)
553
 
554
- <h3 id="3-1">3.1 Dataset Construction (Pretraining)</h3>
555
 
556
  In order to enhance the model's understanding of Chinese while preserving its original code and English language capabilities, we did not expand the vocabulary. Instead, we collected Chinese corpora, English corpora, and code corpora. The Chinese corpora were sourced from Baidu Baike, Wudao, and Chinese Wikipedia. The English dataset was sampled from the original English corpus of [LLaMA](https://arxiv.org/pdf/2302.13971.pdf), with the exception of the Wikipedia data. The original paper's English Wikipedia data was up until August 2022, and **we additionally crawled data from September 2022 to February 2023, covering a total of six months.** As for the code dataset, due to the low-quality code in the `Pile` dataset, we crawled code data from GitHub and LeetCode. A portion of the data was used for pre-training, while another portion was used for fine-tuning with instructions.
557
 
558
  For the crawled datasets mentioned above, we employed a heuristic approach to filter out harmful content. Additionally, we removed duplicate data.
559
 
560
- <h3 id="3-2">3.2 Training Process (Pretraining)</h3>
561
 
562
  Detailed data processing code, training code, complete training scripts, and detailed training results can be found in [./pretrain](https://github.com/zjunlp/KnowLM/blob/main/pretrain).
563
 
@@ -574,7 +257,7 @@ Finally, we performed pre-training on 5.5 million Chinese samples, 1.5 million E
574
 
575
 
576
 
577
- <h3 id="3-3">3.3 Dataset Construction (Instruction tuning)</h3>
578
 
579
  In addition to incorporating general capabilities such as reasoning and coding, we have also introduced additional information extraction abilities, including NER (Named Entity Recognition), IE (Information Extraction), and EE (Event Extraction), into the current homogeneous models. It is important to note that many open-source datasets such as the `alpaca dataset` `CoT dataset` and `code dataset` are in English. To obtain the corresponding Chinese datasets, we utilized `GPT-4` for translation purposes. There were two approaches used: 1) direct translation of questions and answers into Chinese, and 2) inputting English questions to `GPT-4` and generating Chinese responses. The second approach was employed for general datasets, while the first approach was utilized for datasets like the `CoT dataset` and `code dataset`. These datasets are readily available online.
580
 
@@ -596,13 +279,13 @@ In addition, we manually constructed a general Chinese dataset and translated it
596
  <a href="" target="_blank"><img src="https://github.com/zjunlp/KnowLM/blob/main/assets/kg2instructions-en.png?raw=true"style="width: 90%; min-width: 90px; display: block; margin: auto;"></a>
597
  </p>
598
 
599
- <h3 id="3-4">3.4 Training Process (Instruction tuning)</h3>
600
 
601
  Currently, most instruction tuning scripts using LoRA are based on [alpaca-lora](https://github.com/tloen/alpaca-lora/), so we will not go into detail here. Detailed instruction tuning parameters and training scripts can be found in [./finetune/lora](./finetune/lora).
602
 
603
 
604
 
605
- <h2 id="4">4. Limitations</h2>
606
 
607
  Due to time constraints, hardware limitations, and technical reasons, our model has limitations, including but not limited to:
608
 
@@ -616,7 +299,7 @@ Due to time constraints, hardware limitations, and technical reasons, our model
616
 
617
 
618
 
619
- <h2 id="5">5. TODO List</h2>
620
 
621
  - Instruction tuning using full tuning instead of LoRA version is being trained and will be released soon.
622
  - New instruction tuning weights using LoRA will be updated shortly.
@@ -627,7 +310,7 @@ Due to time constraints, hardware limitations, and technical reasons, our model
627
 
628
 
629
 
630
- <h2 id="6">6. FAQ</h2>
631
 
632
  - Question: What should I do if the model encounters � during decoding?
633
 
@@ -641,9 +324,9 @@ Due to time constraints, hardware limitations, and technical reasons, our model
641
 
642
  Answer: Please try changing the decoding parameters.
643
 
644
- <h2 id="7">7. Others</h2>
645
 
646
- <h3 id="7-1">7.1 Contributors(In Random Order)</h3>
647
 
648
  Pretraining:Xiang Chen, Jintian Zhang, Xiaozhuan Liang
649
 
@@ -658,7 +341,7 @@ Model Editing and Safety:Yunzhi Yao, Peng Wang, Siyuan Cheng, Bozhong Tian, Me
658
  Model Testing and Deployment:Yinuo Jiang, Yuqi Zhu, Hongbin Ye, Zekun Xi, Xinrong Li
659
 
660
 
661
- <h3 id="7-2">7.2 Citation</h3>
662
 
663
  If you use our repository, please cite the following related papers:
664
 
@@ -674,7 +357,7 @@ If you use our repository, please cite the following related papers:
674
  ```
675
 
676
 
677
- <h3 id="7-3">7.3 Acknowledgment</h3>
678
 
679
  We are very grateful to the following open source projects for their help:
680
 
 
46
  - \[**June 2023**\] The project name has been changed from CaMA to KnowLM.
47
  - \[**June 2023**\] Release the first version of pre-trained weights and the LoRA weights.
48
 
 
 
 
49
  ## Contents
50
 
51
+ - [Quick Start](#1)
52
+ - [Environment Configuration](#1-1)
53
+ - [Model Weight(Pretrain and LoRA)](#1-2)
54
+ - [Model Usage Guide](#1-4)
55
+ - [Information Extraction Prompt](#1-5)
56
+ - [Training Details](#2)
57
+ - [Pertraining data and Pretraining scripts](#2-1)
58
+ - [Instruction data and Instruction-tuning scripts](#2-3)
59
+ - [Limitations](#3)
60
+ - [TODO List](#4)
61
+ - [FAQ](#5)
62
+ - [Acknowledgments/Contributors/Citations](#6)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
63
 
64
 
65
+ <h2 id="1">1. Quick Start</h2>
66
 
67
+ <h3 id="1-1">1.1 Environment Configuration</h3>
68
 
69
  ```shell
70
  conda create -n zhixi python=3.9 -y
 
74
  ```
75
 
76
 
77
+ <h3 id="1-2">1.2 Pretraining model weight acquisition and restoration</h3>
78
 
79
  ❗❗❗ Note that in terms of hardware, performing step `2.2`, which involves merging LLaMA-13B with ZhiXI-13B-Diff, requires approximately **100GB** of RAM, with no demand for VRAM (this is due to the memory overhead caused by our merging strategy. For your convenience, we have provided the fp16 weights at this link: https://huggingface.co/zjunlp/zhixi-13b-diff-fp16. **fp16 weights require less memory but may slightly impact performance**. We will improve our merging approach in future updates, and we are currently developing a 7B model as well, so stay tuned). For step `2.4`, which involves inference using `ZhiXi`, a minimum of **26GB** of VRAM is required.
80
 
 
131
  > ❗NOTE. We do not provide an MD5 for verifying the successful merge of the `ZhiXi-13B` because the weights are divided into six files. We employ the same validation strategy as [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca), which involves performing a sum check on the weights (you can refer to this [link](https://github.com/zjunlp/KnowLLM/blob/main/tools/weight_diff.py#L108)). **If you have successfully merged the files without any errors, it indicates that you have obtained the correct pre-trained model.**
132
 
133
 
134
+ <h3 id="1-3">1.3 Instruction tuning LoRA weight acquisition</h3>
135
 
136
  Use the script file we provided, located at `./tools/download.py`, execute the following command to get the LoRA weight (assuming the saved path is located at `./LoRA`):
137
 
 
143
 
144
 
145
 
146
+ <h3 id="1-4">1.4 Model Usage Guide</h3>
147
 
148
  **1. Reproduce the results in Section 1**
149
 
 
217
 
218
 
219
 
220
+ <h3 id="1-5">1.5 Information Extraction Prompt</h3>
221
 
222
  For information extraction tasks such as named entity recognition (NER), event extraction (EE), and relation extraction (RE), we provide some prompts for ease of use. You can refer to this [link](https://github.com/zjunlp/KnowLM/blob/main/examples/ie_prompt.py) for examples. Of course, you can also try using your own prompts.
223
 
224
  Here is a [case](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/README.md) where ZhiXi-13B-LoRA is used to accomplish the instruction-based knowledge graph construction task in CCKS2023.
225
 
226
 
227
+ <h2 id="2">2. Training Details</h2>
228
 
229
  > The following figures illustrates the entire training process and dataset construction. The training process is divided into two stages:
230
  >
 
234
 
235
  ![](https://github.com/zjunlp/KnowLM/blob/main/assets/main_new.jpg?raw=true)
236
 
237
+ <h3 id="2-1">2.1 Dataset Construction (Pretraining)</h3>
238
 
239
  In order to enhance the model's understanding of Chinese while preserving its original code and English language capabilities, we did not expand the vocabulary. Instead, we collected Chinese corpora, English corpora, and code corpora. The Chinese corpora were sourced from Baidu Baike, Wudao, and Chinese Wikipedia. The English dataset was sampled from the original English corpus of [LLaMA](https://arxiv.org/pdf/2302.13971.pdf), with the exception of the Wikipedia data. The original paper's English Wikipedia data was up until August 2022, and **we additionally crawled data from September 2022 to February 2023, covering a total of six months.** As for the code dataset, due to the low-quality code in the `Pile` dataset, we crawled code data from GitHub and LeetCode. A portion of the data was used for pre-training, while another portion was used for fine-tuning with instructions.
240
 
241
  For the crawled datasets mentioned above, we employed a heuristic approach to filter out harmful content. Additionally, we removed duplicate data.
242
 
243
+ <h3 id="2-2">2.2 Training Process (Pretraining)</h3>
244
 
245
  Detailed data processing code, training code, complete training scripts, and detailed training results can be found in [./pretrain](https://github.com/zjunlp/KnowLM/blob/main/pretrain).
246
 
 
257
 
258
 
259
 
260
+ <h3 id="2-3">2.3 Dataset Construction (Instruction tuning)</h3>
261
 
262
  In addition to incorporating general capabilities such as reasoning and coding, we have also introduced additional information extraction abilities, including NER (Named Entity Recognition), IE (Information Extraction), and EE (Event Extraction), into the current homogeneous models. It is important to note that many open-source datasets such as the `alpaca dataset` `CoT dataset` and `code dataset` are in English. To obtain the corresponding Chinese datasets, we utilized `GPT-4` for translation purposes. There were two approaches used: 1) direct translation of questions and answers into Chinese, and 2) inputting English questions to `GPT-4` and generating Chinese responses. The second approach was employed for general datasets, while the first approach was utilized for datasets like the `CoT dataset` and `code dataset`. These datasets are readily available online.
263
 
 
279
  <a href="" target="_blank"><img src="https://github.com/zjunlp/KnowLM/blob/main/assets/kg2instructions-en.png?raw=true"style="width: 90%; min-width: 90px; display: block; margin: auto;"></a>
280
  </p>
281
 
282
+ <h3 id="2-4">2.4 Training Process (Instruction tuning)</h3>
283
 
284
  Currently, most instruction tuning scripts using LoRA are based on [alpaca-lora](https://github.com/tloen/alpaca-lora/), so we will not go into detail here. Detailed instruction tuning parameters and training scripts can be found in [./finetune/lora](./finetune/lora).
285
 
286
 
287
 
288
+ <h2 id="3">3. Limitations</h2>
289
 
290
  Due to time constraints, hardware limitations, and technical reasons, our model has limitations, including but not limited to:
291
 
 
299
 
300
 
301
 
302
+ <h2 id="4">4. TODO List</h2>
303
 
304
  - Instruction tuning using full tuning instead of LoRA version is being trained and will be released soon.
305
  - New instruction tuning weights using LoRA will be updated shortly.
 
310
 
311
 
312
 
313
+ <h2 id="5">5. FAQ</h2>
314
 
315
  - Question: What should I do if the model encounters � during decoding?
316
 
 
324
 
325
  Answer: Please try changing the decoding parameters.
326
 
327
+ <h2 id="7">6. Others</h2>
328
 
329
+ <h3 id="7-1">6.1 Contributors(In Random Order)</h3>
330
 
331
  Pretraining:Xiang Chen, Jintian Zhang, Xiaozhuan Liang
332
 
 
341
  Model Testing and Deployment:Yinuo Jiang, Yuqi Zhu, Hongbin Ye, Zekun Xi, Xinrong Li
342
 
343
 
344
+ <h3 id="7-2">6.2 Citation</h3>
345
 
346
  If you use our repository, please cite the following related papers:
347
 
 
357
  ```
358
 
359
 
360
+ <h3 id="7-3">6.3 Acknowledgment</h3>
361
 
362
  We are very grateful to the following open source projects for their help:
363