Update README.md
README.md CHANGED
@@ -33,7 +33,18 @@ The project's `initial phase` introduced a knowledge extraction LLM based on LLa
 - The **full-scale pre-training code** (providing conversion, construction, and loading of large corpora) and **LoRA instruction fine-tuning code** are open-sourced (support multi-machine multi-GPU).


-All weights have been uploaded to
+All weights have been uploaded to HuggingFace🤗. Note that all of the results below are based on `ZhiXi-13B-Diff`; if you downloaded `ZhiXi-13B-Diff-fp16`, the results may vary slightly.
+| Model Name | Training Method | Weight Type | Size | Download Link | Notes |
+| --- | --- | --- | --- | --- | --- |
+| ZhiXi-13B-Diff | Full Pretraining | Differential Weights | 48GB | [HuggingFace](https://huggingface.co/zjunlp/zhixi-13b-diff) <br/> [Google Drive](https://drive.google.com/drive/folders/1PZDqZNaBJYQYeON1-9aFBtagktEWAtUK?usp=drive_link) | Restoring the pre-trained weights (i.e., **ZhiXi-13B**) requires the matching `LLaMA-13B` weights; please refer to [here](#2-2) for specific instructions. |
+| ZhiXi-13B-Diff-fp16 | Full Pretraining | Differential Weights (fp16) | 24GB | [HuggingFace](https://huggingface.co/zjunlp/zhixi-13b-diff-fp16) <br/> [Google Drive](https://drive.google.com/drive/folders/1LYm-HUSSQ5Rl8nqZcswdiSpcP9xYTXaO?usp=sharing) | The main difference from `ZhiXi-13B-Diff` is that the weights are stored in `fp16` format, which reduces memory usage but may deviate slightly from the weights obtained in our actual training and slightly impact performance. Please refer to [here](#2-2) for specific usage instructions. |
+| ZhiXi-13B-LoRA | LoRA Instruction-tuning | LoRA Weights | 251MB | [HuggingFace](https://huggingface.co/zjunlp/zhixi-13b-lora) <br/> [Google Drive](https://drive.google.com/drive/folders/1GLyaWIyDIayudrQhb_tJYoNPAUk1xByS?usp=drive_link) | It must be used together with **ZhiXi-13B**; please refer to [here](#2-4) for specific instructions. |
+| ZhiXi-7B Series | Coming soon | Coming soon | Coming soon | Coming soon | Coming soon |
+
+
+## NEWS
+- \[**June 2023**\] The project name has been changed from CaMA to KnowLM.
+- \[**June 2023**\] Released the first version of the pre-trained weights and the LoRA weights.

 ## Contents

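The notes in the table above describe two steps: restoring **ZhiXi-13B** from `ZhiXi-13B-Diff` plus the original `LLaMA-13B` weights (see [here](#2-2)), and then attaching `ZhiXi-13B-LoRA` on top of the restored model (see [here](#2-4)). The snippet below is only a minimal sketch of that workflow, assuming the differential weights are applied by element-wise addition and using generic 🤗 Transformers / PEFT calls; the paths are placeholders, and the repository's own conversion scripts remain the reference.

```python
# Minimal sketch (not the repository's script): restore ZhiXi-13B from the
# differential weights, then attach the LoRA adapter. Paths are placeholders.
import torch
from transformers import LlamaForCausalLM
from peft import PeftModel

# Load the original LLaMA-13B weights and the ZhiXi-13B-Diff differential weights.
# fp16 halves memory; the fp16 diff may deviate slightly from the fp32 release.
base = LlamaForCausalLM.from_pretrained("path/to/llama-13b", torch_dtype=torch.float16)
diff = LlamaForCausalLM.from_pretrained("zjunlp/zhixi-13b-diff", torch_dtype=torch.float16)

# Assumption: ZhiXi-13B = LLaMA-13B + diff, applied as an element-wise sum over
# tensors with matching names and shapes.
base_sd, diff_sd = base.state_dict(), diff.state_dict()
with torch.no_grad():
    for name, tensor in base_sd.items():
        tensor.add_(diff_sd[name].to(tensor.dtype))

base.save_pretrained("path/to/zhixi-13b")  # the restored ZhiXi-13B

# Attach the instruction-tuning LoRA weights on top of the restored model.
model = PeftModel.from_pretrained(base, "zjunlp/zhixi-13b-lora")
model.eval()
```

The actual scripts may differ in details (for example, how tensors are matched or verified), so treat this purely as an illustration of the workflow.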
@@ -606,6 +617,9 @@ Due to time constraints, hardware limitations, and technical reasons, our model
 
 - Instruction tuning with full fine-tuning (instead of the LoRA version) is in progress and will be released soon.
 - New instruction-tuning weights using LoRA will be updated shortly.
+- New models (LLaMA-7B, Falcon-7B) are being trained (we have limited GPUs!).
+- New abilities such as molecule and protein generation with [Mol-Instructions](https://github.com/zjunlp/Mol-Instructions), a large-scale biomolecule instruction dataset for large language models.
+- Support for llama.cpp.
 - ......


@@ -626,7 +640,7 @@ Due to time constraints, hardware limitations, and technical reasons, our model
 
 <h2 id="7">7. Others</h2>

-<h3 id="7-1">7.1 Contributors(
+<h3 id="7-1">7.1 Contributors (In Random Order)</h3>

 Pretraining: Xiang Chen, Jintian Zhang, Xiaozhuan Liang

@@ -638,7 +652,7 @@ Tool learning and Multimodal:Shuofei Qiao, Yixin Ou, Lei Li
 
 Model Editing and Safety: Yunzhi Yao, Peng Wang, Siyuan Cheng, Bozhong Tian, Mengru Wang, Zhoubo Li

-Model Testing and Deployment: Yinuo Jiang, Yuqi Zhu, Hongbin Ye, Zekun Xi
+Model Testing and Deployment: Yinuo Jiang, Yuqi Zhu, Hongbin Ye, Zekun Xi, Xinrong Li


 <h3 id="7-2">7.2 Citation</h3>
@@ -647,7 +661,7 @@ If you use our repository, please cite the following related papers:
 
 ```bibtex
 @article{cama,
-author = {Jintian Zhang, Xiaohan Wang, Honghao Gui, Xiang Chen, Yinuo Jiang, Zhen Bi, Jing Chen, Shengyu Mao, Shuofei Qiao, Xiaozhuan Liang, Yixin Ou,
+author = {Jintian Zhang, Ningyu Zhang, Xiaohan Wang, Honghao Gui, Xiang Chen, Yinuo Jiang, Zhen Bi, Jing Chen, Shengyu Mao, Shuofei Qiao, Xiaozhuan Liang, Yixin Ou, Runnan Fang, Zekun Xi, Xin Xu, Huajun Chen},
 title = {DeepKE-LLM: A Large Language Model Based Knowledge Extraction Toolkit},
 year = {2023},
 publisher = {GitHub},
@@ -657,7 +671,6 @@ If you use our repository, please cite the following related papers:
 ```


-
 <h3 id="7-3">7.3 Acknowledgment</h3>

 We are very grateful to the following open source projects for their help: