tr416 committed on
Commit 62d076e
Parent: a05b67a

Update README.md

Files changed (1)
  README.md +1 -1

README.md CHANGED
@@ -35,7 +35,7 @@ Denomination specific models like ChristianGPT-catholic are based on ChristianGP
  - GPU with over 16GB of memory (we trained on NVIDIA Tesla V100 32GB and NVIDIA Ampere A6000 45GB)
  - 30GB of RAM (the raw model weights are about 29GB, our models are cast to 8bit to use less memory)
 
- - ## References
+ ## References
  We leaned heavily on open-source libraries like `transformers`, `peft`, and `bitsandbytes` for this project.
  - Dettmers, Tim, Mike Lewis, Younes Belkada, and Luke Zettlemoyer. 2022. "[LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale](https://arxiv.org/abs/2208.07339)." *arXiv preprint arXiv:2208.07339*.
  - Hu, Edward J., Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, and Weizhu Chen. 2021. "[LoRA: Low-Rank Adaptation of Large Language Models](https://arxiv.org/abs/2106.09685)." *arXiv preprint arXiv:2106.09685*.
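
For context on the README's note that the ~29GB raw weights are cast to 8-bit: below is a minimal sketch, not taken from this commit or the repository's training code, of how a causal LM is typically loaded in 8-bit with `transformers` and `bitsandbytes` and wrapped with LoRA adapters via `peft`, the libraries and papers cited in the References section. The base model name, target module names, and LoRA hyperparameters are placeholder assumptions.

```python
# Illustrative sketch only -- not this project's actual code.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model = "your-base-model"  # placeholder: the base checkpoint is not named in this diff

tokenizer = AutoTokenizer.from_pretrained(base_model)

# load_in_8bit quantizes the linear layers with bitsandbytes (LLM.int8()),
# so the model occupies far less GPU memory than its raw-weight size.
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    load_in_8bit=True,
    device_map="auto",
)

# Attach low-rank adapters (LoRA); rank, alpha, and target modules here are
# illustrative, and the module names depend on the base model's architecture.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small adapter matrices are trainable
```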