|
--- |
|
datasets: |
|
- IlyaGusev/saiga_scored |
|
- IlyaGusev/saiga_preferences |
|
language: |
|
- ru |
|
inference: false |
|
license: apache-2.0 |
|
--- |
|
|
|
Llama.cpp compatible versions of an original [12B model](https://huggingface.co/IlyaGusev/saiga_nemo_12b). |
|
|
|
Download one of the versions, for example `saiga_nemo_12b.Q4_K_M.gguf`. |
|
``` |
|
wget https://huggingface.co/IlyaGusev/saiga_nemo_12b_gguf/resolve/main/saiga_nemo_12b.Q4_K_M.gguf |
|
``` |
|
|
|
Download [interact_llama3_llamacpp.py](https://raw.githubusercontent.com/IlyaGusev/rulm/master/self_instruct/src/interact_llama3_llamacpp.py) |
|
``` |
|
wget https://raw.githubusercontent.com/IlyaGusev/rulm/master/self_instruct/src/interact_llama3_llamacpp.py |
|
``` |
|
|
|
How to run: |
|
``` |
|
pip install llama-cpp-python fire |
|
|
|
python3 interact_llama3_llamacpp.py saiga_nemo_12b.Q4_K_M.gguf |
|
``` |
|
|
|
System requirements: |
|
* 15GB RAM for q8_0 and less for smaller quantizations |