Upload 11 files
Browse files- .gitattributes +10 -0
- README.md +43 -0
- Vikhr-Qwen-2.5-0.5B-instruct-BF16.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-F16.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-F32.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q2_K.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q3_K_M.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q4_0.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q4_K_M.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q5_K_M.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q6_K.gguf +3 -0
- Vikhr-Qwen-2.5-0.5B-instruct-Q8_0.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,13 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
Vikhr-Qwen-2.5-0.5B-instruct-BF16.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
Vikhr-Qwen-2.5-0.5B-instruct-F16.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
Vikhr-Qwen-2.5-0.5B-instruct-F32.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
Vikhr-Qwen-2.5-0.5B-instruct-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,43 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
library_name: llamacpp
|
3 |
+
model_name: Vikhr-Qwen-2.5-0.5B-instruct
|
4 |
+
base_model:
|
5 |
+
- Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct
|
6 |
+
language:
|
7 |
+
- ru
|
8 |
+
- en
|
9 |
+
license: apache-2.0
|
10 |
+
tags:
|
11 |
+
- instruct
|
12 |
+
datasets:
|
13 |
+
- Vikhrmodels/GrandMaster-PRO-MAX
|
14 |
+
pipeline_tag: text-generation
|
15 |
+
---
|
16 |
+
|
17 |
+
# 💨📟 Vikhr-Qwen-2.5-0.5B-Instruct
|
18 |
+
|
19 |
+
#### RU
|
20 |
+
|
21 |
+
Инструктивная модель на основе **Qwen-2.5-0.5B-Instruct**, обученная на русскоязычном датасете **GrandMaster-PRO-MAX**. В **4 раза эффективнее** базовой модели, и идеально подходит для запуска на слабых мобильных устройствах.
|
22 |
+
|
23 |
+
#### EN
|
24 |
+
|
25 |
+
Instructive model based on **Qwen-2.5-0.5B-Instruct**, trained on the Russian-language dataset **GrandMaster-PRO-MAX**. It is **4 times more efficient** than the base model, making it perfect for deployment on low-end mobile devices.
|
26 |
+
|
27 |
+
- [HF model](https://huggingface.co/Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct)
|
28 |
+
|
29 |
+
**Рекомендуемая температура для генерации: 0.3** / **Recommended generation temperature: 0.3**.
|
30 |
+
|
31 |
+
### Авторы / Authors
|
32 |
+
- Sergei Bratchikov, [NLP Wanderer](https://t.me/nlpwanderer), [Vikhr Team](https://t.me/vikhrlabs)
|
33 |
+
- Nikolay Kompanets, [LakoMoor](https://t.me/lakomoor), [Vikhr Team](https://t.me/vikhrlabs)
|
34 |
+
- Konstantin Korolev, [Vikhr Team](https://t.me/vikhrlabs)
|
35 |
+
- Aleksandr Nikolich, [Vikhr Team](https://t.me/vikhrlabs)
|
36 |
+
```
|
37 |
+
@article{nikolich2024vikhr,
|
38 |
+
title={Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian},
|
39 |
+
author={Aleksandr Nikolich and Konstantin Korolev and Sergey Bratchikov and Nikolay Kompanets and Artem Shelmanov},
|
40 |
+
journal={arXiv preprint arXiv:2405.13929},
|
41 |
+
year={2024},
|
42 |
+
url={https://arxiv.org/pdf/2405.13929}
|
43 |
+
}
|
Vikhr-Qwen-2.5-0.5B-instruct-BF16.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e332cb46e57b9f98fa2bbfe5d7617f198b764ba59b5ae822d87e4b94f94834ea
|
3 |
+
size 993664960
|
Vikhr-Qwen-2.5-0.5B-instruct-F16.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:76e871ebf241a87024c491f6b7f6c74de0cae982e22684065997b711a2b4b046
|
3 |
+
size 993664960
|
Vikhr-Qwen-2.5-0.5B-instruct-F32.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ef31bb52beed4e5b7a5773c7079e08d66b836b69340a65824ee047f7a713d809
|
3 |
+
size 1981101760
|
Vikhr-Qwen-2.5-0.5B-instruct-Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:561e7f05115d2f0d8d18a5e8eaa52cde61d493c85cecb027a2a610aed3adc3e6
|
3 |
+
size 338343168
|
Vikhr-Qwen-2.5-0.5B-instruct-Q3_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7d50c72fae37820da590be74952d0bf7a2a24d980d574fea253e919b9f3566e4
|
3 |
+
size 355202304
|
Vikhr-Qwen-2.5-0.5B-instruct-Q4_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9416c50be56f36f2c857e698698c01b2e0d7f759719e1f492577609569e1b735
|
3 |
+
size 351890688
|
Vikhr-Qwen-2.5-0.5B-instruct-Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8f45f00da2bcd70205461639e978db7c8245e205c8604cfe2ab7f5430506bf32
|
3 |
+
size 397543680
|
Vikhr-Qwen-2.5-0.5B-instruct-Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7a37b861539603247887fe4044d69535c0230291e9707718dfd106efc6dbe81e
|
3 |
+
size 419821824
|
Vikhr-Qwen-2.5-0.5B-instruct-Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9f728fcb4bb527fec59d993c02850e034fcba86a723f9b515095cfe3549a5550
|
3 |
+
size 505472256
|
Vikhr-Qwen-2.5-0.5B-instruct-Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:43df3b688d17f68321648bbf55c871ee9674350417b630994fa41b73a7557352
|
3 |
+
size 530803968
|