cgus
/

Solar-10.7B-SLERP-exl2

Text Generation

Model card Files Files and versions

cgus commited on Apr 16, 2024

Commit

d1fb75f

·

verified ·

1 Parent(s): f2d2a17

Update README.md

Description and base model link

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ inference: false
 license: apache-2.0
 language:
 - en
 ---
 <!-- header start -->
 <!-- 200823 -->
@@ -22,7 +24,10 @@ Created by: [upstage](https://huggingface.co/upstage)
 [8bpw h8](https://huggingface.co/cgus/Solar-10.7B-SLERP-exl2/tree/8bpw-h8)
 ## Quantization notes
 30.01.2024 Replaced old quants with newer ones that were made with exllamav2-0.0.12, calibrated with default exllamav2 dataset.
 This model uses a lot less memory compared to Llama2-13B models and it *almost* fits 12GB VRAM even at 8bpw.
 But with 8-bit cache it even uses just a tiny bit less than 12GB VRAM.

 license: apache-2.0
 language:
 - en
+base_model:
+- jan-hq/Solar-10.7B-SLERP
 ---
 <!-- header start -->
 <!-- 200823 -->
 [8bpw h8](https://huggingface.co/cgus/Solar-10.7B-SLERP-exl2/tree/8bpw-h8)
 ## Quantization notes
+In my experience, it's one of the best models within 13B range for Slavic languages.
+And overall it has very good language skills. I haven't encountered cases when the model randomly switches language unlike most models I tried.
+It also capable to understand instructions in other languages, a pretty rare feat.
 30.01.2024 Replaced old quants with newer ones that were made with exllamav2-0.0.12, calibrated with default exllamav2 dataset.
 This model uses a lot less memory compared to Llama2-13B models and it *almost* fits 12GB VRAM even at 8bpw.
 But with 8-bit cache it even uses just a tiny bit less than 12GB VRAM.