chrisociepa
committed on
Commit 24a5746
1 Parent(s): 9967ed9
Update README.md
README.md
CHANGED
@@ -24,3 +24,7 @@ The training took 20 days on a single RTX 4090 with the following hyperparameter
 * optimizer: paged_adamw_32bit
 
 This adapter allows the model to speak Polish more accurately than vanilla [Llama-2-7b](https://huggingface.co/meta-llama/Llama-2-7b-hf).
+
+<p align="center">
+<img src="https://huggingface.co/Azurro/llama-2-7b-qlora-polish/raw/main/llama-2-7b-qlora-pl.jpg">
+</p>