csabakecskemeti
committed on
Update README.md
README.md
CHANGED
@@ -16,7 +16,7 @@ contains less than 100 tokens). The maximum token size for Orca2 is 4096 so a si
 Evaluation set has been generated similar method on 1% of the raw data with LLama2 chat (https://huggingface.co/TheBloke/Llama-2-13B-chat-GGUF).
 
 Trained locally on 2x3090 GPU with vanila DDP with HuggingFace Accelerate for 50 Epoch.
-As I wanted to add new knowledge to the base model r=128 and lora_alpha=128 has been used -> LoRA weights
+As I wanted to add new knowledge to the base model r=128 and lora_alpha=128 has been used -> LoRA weights were 3.5% of the base model.
 
 
 Chat with model sample code:
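
Note on the changed line: the size of a LoRA adapter relative to its base model follows from the rank `r` and the shapes of the adapted layers. The sketch below shows that arithmetic for a single linear layer, using `r=128` and `lora_alpha=128` from the README. The 4096 x 4096 layer shape is a hypothetical value chosen for illustration, and the helper names are not from the repository. The README does not say which modules were adapted, so this does not attempt to reproduce the 3.5% figure.

```python
def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters LoRA adds to one d_in x d_out linear layer.

    LoRA learns two low-rank factors, A (r x d_in) and B (d_out x r),
    so the count is r * d_in + d_out * r.
    """
    return r * (d_in + d_out)


def lora_scaling(lora_alpha: int, r: int) -> float:
    """Factor applied to the low-rank update B @ A in the standard LoRA form."""
    return lora_alpha / r


# Values stated in the README.
R = 128
LORA_ALPHA = 128

# Hypothetical layer shape, for illustration only (not taken from the README).
D_IN, D_OUT = 4096, 4096

added = lora_param_count(D_IN, D_OUT, R)
base = D_IN * D_OUT

print(f"LoRA params for this layer: {added:,}")
print(f"Share of this layer's weights: {added / base:.2%}")
print(f"Update scaling (lora_alpha / r): {lora_scaling(LORA_ALPHA, R)}")
```

With `lora_alpha` equal to `r` the scaling factor is 1, so the low-rank update is applied unscaled. The overall adapter-to-model ratio is lower than the per-layer share whenever only some of the model's weight matrices are adapted.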