Update README.md
README.md
@@ -15,7 +15,8 @@ contains less than 100 tokens). The maximum token size for Orca2 is 4096 so a si
 (considering prompt instructions) has been used. Chunking did not consider context (text data might be split within a context).
 Evaluation set has been generated with a similar method on 1% of the raw data with Llama2 chat (https://huggingface.co/TheBloke/Llama-2-13B-chat-GGUF).

-Trained locally on 2x3090 GPUs with vanilla DDP via HuggingFace Accelerate for 50 epochs.
+Trained locally on 2x3090 GPUs with vanilla DDP via HuggingFace Accelerate for 50 epochs.
+As I wanted to add new knowledge to the base model, r=128 and lora_alpha=128 have been used -> the LoRA weights are 3.5% of the base model.


 Sample code to chat with the model:
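
The sample code itself is not part of this hunk. Below is a minimal sketch of what chatting with the fine-tuned model could look like, assuming the LoRA adapter is loaded with PEFT on top of an Orca2 checkpoint; the checkpoint name, adapter path, and prompt format are placeholders, not taken from the repository.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Placeholder names: the hunk does not show the real checkpoint or adapter path.
BASE = "microsoft/Orca-2-13b"
ADAPTER = "path/to/lora-adapter"

tokenizer = AutoTokenizer.from_pretrained(BASE)
base = AutoModelForCausalLM.from_pretrained(BASE, torch_dtype=torch.float16, device_map="auto")
model = PeftModel.from_pretrained(base, ADAPTER)  # attach the fine-tuned LoRA weights

prompt = "User: Tell me about the newly added domain.\nAssistant:"  # prompt format assumed
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200, do_sample=False)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```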
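The earlier steps in the hunk can be sketched similarly. First, the chunking it describes (the 4096-token Orca2 limit minus a budget for the prompt instructions, with no context-aware splitting); the checkpoint name and the 128-token prompt budget are assumptions:

```python
from transformers import AutoTokenizer

MAX_TOKENS = 4096    # Orca2 context size, per the README
PROMPT_BUDGET = 128  # assumed allowance for the prompt instructions

tokenizer = AutoTokenizer.from_pretrained("microsoft/Orca-2-13b")  # checkpoint assumed

def chunk_text(text, budget=MAX_TOKENS - PROMPT_BUDGET):
    """Greedy fixed-size chunking; as the README notes, it may split mid-context."""
    ids = tokenizer(text, add_special_tokens=False)["input_ids"]
    return [tokenizer.decode(ids[i:i + budget]) for i in range(0, len(ids), budget)]
```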
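The evaluation-set generation could look like the sketch below; only "Llama2 chat on 1% of the raw data" comes from the README, so the GGUF file name, the input file, and the generation prompt are all assumptions:

```python
import random
from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(model_path="llama-2-13b-chat.Q4_K_M.gguf", n_ctx=4096)  # file name assumed

chunks = open("raw_chunks.txt", encoding="utf-8").read().split("\n\n")  # hypothetical dump
random.seed(0)
sample = random.sample(chunks, k=max(1, len(chunks) // 100))  # 1% of the raw data

eval_set = []
for chunk in sample:
    reply = llm.create_chat_completion(
        messages=[{"role": "user",
                   "content": f"Write one question and its answer based on this text:\n{chunk}"}])
    eval_set.append({"context": chunk,
                     "qa": reply["choices"][0]["message"]["content"]})
```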
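The added line about r=128 and lora_alpha=128 maps directly onto a PEFT LoraConfig; the target modules are an assumption, and print_trainable_parameters() is how one would check the ~3.5% figure the README quotes:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("microsoft/Orca-2-13b")  # checkpoint assumed

config = LoraConfig(
    r=128,                                # rank, per the README
    lora_alpha=128,                       # scaling, per the README
    target_modules=["q_proj", "v_proj"],  # assumed; the README does not list modules
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # should report roughly 3.5% trainable
```

A rank this high gives the adapter far more capacity than the usual r=8 or r=16, which fits the stated goal of injecting new knowledge rather than only adjusting style.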
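Finally, the vanilla-DDP training loop with HuggingFace Accelerate; a toy model stands in for the LoRA-wrapped Orca2, and only the two-GPU DDP setup and the 50 epochs come from the README:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

# Launch with: accelerate launch --multi_gpu train.py  (one process per GPU -> plain DDP)
accelerator = Accelerator()

model = torch.nn.Linear(16, 1)  # toy stand-in for the LoRA-wrapped model
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
loader = DataLoader(TensorDataset(torch.randn(256, 16), torch.randn(256, 1)),
                    batch_size=8, shuffle=True)

# prepare() wraps the model in DistributedDataParallel and shards the dataloader
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

for epoch in range(50):  # 50 epochs, as in the README
    for x, y in loader:
        loss = torch.nn.functional.mse_loss(model(x), y)
        accelerator.backward(loss)  # synchronizes gradients across the two ranks
        optimizer.step()
        optimizer.zero_grad()
```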