Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,6 @@ base_model:
|
|
10 |
---
|
11 |
|
12 |
# What's new?
|
13 |
-
- This is the pre-compiled version of Llama-3.2-1B-Instruct finetuned on our synthetic-hybrid QuizGen dataset
|
14 |
- The fine-tuning was performed by Unsloth in quantized 4-bit style then merged to 16-bit (AWS tutorial only show how to compile 16-bit model)
|
15 |
- Noted that if you want to infer on this model, you need to load the tokenizer from the base model at https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct
|
|
|
10 |
---
|
11 |
|
12 |
# What's new?
|
13 |
+
- This is the pre-compiled version of Llama-3.2-1B-Instruct finetuned on our synthetic-hybrid QuizGen dataset for searving as a LLM endpoint in AWS Sagemaker
|
14 |
- The fine-tuning was performed by Unsloth in quantized 4-bit style then merged to 16-bit (AWS tutorial only show how to compile 16-bit model)
|
15 |
- Noted that if you want to infer on this model, you need to load the tokenizer from the base model at https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct
|