vietphuon
/

Llama-3.2-1B-Instruct-alpaca-then-quizgen-16bit-neuron

Model card Files Files and versions Community

vietphuon commited on 6 days ago

Commit

d984b0a

•

1 Parent(s): b276d74

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -10,6 +10,6 @@ base_model:
 ---
 # What's new?
-- This is the pre-compiled version of Llama-3.2-1B-Instruct finetuned on our synthetic-hybrid QuizGen dataset.
 - The fine-tuning was performed by Unsloth in quantized 4-bit style then merged to 16-bit (AWS tutorial only show how to compile 16-bit model)
 - Noted that if you want to infer on this model, you need to load the tokenizer from the base model at https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct

 ---
 # What's new?
+- This is the pre-compiled version of Llama-3.2-1B-Instruct finetuned on our synthetic-hybrid QuizGen dataset for searving as a LLM endpoint in AWS Sagemaker
 - The fine-tuning was performed by Unsloth in quantized 4-bit style then merged to 16-bit (AWS tutorial only show how to compile 16-bit model)
 - Noted that if you want to infer on this model, you need to load the tokenizer from the base model at https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct