vietphuon
/

Llama-3.2-1B-Instruct-alpaca-then-quizgen-16bit-neuron

Model card Files Files and versions Community

Llama-3.2-1B-Instruct-alpaca-then-quizgen-16bit-neuron / README.md

vietphuon's picture

Upload model trained with Unsloth

72aa3fc verified 3 days ago

|

669 Bytes

	---
	base_model:
	- meta-llama/Llama-3.2-1B-Instruct
	- vietphuon/Llama-3.2-1B-Instruct-alpaca-then-quizgen-16bit
	datasets:
	- rockship/quizgen
	language:
	- en
	license: apache-2.0
	tags:
	- unsloth
	---

	# What's new?
	- This is the pre-compiled version of Llama-3.2-1B-Instruct finetuned on our synthetic-hybrid QuizGen dataset for searving as a LLM endpoint in AWS Sagemaker
	- The fine-tuning was performed by Unsloth in quantized 4-bit style then merged to 16-bit (AWS tutorial only show how to compile 16-bit model)
	- Noted that if you want to infer on this model, you need to load the tokenizer from the base model at https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct