s3nh/phi-2_dolly_instruction_polish-GGUF

Quantized GGUF model files for phi-2_dolly_instruction_polish from s3nh

Name	Quant method	Size
phi-2_dolly_instruction_polish.fp16.gguf	fp16	5.56 GB
phi-2_dolly_instruction_polish.q2_k.gguf	q2_k	1.17 GB
phi-2_dolly_instruction_polish.q3_k_m.gguf	q3_k_m	1.48 GB
phi-2_dolly_instruction_polish.q4_k_m.gguf	q4_k_m	1.79 GB
phi-2_dolly_instruction_polish.q5_k_m.gguf	q5_k_m	2.07 GB
phi-2_dolly_instruction_polish.q6_k.gguf	q6_k	2.29 GB
phi-2_dolly_instruction_polish.q8_0.gguf	q8_0	2.96 GB

Original Model Card:

This model is a fine-tuned version of microsoft/phi-2 on the None dataset. It achieves the following results on the evaluation set:

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
No log	0.0	1	1.7973
1.9767	0.25	5290	1.4832
1.8474	0.5	10580	1.4356
1.8121	0.75	15870	1.4022
1.8333	1.0	21160	1.3678
1.6601	1.25	26450	1.3508
1.5452	1.5	31740	1.3357
1.7381	1.75	37030	1.3191
1.6256	2.0	42320	1.3090
1.5521	2.25	47610	1.2961
1.8318	2.5	52900	1.2910
1.6761	2.75	58190	1.2901
1.6312	3.0	63480	1.2879
1.7003	3.25	68770	1.2820
1.6915	3.5	74060	1.2814
1.5757	3.75	79350	1.2813

The following bitsandbytes quantization config was used during training: