QLoRA Finetune Llama 3 Instruct 8B + OpenHermes 2.5

This model is based on Llama-3-8b and is governed by the Meta Llama 3 Community License Agreement.

Llama 3 Instruct 8B 4-bit from unsloth, finetuned with QLoRA on the OpenHermes 2.5 dataset on my home PC, using a single 24GB 4090.

Special care was taken to preserve and reinforce the proper EOS token structure.

Chat with llama.cpp

llama.cpp/main -ngl 33 -c 0 --interactive-first --color -e --in-prefix '<|start_header_id|>user<|end_header_id|>\n\n' --in-suffix '<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n' -r '<|eot_id|>' -m ./llama-3-8b-Instruct-OpenHermes-2.5-QLoRA.Q4_K_M.gguf
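The prompt layout implied by the --in-prefix, --in-suffix and -r flags above can be sketched in a few lines of Python. This is only an illustration of the turn structure, not the training or inference code used for this model. The helper names (format_turn, format_chat, strip_reply) are hypothetical; the special tokens are copied from the command. The sketch leaves out any beginning-of-text token, on the assumption that the runtime adds it.

```python
# Hypothetical helper names; the special tokens are copied from the llama.cpp command.
HEADER_START = "<|start_header_id|>"
HEADER_END = "<|end_header_id|>"
EOT = "<|eot_id|>"


def format_turn(role: str, content: str) -> str:
    """Render one complete turn: role header, blank line, text, end-of-turn token."""
    return f"{HEADER_START}{role}{HEADER_END}\n\n{content}{EOT}"


def format_chat(messages: list[dict[str, str]]) -> str:
    """Render all turns, then open an assistant header so the model replies next."""
    turns = "".join(format_turn(m["role"], m["content"]) for m in messages)
    return f"{turns}{HEADER_START}assistant{HEADER_END}\n\n"


def strip_reply(generated: str) -> str:
    """Keep only the text before the first end-of-turn token."""
    return generated.split(EOT, 1)[0]


if __name__ == "__main__":
    prompt = format_chat([{"role": "user", "content": "Hi"}])
    print(repr(prompt))
```

For a single user message, format_chat produces the in-prefix, the message, and the in-suffix in that order, which is the same string the interactive command assembles around each line you type.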

Format: GGUF
Model size: 8.03B params
Architecture: llama
Quantizations: 4-bit, 16-bit

Model tree for yaystevek/llama-3-8b-Instruct-OpenHermes-2.5-QLoRA-GGUF

Base model: unsloth/llama-3-8b-Instruct-bnb-4bit
Finetuned: yaystevek/llama-3-8b-Instruct-OpenHermes-2.5-QLoRA
Quantized (2): this model

Dataset used to train yaystevek/llama-3-8b-Instruct-OpenHermes-2.5-QLoRA-GGUF