QuantFactory/calme-2.8-qwen2-7b-GGUF
This is quantized version of MaziyarPanahi/calme-2.8-qwen2-7b created using llama.cpp
Original Model Card
MaziyarPanahi/calme-2.8-qwen2-7b
This is a fine-tuned version of the Qwen/Qwen2-7B
model. It aims to improve the base model across all benchmarks.
β‘ Quantized GGUF
All GGUF models are available here: MaziyarPanahi/calme-2.8-qwen2-7b-GGUF
π Open LLM Leaderboard Evaluation Results
coming soon!
Prompt Template
This model uses ChatML
prompt template:
<|im_start|>system
{System}
<|im_end|>
<|im_start|>user
{User}
<|im_end|>
<|im_start|>assistant
{Assistant}
How to use
# Use a pipeline as a high-level helper
from transformers import pipeline
messages = [
{"role": "user", "content": "Who are you?"},
]
pipe = pipeline("text-generation", model="MaziyarPanahi/calme-2.8-qwen2-7b")
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("MaziyarPanahi/calme-2.8-qwen2-7b")
model = AutoModelForCausalLM.from_pretrained("MaziyarPanahi/calme-2.8-qwen2-7b")
- Downloads last month
- 333
Model tree for QuantFactory/calme-2.8-qwen2-7b-GGUF
Base model
Qwen/Qwen2-7B