QuantFactory/Qwen2.5-Math-14B-Instruct-GGUF

This is a quantized version of qingy2019/Qwen2.5-Math-14B-Instruct, created using llama.cpp.
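Because the files are in GGUF format, they can be run with llama.cpp's `llama-cli` tool. The sketch below only assembles and prints such a command and does not execute it. The file name `Qwen2.5-Math-14B-Instruct.Q4_K_M.gguf` and the prompt are hypothetical placeholders; check the repository's file list for the actual names.

```python
import shlex


def build_llama_cli_command(model_path, prompt, max_tokens=256):
    """Return the argv list for a one-shot llama-cli generation.

    -m selects the GGUF file, -p sets the prompt, and -n caps the
    number of tokens to generate.
    """
    return [
        "llama-cli",
        "-m", model_path,
        "-p", prompt,
        "-n", str(max_tokens),
    ]


# Hypothetical file name: replace with a file actually present in the repo.
MODEL_PATH = "Qwen2.5-Math-14B-Instruct.Q4_K_M.gguf"

command = build_llama_cli_command(MODEL_PATH, "Solve for x: 2x + 3 = 11.")

# Print a shell-safe version of the command for copy and paste.
print(shlex.join(command))
```

Building the command as a list keeps the prompt correctly quoted if it is later passed to `subprocess.run`.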

Original Model Card

Uploaded model

Developed by: qingy2019
License: apache-2.0
Finetuned from model: unsloth/qwen2.5-14b-instruct-bnb-4bit

This Qwen 2.5 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

I fine-tuned it for 400 steps on garage-bAInd/Open-Platypus with a batch size of 3.
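To put the training budget in perspective, the number of examples processed can be estimated from the step count and batch size. This is a rough sketch: the card does not state gradient accumulation or the number of devices, so both are assumed to be 1 here, and the helper `samples_seen` is a hypothetical name used only for illustration.

```python
def samples_seen(steps, batch_size, grad_accum_steps=1, num_devices=1):
    """Estimate how many training examples were processed.

    grad_accum_steps and num_devices default to 1 because the model
    card does not report them (an assumption, not a stated fact).
    """
    return steps * batch_size * grad_accum_steps * num_devices


total = samples_seen(steps=400, batch_size=3)
print(total)  # 1200
```

If gradient accumulation was used during training, the true figure scales up by that factor.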

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	36.71
IFEval (0-Shot)	60.66
BBH (3-Shot)	47.02
MATH Lvl 5 (4-Shot)	28.47
GPQA (0-shot)	16.33
MuSR (0-shot)	19.63
MMLU-PRO (5-shot)	48.12
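As a quick sanity check, the reported average can be compared with the mean of the six benchmark scores in the table. The sketch uses `Decimal` to avoid floating-point rounding noise. The leaderboard presumably averages unrounded scores, so agreement is only expected to within rounding.

```python
from decimal import Decimal

# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": Decimal("60.66"),
    "BBH (3-Shot)": Decimal("47.02"),
    "MATH Lvl 5 (4-Shot)": Decimal("28.47"),
    "GPQA (0-shot)": Decimal("16.33"),
    "MuSR (0-shot)": Decimal("19.63"),
    "MMLU-PRO (5-shot)": Decimal("48.12"),
}

reported_avg = Decimal("36.71")
mean = sum(scores.values()) / len(scores)

print(mean)  # 36.705
# The reported value matches the recomputed mean to within half a unit
# in the last reported decimal place.
print(abs(mean - reported_avg) <= Decimal("0.005"))  # True
```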


GGUF

Model size: 14.8B params
Architecture: qwen2

Available quantization levels: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
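A rough lower bound on the weight size at each bit width follows from the 14.8B parameter count listed on this card: parameters times bits per weight, divided by 8 bits per byte. This is a back-of-the-envelope sketch, not the actual file sizes. Real GGUF quantization types store scales and keep some tensors at higher precision, so the files are typically somewhat larger, and inference needs extra memory for the KV cache.

```python
PARAMS = 14.8e9  # parameter count from the model card
BIT_WIDTHS = [2, 3, 4, 5, 6, 8]  # nominal bit widths listed on the card


def nominal_size_gb(num_params, bits_per_weight):
    """Nominal weight size in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


estimates = {bits: nominal_size_gb(PARAMS, bits) for bits in BIT_WIDTHS}

for bits, size_gb in estimates.items():
    print(f"{bits}-bit: ~{size_gb:.2f} GB")
```

Under these assumptions the 4-bit variant needs about 7.4 GB for weights alone, and the 8-bit variant about 14.8 GB.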

Evaluation results

Benchmark	Metric	Value	Source
IFEval (0-Shot)	strict accuracy	60.66	Open LLM Leaderboard
BBH (3-Shot)	normalized accuracy	47.02	Open LLM Leaderboard
MATH Lvl 5 (4-Shot)	exact match	28.47	Open LLM Leaderboard
GPQA (0-shot)	acc_norm	16.33	Open LLM Leaderboard
MuSR (0-shot)	acc_norm	19.63	Open LLM Leaderboard
MMLU-PRO (5-shot)	accuracy (test set)	48.12	Open LLM Leaderboard