Merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Qwen Coder tied to Reasoning LORA.

Merge Method

This model was merged using the Passthrough merge method using unsloth/Qwen2.5-Coder-3B-Instruct + bunnycore/Qwen-2.5-3b-R1-lora_model-v.1 as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

base_model: unsloth/Qwen2.5-Coder-3B-Instruct+bunnycore/Qwen-2.5-3b-R1-lora_model-v.1
dtype: bfloat16
merge_method: passthrough
models:
  - model: unsloth/Qwen2.5-Coder-3B-Instruct+bunnycore/Qwen-2.5-3b-R1-lora_model-v.1
tokenizer_source: unsloth/Qwen2.5-Coder-3B

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 19.81
IFEval (0-Shot) 35.88
BBH (3-Shot) 25.31
MATH Lvl 5 (4-Shot) 16.39
GPQA (0-shot) 7.16
MuSR (0-shot) 12.14
MMLU-PRO (5-shot) 21.99
Downloads last month
20
Safetensors
Model size
3.09B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for Triangle104/Q2.5-CodeR1-3B

Collections including Triangle104/Q2.5-CodeR1-3B

Evaluation results