Code
Collection
Coding Models
•
170 items
•
Updated
This is a merge of pre-trained language models created using mergekit.
Qwen Coder tied to Reasoning LORA.
This model was merged using the Passthrough merge method using unsloth/Qwen2.5-Coder-3B-Instruct + bunnycore/Qwen-2.5-3b-R1-lora_model-v.1 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
base_model: unsloth/Qwen2.5-Coder-3B-Instruct+bunnycore/Qwen-2.5-3b-R1-lora_model-v.1
dtype: bfloat16
merge_method: passthrough
models:
- model: unsloth/Qwen2.5-Coder-3B-Instruct+bunnycore/Qwen-2.5-3b-R1-lora_model-v.1
tokenizer_source: unsloth/Qwen2.5-Coder-3B
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 19.81 |
IFEval (0-Shot) | 35.88 |
BBH (3-Shot) | 25.31 |
MATH Lvl 5 (4-Shot) | 16.39 |
GPQA (0-shot) | 7.16 |
MuSR (0-shot) | 12.14 |
MMLU-PRO (5-shot) | 21.99 |