Merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

image/webp

RP with some o1 inspiration.

Merge Method

This model was merged using the Model Stock merge method using codelion/Llama-3.3-70B-o1 as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: codelion/Llama-3.3-70B-o1
  - model: TheDrummer/Anubis-70B-v1
  - model: TheDrummer/Nautilus-70B-v0.1
base_model: codelion/Llama-3.3-70B-o1
merge_method: model_stock
parameters:
  normalize: true
dtype: bfloat16

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 44.02
IFEval (0-Shot) 76.43
BBH (3-Shot) 56.88
MATH Lvl 5 (4-Shot) 36.33
GPQA (0-shot) 26.17
MuSR (0-shot) 18.96
MMLU-PRO (5-shot) 49.36
Downloads last month
38
Safetensors
Model size
70.6B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for Triangle104/Set-70b

Collections including Triangle104/Set-70b

Evaluation results