Merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

RP with some o1 inspiration.

Merge Method

This model was merged using the Model Stock merge method using codelion/Llama-3.3-70B-o1 as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: codelion/Llama-3.3-70B-o1
  - model: TheDrummer/Anubis-70B-v1
  - model: TheDrummer/Nautilus-70B-v0.1
base_model: codelion/Llama-3.3-70B-o1
merge_method: model_stock
parameters:
  normalize: true
dtype: bfloat16

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	44.02
IFEval (0-Shot)	76.43
BBH (3-Shot)	56.88
MATH Lvl 5 (4-Shot)	36.33
GPQA (0-shot)	26.17
MuSR (0-shot)	18.96
MMLU-PRO (5-shot)	49.36

Model tree for Triangle104/Set-70b

Evaluation results

strict accuracy on IFEval (0-Shot)

Open LLM Leaderboard

76.430

normalized accuracy on BBH (3-Shot)

Open LLM Leaderboard

56.880

exact match on MATH Lvl 5 (4-Shot)

Open LLM Leaderboard

36.330

acc_norm on GPQA (0-shot)

Open LLM Leaderboard

26.170

acc_norm on MuSR (0-shot)

Open LLM Leaderboard

18.960

accuracy on MMLU-PRO (5-shot)

test set Open LLM Leaderboard

49.360

Triangle104
/

Set-70b

Merge

Merge Details

Merge Method

Models Merged

Configuration

Open LLM Leaderboard Evaluation Results

Model tree for Triangle104/Set-70b

Collections including Triangle104/Set-70b

Llama

RP

Merges

Evaluation results