miscii-14b-0218
Image source: The Angel’s Message
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the Model Stock merge method using /Users/sthenno/models/tempesthenno-ppo-enchanted as a base.
Models Merged
The following models were included in the merge:
- /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt40
- /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt50
- /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt60
- /Users/sthenno/models/tempesthenno-sft-0218-ckpt60
- /Users/sthenno/models/tempesthenno-sft-0218-ckpt80
Configuration
The following YAML configuration was used to produce this model:
name: tempesthenno-ms-0218
merge_method: model_stock
base_model: /Users/sthenno/models/tempesthenno-ppo-enchanted
tokenizer:
source: base
dtype: float32
out_dtype: bfloat16
parameters:
int8_mask: true
normalize: true
rescale: false
models:
- model: /Users/sthenno/models/tempesthenno-sft-0218-ckpt60
- model: /Users/sthenno/models/tempesthenno-sft-0218-ckpt80
- model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt40
- model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt50
- model: /Users/sthenno/models/tempesthenno-sft-0218-stage2-ckpt60
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 42.90 |
IFEval (0-Shot) | 76.56 |
BBH (3-Shot) | 50.64 |
MATH Lvl 5 (4-Shot) | 51.44 |
GPQA (0-shot) | 17.79 |
MuSR (0-shot) | 13.21 |
MMLU-PRO (5-shot) | 47.75 |
- Downloads last month
- 79
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for sthenno-com/miscii-14b-0218
Merge model
this model
Evaluation results
- strict accuracy on IFEval (0-Shot)Open LLM Leaderboard76.560
- normalized accuracy on BBH (3-Shot)Open LLM Leaderboard50.640
- exact match on MATH Lvl 5 (4-Shot)Open LLM Leaderboard51.440
- acc_norm on GPQA (0-shot)Open LLM Leaderboard17.790
- acc_norm on MuSR (0-shot)Open LLM Leaderboard13.210
- accuracy on MMLU-PRO (5-shot)test set Open LLM Leaderboard47.750