metadata

license: apache-2.0
library_name: transformers
tags:
  - mergekit
  - merge
  - not-for-all-audiences
model-index:
  - name: NameLess-12B-prob
    results:
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: IFEval (0-Shot)
          type: HuggingFaceH4/ifeval
          args:
            num_few_shot: 0
        metrics:
          - type: inst_level_strict_acc and prompt_level_strict_acc
            value: 66.02
            name: strict accuracy
        source:
          url: >-
            https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bamec66557/NameLess-12B-prob
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: BBH (3-Shot)
          type: BBH
          args:
            num_few_shot: 3
        metrics:
          - type: acc_norm
            value: 31.36
            name: normalized accuracy
        source:
          url: >-
            https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bamec66557/NameLess-12B-prob
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: MATH Lvl 5 (4-Shot)
          type: hendrycks/competition_math
          args:
            num_few_shot: 4
        metrics:
          - type: exact_match
            value: 11.1
            name: exact match
        source:
          url: >-
            https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bamec66557/NameLess-12B-prob
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: GPQA (0-shot)
          type: Idavidrein/gpqa
          args:
            num_few_shot: 0
        metrics:
          - type: acc_norm
            value: 8.61
            name: acc_norm
        source:
          url: >-
            https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bamec66557/NameLess-12B-prob
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: MuSR (0-shot)
          type: TAUR-Lab/MuSR
          args:
            num_few_shot: 0
        metrics:
          - type: acc_norm
            value: 14.7
            name: acc_norm
        source:
          url: >-
            https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bamec66557/NameLess-12B-prob
          name: Open LLM Leaderboard
      - task:
          type: text-generation
          name: Text Generation
        dataset:
          name: MMLU-PRO (5-shot)
          type: TIGER-Lab/MMLU-Pro
          config: main
          split: test
          args:
            num_few_shot: 5
        metrics:
          - type: acc
            value: 29.83
            name: accuracy
        source:
          url: >-
            https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bamec66557/NameLess-12B-prob
          name: Open LLM Leaderboard
datasets:
  - open-llm-leaderboard/bamec66557__NameLess-12B-prob-details

Z-2-A.TEST-TEMP-MODEL

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.
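
For readers unfamiliar with SLERP (spherical linear interpolation), the idea can be sketched in a few lines of plain Python. This is an illustrative sketch only, not mergekit's actual implementation: the function name `slerp`, the `eps` threshold, and the fallback to linear interpolation for near-parallel or zero-length vectors are assumptions made for this example. In a real merge the same interpolation would be applied to each pair of corresponding weight tensors.

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two equal-length vectors.

    t = 0 returns v0 and t = 1 returns v1. Illustrative sketch only;
    the eps threshold and the lerp fallback are assumptions.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))

    def lerp():
        # Plain linear interpolation, used when the angle is ill-defined
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    if norm0 < eps or norm1 < eps:
        return lerp()

    # Cosine of the angle between the two directions, clamped for safety
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)

    if abs(sin_theta) < eps:
        # Vectors are (anti)parallel, so the arc is degenerate
        return lerp()

    # Weights that move along the arc between v0 and v1
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

For two orthogonal unit vectors, `slerp(0.5, [1.0, 0.0], [0.0, 1.0])` lands halfway along the arc, so both components are approximately 0.7071 and the result keeps unit length, which plain linear interpolation would not.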

Models Merged

The following models were included in the merge:

D:\VICIOUS_MESH-12B-OMEGA
D:\jetreessence

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: "D:\\VICIOUS_MESH-12B-OMEGA"
  - model: "D:\\jetreessence"
merge_method: slerp
base_model: "D:\\VICIOUS_MESH-12B-OMEGA"
dtype: bfloat16
parameters:
  t: [0, 0.5, 1, 0.5, 0]

regularization:
  - method: gradient_penalty
    scale: 0.05
  - method: weight_clipping
    clip_range: [-0.15, 0.15]
  - method: random_noise
    scale: 0.01
  - method: attention_dropout
    scale: 0.02

postprocessing:
  - operation: entropy_regularization
    scale: 0.05
  - operation: non_linear_scaling
    parameters:
      function: relu
  - operation: sharpening
    intensity: 0.6
  - operation: gaussian_smoothing
    sigma: 0.3
  - operation: normalize
  - operation: dynamic_scaling
    scale_range: [0.98, 1.02]
  - operation: smoothing
    parameters:
      adaptive: true
      range: [0.98, 1.02]
      kernel_size: 3
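
The `t: [0, 0.5, 1, 0.5, 0]` entry in the configuration above is a list rather than a single number. As far as mergekit's documentation describes it, a list-valued parameter is treated as a gradient whose anchor values are interpolated across the layers of the model, and for `slerp` a value of `t = 0` keeps the `base_model` weights while `t = 1` takes the other model's. The sketch below shows one way such a gradient could be expanded into per-layer values. The helper `expand_gradient` and the layer count of 9 are hypothetical and chosen only for illustration; the real layer count of this model is not stated here.

```python
def expand_gradient(anchors, num_layers):
    """Linearly interpolate a short list of anchor values across layers.

    Hypothetical helper for illustration; not part of mergekit's API.
    """
    if num_layers == 1:
        return [float(anchors[0])]

    values = []
    last = len(anchors) - 1
    for layer in range(num_layers):
        # Position of this layer on the anchor axis, from 0 to last
        pos = layer / (num_layers - 1) * last
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        values.append((1 - frac) * anchors[lo] + frac * anchors[hi])
    return values


# Assumed layer count of 9, purely for a readable example
per_layer_t = expand_gradient([0, 0.5, 1, 0.5, 0], 9)
print(per_layer_t)
# [0.0, 0.25, 0.5, 0.75, 1.0, 0.75, 0.5, 0.25, 0.0]
```

Under this reading, the first and last layers stay close to `D:\VICIOUS_MESH-12B-OMEGA`, while the middle layers lean toward `D:\jetreessence`.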

Open LLM Leaderboard Evaluation Results

Detailed results can be found in the open-llm-leaderboard/bamec66557__NameLess-12B-prob-details dataset.

Metric	Value
Avg.	26.94
IFEval (0-Shot)	66.02
BBH (3-Shot)	31.36
MATH Lvl 5 (4-Shot)	11.10
GPQA (0-shot)	8.61
MuSR (0-shot)	14.70
MMLU-PRO (5-shot)	29.83
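
The Avg. row appears to be the unweighted mean of the six benchmark scores. A quick check, assuming that is how it was computed (the `scores` dictionary simply restates the table above):

```python
# Scores copied from the table above
scores = {
    "IFEval (0-Shot)": 66.02,
    "BBH (3-Shot)": 31.36,
    "MATH Lvl 5 (4-Shot)": 11.10,
    "GPQA (0-shot)": 8.61,
    "MuSR (0-shot)": 14.70,
    "MMLU-PRO (5-shot)": 29.83,
}

# Unweighted mean across the six benchmarks
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # 26.94
```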