Tofu_3B / README.md
jeiku's picture
Upload 9 files
1039fb0 verified
metadata
base_model:
  - jeiku/Rosa_v1_3B
  - jeiku/Theory_of_Mind_128_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Rosa_v1_3B
  - jeiku/PIPPA_128_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/LimaRP_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Theory_of_Mind_RP_128_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/No_Robots_Alpaca_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Alpaca_128_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Everything_v3_128_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/RPGPT_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Toxic_DPO_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Gnosis_256_StableLM
  - jeiku/Rosa_v1_3B
  - jeiku/Bluemoon_cleaned_StableLM
tags:
  - mergekit
  - merge

bigmix

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the task arithmetic merge method using jeiku/Rosa_v1_3B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: task_arithmetic
base_model: jeiku/Rosa_v1_3B
parameters:
  normalize: true
models:
  - model: jeiku/Rosa_v1_3B+jeiku/No_Robots_Alpaca_StableLM
    parameters:
      weight: 0.5
  - model: jeiku/Rosa_v1_3B+jeiku/Toxic_DPO_StableLM
    parameters:
      weight: 0.5
  - model: jeiku/Rosa_v1_3B+jeiku/Alpaca_128_StableLM
    parameters:
      weight: 0.4
  - model: jeiku/Rosa_v1_3B+jeiku/Everything_v3_128_StableLM
    parameters:
      weight: 0.4
  - model: jeiku/Rosa_v1_3B+jeiku/Gnosis_256_StableLM
    parameters:
      weight: 1
  - model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_128_StableLM
    parameters:
      weight: 0.8
  - model: jeiku/Rosa_v1_3B+jeiku/PIPPA_128_StableLM
    parameters:
      weight: 0.4
  - model: jeiku/Rosa_v1_3B+jeiku/LimaRP_StableLM
    parameters:
      weight: 0.7
  - model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_RP_128_StableLM
    parameters:
      weight: 0.6
  - model: jeiku/Rosa_v1_3B+jeiku/Bluemoon_cleaned_StableLM
    parameters:
      weight: 0.8
  - model: jeiku/Rosa_v1_3B+jeiku/RPGPT_StableLM
    parameters:
      weight: 0.4
dtype: float16