asiansoul's picture
Update README.md
6a79ddc verified
|
raw
history blame
1.17 kB
metadata
license: other
base_model:
  - beomi/Llama-3-Open-Ko-8B-Instruct-preview
  - beomi/Llama-3-Open-Ko-8B
library_name: transformers
tags:
  - mergekit
  - merge

πŸ‘‘ Llama-3-Open-Ko-Linear-8B

This is a merge of pre-trained language models created using mergekit.

🏝️ Merge Details

πŸ‡°πŸ‡· Merge Method

This model was merged using the task arithmetic merge method using beomi/Llama-3-Open-Ko-8B as a base.

πŸ‡°πŸ‡· Models Merged

The following models were included in the merge:

πŸ’Ύ Configuration

The following YAML configuration was used to produce this model:

models:
  - layer_range: [0, 31]
    model: beomi/Llama-3-Open-Ko-8B
    parameters:
      weight: 0.2
  - layer_range: [0, 31]
    model: beomi/Llama-3-Open-Ko-8B-Instruct-preview
    parameters:
      weight: 0.8
merge_method: task_arithmetic
base_model: beomi/Llama-3-Open-Ko-8B
dtype: bfloat16
random_seed: 0

πŸ’» Usage