KaraKaraWitch's picture
Update README.md
3162216 verified
metadata
license: other
license_name: qwen
license_link: https://huggingface.co/Qwen/Qwen2.5-72B/blob/main/LICENSE
base_model:
  - rombodawg/Rombos-LLM-V2.5-Qwen-72b
  - abacusai/Dracarys2-72B-Instruct
  - EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
  - ZeusLabs/Chronos-Platinum-72B
  - Qwen/Qwen2.5-72B
  - anthracite-org/magnum-v4-72b
  - m8than/banana-2-b-72b
language:
  - en
pipeline_tag: text-generation
library_name: transformers
tags:
  - mergekit
  - merge

LLENN-v0.69420-Qwen2.5-72b

image/png

Model stock merge for fun. Probably final model mix.
This merge is an answer to people's requests. I really don't wanna do more merges without myself considering to use it.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
  - model: ZeusLabs/Chronos-Platinum-72B
  - model: anthracite-org/magnum-v4-72b
  - model: abacusai/Dracarys2-72B-Instruct
  - model: rombodawg/Rombos-LLM-V2.5-Qwen-72b
  - model: m8than/banana-2-b-72b

merge_method: model_stock
base_model: Qwen/Qwen2.5-72B
parameters:
  normalize: true
dtype: bfloat16

Prompt Format

ChatML works for the most part.

Sampler Settings

Personally I use the following:

Temp: 1.2
Min P: 0.07
Rep Pen: 1.1

Others have suggested the following:

Temp: 1.1
Top P: 0.98
Min P: 0.05