Arkhiveus's picture
Update README.md
27728cd verified
|
raw
history blame
1.47 kB
---
base_model:
- Arkhiveus/L3.1-70B-LumineaDare
tags:
- mergekit
- merge
inference: false
---
# Luminea-Dare
![LumineaDare](LumineaDare.webp)
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). Seems to be better than the model stock version, retains [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) style formatting, while writing like [NeverSleep/Lumimaid-v0.2-70B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-70B)
## Merge Details
### Merge Method
This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [Arkhiveus/L3.1-70B-Luminea](https://huggingface.co/Arkhiveus/L3.1-70B-Luminea) as a base.
### Models Merged
The following models were included in the merge:
* [NeverSleep/Lumimaid-v0.2-70B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-70B)
* [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
parameters:
weight: 1.0
density: 1.0
- model: NeverSleep/Lumimaid-v0.2-70B
parameters:
weight: 1.0
density: 1.0
base_model: Arkhiveus/L3.1-70B-Luminea
merge_method: dare_ties
dtype: float16
name: Luminea-Dare
```