grimjim's picture
Initial release
a832b37
|
raw
history blame
1.2 kB
---
base_model:
- grimjim/HuatuoSkywork-o1-Llama-3.1-8B
- VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct
library_name: transformers
pipeline_tag: text-generation
tags:
- mergekit
- merge
license: llama3.1
---
# SauerHuatuoSkywork-o1-Llama-3.1-8B
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
An experiment to hybridize a relatively high scoring Llama 3.1 8B model with o1 reasoning capabilities.
Made with Llama.
## Merge Details
### Merge Method
This model was merged using the SLERP merge method.
### Models Merged
The following models were included in the merge:
* [grimjim/HuatuoSkywork-o1-Llama-3.1-8B](https://huggingface.co/grimjim/HuatuoSkywork-o1-Llama-3.1-8B)
* [VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct](https://huggingface.co/VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: grimjim/HuatuoSkywork-o1-Llama-3.1-8B
- model: VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct
merge_method: slerp
base_model: grimjim/HuatuoSkywork-o1-Llama-3.1-8B
parameters:
t:
- value: 0.96
dtype: bfloat16
```