---
base_model:
- NousResearch/Hermes-2-Pro-Llama-3-8B
- meta-llama/Meta-Llama-3-8B-Instruct
- hfl/llama-3-chinese-8b-instruct-v2
- shenzhi-wang/Llama3-8B-Chinese-Chat
library_name: transformers
tags:
- mergekit
- merge
---

# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the passthrough merge method, with [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) as the base model.

### Models Merged

The following models were included in the merge:

* [NousResearch/Hermes-2-Pro-Llama-3-8B](https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B)
* [hfl/llama-3-chinese-8b-instruct-v2](https://huggingface.co/hfl/llama-3-chinese-8b-instruct-v2)
* [shenzhi-wang/Llama3-8B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
slices:
  - sources:
      - model: "shenzhi-wang/Llama3-8B-Chinese-Chat"
        layer_range: [0, 10]
  - sources:
      - model: "hfl/llama-3-chinese-8b-instruct-v2"
        layer_range: [7, 17]
  - sources:
      - model: "NousResearch/Hermes-2-Pro-Llama-3-8B"
        layer_range: [13, 23]
  - sources:
      - model: "NousResearch/Hermes-2-Pro-Llama-3-8B"
        layer_range: [18, 28]
  - sources:
      - model: "NousResearch/Hermes-2-Pro-Llama-3-8B"
        layer_range: [22, 32]
merge_method: passthrough
base_model: "meta-llama/Meta-Llama-3-8B-Instruct"
dtype: bfloat16
```
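
To reproduce the merge, the configuration above can be passed to mergekit's `mergekit-yaml` CLI. A minimal sketch, assuming the config is saved as `config.yaml`; the filename and output directory are illustrative, not part of this repository:

```sh
# Install mergekit from PyPI, then run the merge.
# The output directory name is arbitrary; models are fetched from the Hugging Face Hub.
pip install mergekit
mergekit-yaml config.yaml ./merged-model
```

Note that the passthrough method stacks the selected layer ranges rather than averaging weights, so the resulting model has 50 transformer layers (five slices of 10 each) instead of the 32 layers of an 8B Llama 3 model; the overlapping ranges taken from Hermes-2-Pro duplicate some of its layers in the stack.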