--- base_model: - nvidia/Llama-3.1-Nemotron-70B-Instruct-HF library_name: transformers tags: - mergekit - merge license: apache-2.0 --- # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the SLERP merge method. ### Models Merged The following models were included in the merge: * [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF - model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF merge_method: slerp base_model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF dtype: float32 parameters: t: [0, 0.5, 1, 0.5, 0] # V shaped curve: Hermes for input & output, WizardMath in the middle layers ```