Harnessing Optimization Dynamics for Curvature-Informed Model Merging
Paper
•
2509.11167
•
Published
•
1
This is a fine-tuned Llama-3.1-8B model specialized for general instruction following tasks. This checkpoint was released alongside https://arxiv.org/abs/2509.11167.
This repository includes export files for state averaging and other advanced techniques.
Base model
meta-llama/Llama-3.1-8B