Llama-3.1-8B-Squareroot

This is a TIES merge that combines the performance of the following models:

image/png

Disclaimer: This one's a failed attempt. Working on a better version, so check back soon!

Benchmarks

The model ranks in the top 5 for MATH benchmarks but performs severely badly on others (which isn't quite what I was expecting). I’m hoping to improve its general abilities without losing its math skills. Qwen still has the top spot :(

image/png

Downloads last month
29
Safetensors
Model size
8.03B params
Tensor type
FP16
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for 3rd-Degree-Burn/Llama-3.1-8B-Squareroot-v0

Collection including 3rd-Degree-Burn/Llama-3.1-8B-Squareroot-v0