Finetune of amd135m using Rchatml format form reasoning-base-20k dataset from KingNish. Trying to see if i can get this small model to reason. Improvements, suggestions welcome. Will upload training script and dataset script soon (yell at me if I dont)

Downloads last month: 37

Safetensors

Model size

134M params

Tensor type

BF16

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for skdrx/amd135m_reasoning_finetune

Base model

amd/AMD-Llama-135m

Quantized

(16)

this model

skdrx
/

amd135m_reasoning_finetune

Model tree for skdrx/amd135m_reasoning_finetune

Datasets used to train skdrx/amd135m_reasoning_finetune