auryn_dpo_orpo_english

This is a ORPO fine-tune of meta-llama/Llama-3.2-1b trained on three epochs of https://huggingface.co/datasets/celsowm/auryn_dpo_orpo_english

Auryn is a fictional place intended to serve as a proof of concept for injecting knowledge into a large language model using ORPO.

Safetensors

Model size

1.24B params

Tensor type

BF16

Inference API

Unable to determine this model's library. Check the docs .

Model tree for celsowm/auryn_dpo_orpo_english

Base model

Finetuned

(184)

this model