Edit model card
Neurotic crown clown tak stack logo

neurotic-crown-clown-7b-tak-stack-dpo

neurotic-crown-clown-7b-tak-stack-dpo is a DPO fine-tuned version of CorticalStack/neurotic-crown-clown-7b-ties using the CorticalStack/tak-stack-dpo dataset.

LoRA

  • r: 32
  • LoRA alpha: 32
  • LoRA dropout: 0.05

Training arguments

  • Batch size: 4
  • Gradient accumulation steps: 4
  • Optimizer: paged_adamw_32bit
  • Max steps: 100
  • Learning rate: 5e-05
  • Learning rate scheduler type: cosine
  • Beta: 0.1
  • Max prompt length: 1024
  • Max length: 1536
Downloads last month
65
Safetensors
Model size
7.24B params
Tensor type
FP16
Β·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for CorticalStack/neurotic-crown-clown-7b-tak-stack-dpo

Finetuned
(1)
this model

Spaces using CorticalStack/neurotic-crown-clown-7b-tak-stack-dpo 4