image Mistral Nemo 12B Lyra v2 - Stage 1

This is Stage 1 of Lyra v2. Only SFT and 2-Step Merge has been performed at this stage.

This is still raw, there may be issues.

Kinda wordy and verbose like Claude is, but that is fixable in stage 2, hopefully.

Stage 2 Plans - RL currently TBD


- [/INST] may leak, let's say that is an unintended side-effect. Just simply add it to your custom stopping strings. I tried adding it as an EOS in model config but that ruined the model somehow, so I didn't.

- min_p value of 0.1 is a sane value to use. Temp? 0.7-1.2, up to you.

- Prompt Format: ChatML

Have a day.

Downloads last month
28
Safetensors
Model size
12.2B params
Tensor type
BF16
Β·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for Sao10K/MN-12B-Lyra-v2a1

Merges
2 models
Quantizations
6 models

Spaces using Sao10K/MN-12B-Lyra-v2a1 4