Description

This is an instruction following model (based on Mistral v0.1 Base) optimized for Russian language. It was trained using kolibrify on a multitude of instruction datasets.

The model uses ChatML template. It was trained to be sensitive to the system prompt, experiment with it.

Currently in pre-alpha, later releases will include more details regarding training procedure and data mix.

This model is an improved version of older kolibri-mistral-0427.

Instruction following evals

The model was tested using the following benchmarks:

Eval name Strict Value Loose Value
Avg. 53.81 56.57
ifeval-prompt-level 52.68 56.19
ifeval-instruction-level 62.82 66.18
ru-ifeval-prompt-level 44.36 46.39
ru-ifeval-instruction-level 55.39 57.55
Downloads last month
16
Safetensors
Model size
7.24B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Collection including kaleinaNyan/kolibri-mistral-0427-upd