Description

This repo contains GGUF files for the original model.

Files

Original model description

This model is primarily focused on chat, roleplay, storywriting, with good reasoning and logic.

Chronos can generate very long outputs with coherent text, largely due to the human inputs it was trained on, and it supports context length up to 4096 tokens

Up to 16384 with RoPE with solid coherency.

This model uses Alpaca formatting, so for optimal model performance, use it to start the dialogue or story, and if you use a frontend like SillyTavern ENABLE instruction mode:

### Instruction:
{Your instruction or question here.}

### Response:

Not using the format will make the model perform significantly worse than intended unless it is merged.

Downloads last month
4
GGUF
Model size
7.24B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Inference API (serverless) has been turned off for this model.

Model tree for RikudouSage/Chronos-Mistral-7B-GGUF

Quantized
(3)
this model