This model was converted to GGUF format from t-tech/T-lite-it-1.0 using llama.cpp via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Run with llama-cli (Windows example, using the 3-bit q3_k_s file):

.\llama-cli.exe -m .\models\t-lite-it-1.0-q3_k_s.gguf --gpu-layers 50 -p "Write only in Russian" -cnv
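For scripted use, the same invocation can be assembled in Python. This is a minimal sketch, not part of the original card: `build_llama_cli_command` is a hypothetical helper, the executable name `llama-cli` assumes the binary is on your PATH, and the flags are simply copied from the command above.

```python
import shlex


def build_llama_cli_command(model_path, gpu_layers=50,
                            prompt="Write only in Russian",
                            executable="llama-cli"):
    """Return the argument list mirroring the llama-cli call above.

    Hypothetical helper: it only reuses the flags from the card
    (-m, --gpu-layers, -p, -cnv) and does not validate them.
    """
    return [
        executable,
        "-m", str(model_path),              # path to the GGUF file
        "--gpu-layers", str(gpu_layers),    # layers to offload to the GPU
        "-p", prompt,                       # initial prompt
        "-cnv",                             # conversation mode
    ]


if __name__ == "__main__":
    cmd = build_llama_cli_command("models/t-lite-it-1.0-q3_k_s.gguf")
    # Print the command for inspection; to actually run it, pass `cmd`
    # to subprocess.run(cmd, check=True).
    print(shlex.join(cmd))
```

Lower `gpu_layers` if the model does not fit in GPU memory, or point `model_path` at another quantization from this repository.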

GGUF

Model size: 7.61B params
Architecture: qwen2
Available quantizations: 3-bit, 4-bit, 8-bit


Model tree for DefaultDF/T-Lite-It-1.0-Quants-GGUF: this model is one of 7 quantized models derived from the base model.