This model was converted to GGUF format from t-tech/T-lite-it-1.0 using llama.cpp via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Run with llama-cli (Windows):

.\llama-cli.exe -m .\models\t-lite-it-1.0-q3_k_s.gguf --gpu-layers 50 -p "Write only in Russian" -cnv
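
The same invocation can be scripted. The following is a minimal Python sketch, not part of the original model card: the helper names `build_llama_cli_args` and `run_chat` are hypothetical, and it only reuses the flags from the command above (`-m`, `--gpu-layers`, `-p`, `-cnv`). It assumes `llama-cli` is already built and the GGUF file has been downloaded to the given path.

```python
import subprocess
from pathlib import Path


def build_llama_cli_args(executable, model_path, gpu_layers, prompt):
    """Build the argument list for a conversational llama-cli session.

    Hypothetical helper: it mirrors the flags used in the command above
    and does not add any others.
    """
    return [
        str(executable),
        "-m", str(model_path),            # path to the GGUF file
        "--gpu-layers", str(gpu_layers),  # number of layers to offload to the GPU
        "-p", prompt,                     # initial prompt
        "-cnv",                           # conversation mode
    ]


def run_chat(executable, model_path, gpu_layers=50, prompt="Write only in Russian"):
    """Start llama-cli and return its exit code (hypothetical helper)."""
    model_path = Path(model_path)
    if not model_path.is_file():
        # Fail early with a clear message instead of letting llama-cli error out.
        raise FileNotFoundError(f"GGUF file not found: {model_path}")
    args = build_llama_cli_args(executable, model_path, gpu_layers, prompt)
    # Passing a list (not a shell string) avoids quoting problems in the prompt.
    return subprocess.run(args, check=False).returncode


# Example call (paths are assumptions, adjust them to your setup):
# run_chat("./llama-cli.exe", "./models/t-lite-it-1.0-q3_k_s.gguf")
```

If the model does not fit in GPU memory, lowering `gpu_layers` keeps more layers on the CPU; the value 50 is simply the one used in the command above.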
Model details:
Format: GGUF
Model size: 7.61B params
Architecture: qwen2
Available quantizations: 3-bit, 4-bit, 8-bit
