QuantFactory/a1-v005-GGUF

This is a quantized version of ashercn97/a1-v005, created using llama.cpp.

Original Model Card

Uploaded model

  • Developed by: ashercn97
  • License: apache-2.0
  • Finetuned from model: unsloth/qwen2.5-7b-bnb-4bit

This qwen2 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

  • Format: GGUF
  • Model size: 7.62B params
  • Architecture: qwen2

  • Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
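
As a rough guide to choosing among the bit widths listed above, the sketch below estimates how much space the weights would take at each width, using the 7.62B parameter count from this card. It assumes every weight is stored at exactly the nominal bit width, which is a simplification: real GGUF quantization schemes mix precisions and add metadata, so actual file sizes will differ. The function name `estimate_size_gb` is made up for this illustration and is not part of llama.cpp.

```python
# Back-of-the-envelope estimate: parameters * bits per weight / 8 bytes.
# Assumption: uniform bit width for all weights; real GGUF files will differ.
PARAMS = 7.62e9  # parameter count reported on this model card


def estimate_size_gb(params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for bits in (2, 3, 4, 5, 6, 8):
        print(f"{bits}-bit: ~{estimate_size_gb(PARAMS, bits):.2f} GB")
```

Lower bit widths trade output quality for a smaller memory footprint, so an estimate like this is best treated as a lower bound when checking whether a given file is likely to fit in available RAM or VRAM.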
