neuralmagic
/

Qwen2-1.5B-Instruct-quantized.w4a16

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

Qwen2-1.5B-Instruct-quantized.w4a16 / mmlu-vllm

2 contributors

History: 1 commit

abhinavnmagic's picture

Upload folder using huggingface_hub

4debe3d verified 6 months ago

__cache__abhinav__models__Phase1__gptq-Qwen__Qwen2-1.5B-Instruct-garage-bAInd__Open-Platypus-mse-damp0.1-ns512-seqlen4K
Upload folder using huggingface_hub 6 months ago