neuralmagic/Qwen2-1.5B-Instruct-quantized.w4a16
Tags: Text Generation, Transformers, Safetensors, English, qwen2, conversational, text-generation-inference, Inference Endpoints, 4-bit precision, gptq
License: apache-2.0
Qwen2-1.5B-Instruct-quantized.w4a16/mmlu-vllm (revision 4debe3d)
2 contributors
History: 1 commit
abhinavnmagic: Upload folder using huggingface_hub (4debe3d, verified, 6 months ago)
__cache__abhinav__models__Phase1__gptq-Qwen__Qwen2-1.5B-Instruct-garage-bAInd__Open-Platypus-mse-damp0.1-ns512-seqlen4K (Upload folder using huggingface_hub, 6 months ago)