The official prequantized EfficientQAT models.
Mengzhao Chen
ChenMnZ
AI & ML interests
model compression
Recent Activity
liked
a model
about 2 months ago
nvidia/DeepSeek-R1-FP4
upvoted
a
paper
2 months ago
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive
Survey
upvoted
a
paper
3 months ago
MangaNinja: Line Art Colorization with Precise Reference Following
Organizations
None yet
Collections
4
models
129
ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ
Updated
•
1
•
25
ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-BitBLAS
Text Generation
•
Updated
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-BitBLAS
Text Generation
•
Updated
•
1
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-BitBLAS
Text Generation
•
Updated
•
1
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-BitBLAS
Text Generation
•
Updated
•
1
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-BitBLAS
Text Generation
•
Updated
•
2
ChenMnZ/Llama-3-8b-EfficientQAT-w4g128-BitBLAS
Text Generation
•
Updated
•
1
ChenMnZ/Llama-3-8b-EfficientQAT-w2g64-BitBLAS
Text Generation
•
Updated
•
1
ChenMnZ/Llama-3-8b-EfficientQAT-w2g128-BitBLAS
Text Generation
•
Updated
•
1
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-BitBLAS
Text Generation
•
Updated
•
1
datasets
None public yet