ChenMnZ's Collections
EfficientQAT(GPTQ format)
Updated Aug 6, 2024
EfficientQAT-quantized models in the GPTQ data format.
ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 8
ChenMnZ/Llama-3-8b-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 10
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 12
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-GPTQ • Text Generation • Updated Jul 22, 2024 • 7
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 18 • 1
ChenMnZ/Llama-3-8b-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 17 • 1
ChenMnZ/Llama-3-8b-EfficientQAT-w2g64-GPTQ • Text Generation • Updated Jul 22, 2024 • 10
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 6
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-GPTQ • Text Generation • Updated Jul 22, 2024 • 9
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 12
ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-GPTQ • Text Generation • Updated Jul 22, 2024 • 9
ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 9
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 9
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ • Text Generation • Updated Jul 22, 2024 • 20
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 13
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 7
ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ • Text Generation • Updated Jul 22, 2024 • 8
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 8
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 9
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-GPTQ • Text Generation • Updated Jul 22, 2024 • 7
ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ • Updated Aug 6, 2024 • 4 • 25
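
The repository names in this collection share a visible pattern: a base model name, then `EfficientQAT`, then a suffix such as `w2g64` or `w4g128`, then `GPTQ`. The page itself does not define the suffix. The sketch below assumes the common quantization naming convention in which `w` gives the weight bit-width and `g` gives the quantization group size. The names `QuantSpec` and `parse_repo_id` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class QuantSpec(NamedTuple):
    """Fields read from a repository name (hypothetical helper type)."""
    base_model: str
    weight_bits: int   # assumed meaning of the "w<N>" part
    group_size: int    # assumed meaning of the "g<N>" part


# Optional "owner/" prefix, then "<base>-EfficientQAT-w<bits>g<group>-GPTQ".
_PATTERN = re.compile(
    r"^(?:[^/]+/)?(?P<base>.+)-EfficientQAT-w(?P<bits>\d+)g(?P<group>\d+)-GPTQ$"
)


def parse_repo_id(repo_id: str) -> QuantSpec:
    """Split a repository name from this collection into its parts."""
    match = _PATTERN.match(repo_id)
    if match is None:
        raise ValueError(f"unrecognized repository name: {repo_id!r}")
    return QuantSpec(match["base"], int(match["bits"]), int(match["group"]))


if __name__ == "__main__":
    for repo in (
        "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-GPTQ",
        "ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ",
    ):
        print(parse_repo_id(repo))
```

A helper like this can be used to group or filter the entries above by bit-width or group size. It only reads the name and makes no claim about the contents of any repository.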