vhab10/llama_3.1_8b_Q4_K_M-gguf
Tags: Text Generation, Transformers, GGUF, English, llama, quantization, cpu, gpu, efficient-inference, Inference Endpoints
License: apache-2.0
1 contributor
History: 4 commits
Latest commit: "Update README.md" by vhab10 (14df8d5, verified), 26 days ago
File | Size | Last commit | Updated
.gitattributes | 1.58 kB | Upload llama_3.1_8b_Q4_K_M.gguf with huggingface_hub | about 1 month ago
README.md | 1.3 kB | Update README.md | 26 days ago
llama_3.1_8b_Q4_K_M.gguf | 4.92 GB (LFS) | Upload llama_3.1_8b_Q4_K_M.gguf with huggingface_hub | about 1 month ago