Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
vhab10
/
llama_3.1_8b_Q4_K_M-gguf
like
0
Text Generation
Transformers
GGUF
English
llama
quantization
cpu
gpu
efficient-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
a062dbe
llama_3.1_8b_Q4_K_M-gguf
1 contributor
History:
3 commits
vhab10
Create README.md
a062dbe
verified
about 1 month ago
.gitattributes
1.58 kB
Upload llama_3.1_8b_Q4_K_M.gguf with huggingface_hub
about 1 month ago
README.md
1.27 kB
Create README.md
about 1 month ago
llama_3.1_8b_Q4_K_M.gguf
4.92 GB
LFS
Upload llama_3.1_8b_Q4_K_M.gguf with huggingface_hub
about 1 month ago