neuralmagic-ent/Llama-3.1-8B-Instruct-quantized.w8a8
Tags: Text Generation, Safetensors, 8 languages, llama, int8, vllm, conversational, 8-bit precision, compressed-tensors
arXiv: 2210.17323
License: llama3.1
Branch: main
1 contributor, History: 3 commits
Latest commit: nm-research, "Update README.md" (801bf03, verified), 16 days ago
File                              Size           Last commit                          Updated
.gitattributes                    1.52 kB        initial commit                       16 days ago
README.md                         15.6 kB        Update README.md                     16 days ago
config.json                       2.15 kB        Upload folder using huggingface_hub  16 days ago
generation_config.json            184 Bytes      Upload folder using huggingface_hub  16 days ago
model-00001-of-00002.safetensors  5 GB (LFS)     Upload folder using huggingface_hub  16 days ago
model-00002-of-00002.safetensors  4.08 GB (LFS)  Upload folder using huggingface_hub  16 days ago
model.safetensors.index.json      43.5 kB        Upload folder using huggingface_hub  16 days ago
recipe.yaml                       173 Bytes      Upload folder using huggingface_hub  16 days ago
special_tokens_map.json           325 Bytes      Upload folder using huggingface_hub  16 days ago
tokenizer.json                    9.09 MB        Upload folder using huggingface_hub  16 days ago
tokenizer_config.json             55.4 kB        Upload folder using huggingface_hub  16 days ago