Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
compressa-ai
/
Saiga-Llama-3-8B-OmniQuant
like
0
Follow
Compressa
7
Text Generation
Transformers
Safetensors
Russian
llama
saiga
llama3
omniquant
gptq
triton
conversational
text-generation-inference
Inference Endpoints
4-bit precision
License:
llama3
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Saiga-Llama-3-8B-OmniQuant
2 contributors
History:
9 commits
Vasily Alexeev
edit tags, refine table
7e31925
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
Safe
6.78 kB
edit tags, refine table
10 months ago
compressa-config.json
Safe
663 Bytes
add weights and stuff
10 months ago
config.json
Safe
885 Bytes
add weights and stuff
10 months ago
generation_config.json
Safe
277 Bytes
add weights and stuff
10 months ago
model-00001-of-00002.safetensors
Safe
4.68 GB
LFS
add weights and stuff
10 months ago
model-00002-of-00002.safetensors
Safe
1.05 GB
LFS
add weights and stuff
10 months ago
model.safetensors.index.json
Safe
78.5 kB
add weights and stuff
10 months ago
quant_config.json
Safe
63 Bytes
add weights and stuff
10 months ago
special_tokens_map.json
Safe
563 Bytes
add weights and stuff
10 months ago
tokenizer.json
Safe
9.08 MB
add weights and stuff
10 months ago
tokenizer_config.json
Safe
51.3 kB
add weights and stuff
10 months ago