Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

compressa-ai
/

Saiga-Llama-3-8B-OmniQuant

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

Saiga-Llama-3-8B-OmniQuant

2 contributors

History: 9 commits

Vasily Alexeev

edit tags, refine table

7e31925 10 months ago

.gitattributes

1.52 kB

initial commit 10 months ago
README.md

6.78 kB

edit tags, refine table 10 months ago
compressa-config.json

663 Bytes

add weights and stuff 10 months ago
config.json

885 Bytes

add weights and stuff 10 months ago
generation_config.json

277 Bytes

add weights and stuff 10 months ago
model-00001-of-00002.safetensors

4.68 GB
LFS

add weights and stuff 10 months ago
model-00002-of-00002.safetensors

1.05 GB
LFS

add weights and stuff 10 months ago
model.safetensors.index.json

78.5 kB

add weights and stuff 10 months ago
quant_config.json

63 Bytes

add weights and stuff 10 months ago
special_tokens_map.json

563 Bytes

add weights and stuff 10 months ago
tokenizer.json

9.08 MB

add weights and stuff 10 months ago
tokenizer_config.json

51.3 kB

add weights and stuff 10 months ago