Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pankajroark
/
llama-fp16-engine
like
0
Model card
Files
Files and versions
xet
Community
main
llama-fp16-engine
/
7b-sq-int8kv-tp1
7.01 GB
1 contributor
History:
2 commits
pankajroark
inflight batching engine for 7b-sq-int8kv-tp1
2319002
almost 2 years ago
config.json
Safe
1.31 kB
inflight batching engine for 7b-sq-int8kv-tp1
almost 2 years ago
llama_float16_tp1_rank0.engine
Safe
7.01 GB
xet
inflight batching engine for 7b-sq-int8kv-tp1
almost 2 years ago
model.cache
Safe
96.1 kB
inflight batching engine for 7b-sq-int8kv-tp1
almost 2 years ago