Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DeepInfra
/
Llama-2-70b-chat-hf-trt-fp8
like
0
Follow
Deep Infra Inc.
12
License:
llama2
Model card
Files
Files and versions
xet
Community
main
Llama-2-70b-chat-hf-trt-fp8
71.2 GB
2 contributors
History:
14 commits
Pernekhan
Set legacy to True after initialization
95d9654
almost 2 years ago
ensemble
update models for newer trt 0.6.1 version
almost 2 years ago
postprocessing
bugfix
almost 2 years ago
preprocessing
Set legacy to True after initialization
almost 2 years ago
tensorrt_llm
update models for newer trt 0.6.1 version
almost 2 years ago
.gitattributes
Safe
1.91 kB
add trtllm weights
almost 2 years ago
.gitignore
Safe
6 Bytes
add smaller files
almost 2 years ago
README.md
Safe
24 Bytes
initial commit
almost 2 years ago