Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

DeepInfra
/
Llama-2-70b-chat-hf-trt-fp8

Model card Files Files and versions
xet
Community
Llama-2-70b-chat-hf-trt-fp8
71.2 GB
  • 2 contributors
History: 14 commits
Pernekhan's picture
Pernekhan
Set legacy to True after initialization
95d9654 almost 2 years ago
  • ensemble
    update models for newer trt 0.6.1 version almost 2 years ago
  • postprocessing
    bugfix almost 2 years ago
  • preprocessing
    Set legacy to True after initialization almost 2 years ago
  • tensorrt_llm
    update models for newer trt 0.6.1 version almost 2 years ago
  • .gitattributes
    1.91 kB
    add trtllm weights almost 2 years ago
  • .gitignore
    6 Bytes
    add smaller files almost 2 years ago
  • README.md
    24 Bytes
    initial commit almost 2 years ago