license: mit
language:
- en
tags:
- text-generation-inference
- text
## TinyLlama TensorRT LLM Edition
This repo contains the TensorRT LLM version of the TinyLlama model. The conversion targets Float16 precision on NVIDIA TensorRT.
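
As a rough illustration of what Float16 precision means for stored values, the sketch below rounds a few numbers to IEEE 754 half precision using only the Python standard library. It is not part of this repo's conversion pipeline; the helper name `to_float16` and the sample values are hypothetical and chosen only for illustration.

```python
import struct


def to_float16(value: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    # The 'e' format code packs a float into 2 bytes (binary16);
    # unpacking it again yields the rounded value as a Python float.
    return struct.unpack("<e", struct.pack("<e", value))[0]


# Hypothetical sample values, all within the Float16 range (max 65504).
samples = [0.1, 1.0, 1000.1, 65504.0]

for original in samples:
    rounded = to_float16(original)
    print(f"{original!r} -> {rounded!r} (abs error {abs(original - rounded):.6g})")
```

Values that are exactly representable in half precision, such as 1.0, survive the round trip unchanged, while values such as 0.1 pick up a small rounding error. Inputs beyond the Float16 range make `struct.pack` raise `OverflowError`, so the sample values stay within it.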