--- license: mit language: - en tags: - text-generation-inference - text --- ## TinyLLama TensorRT LLM Edition. This repo contains the TensorRT LLM version of TinyLlama Model. The conversion is done to support Float16 precision on Nvidia TensorRT.