Download a Model
cortex models pull tinyllama
Start the Engine
cortex engines
Start model
cortex models start tinyllama
Chat
cortex chat
Model Structure
1.1b-chat-v1.0-gguf-Q2_K
1.1b-chat-v1.0-gguf-Q4_K_M
1.1b-chat-v1.0-gguf-Q6_K
1.1b-chat-v1.0-gguf-Q8_0
1.1b-chat-v1.0-tensorrtllm-windows-ada-int4
1.1b-chat-v1.0-tensorrtllm-windows-turing-int4
1.1b-chat-v1.0-tensorrtllm-windows-ampere-int4
1.1b-chat-v1.0-tensorrtllm-windows-ada-fp16
1.1b-chat-v1.0-tensorrtllm-windows-turing-fp16
1.1b-chat-v1.0-tensorrtllm-windows-ampere-fp16
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.