Download a Model
cortex models pull tinyllama
Start the Engine
cortex engines
Start model
cortex models start tinyllama
Chat
cortex chat
Model Structure
1.1b-chat-v1.0-gguf-Q2_K
1.1b-chat-v1.0-gguf-Q4_K_M
1.1b-chat-v1.0-gguf-Q6_K
1.1b-chat-v1.0-gguf-Q8_0
1.1b-chat-v1.0-tensorrtllm-windows-ada-int4
1.1b-chat-v1.0-tensorrtllm-windows-turing-int4
1.1b-chat-v1.0-tensorrtllm-windows-ampere-int4
1.1b-chat-v1.0-tensorrtllm-windows-ada-fp16
1.1b-chat-v1.0-tensorrtllm-windows-turing-fp16
1.1b-chat-v1.0-tensorrtllm-windows-ampere-fp16
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.