Model Sources

https://huggingface.co/HuggingFaceTB/SmolLM-360M-Instruct

Uses

A very small model for running on edge devices, with fast time to first token (TTFT) and high throughput.

Direct Use

Use llama.cpp to run inference on the model.
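
As a rough sketch of what that can look like, the Python snippet below assembles a `llama-cli` invocation (the command-line binary shipped with llama.cpp) and runs it only if both the binary and the model file are present. The model filename `smollm-360m-instruct.gguf`, the prompt, and the token limit are placeholders chosen for illustration, not values taken from this repository; substitute the GGUF file you actually downloaded.

```python
import os
import shutil
import subprocess

# Hypothetical local filename (an assumption); replace with your downloaded GGUF file.
MODEL_PATH = "smollm-360m-instruct.gguf"


def build_command(model_path, prompt, max_tokens=64):
    """Return the argument list for one llama-cli generation.

    -m selects the model file, -p passes the prompt,
    -n caps the number of tokens to generate.
    """
    return [
        "llama-cli",
        "-m", model_path,
        "-p", prompt,
        "-n", str(max_tokens),
    ]


if __name__ == "__main__":
    cmd = build_command(MODEL_PATH, "Explain what an edge device is in one sentence.")
    # Only launch llama.cpp when both the binary and the model are available.
    if shutil.which(cmd[0]) is None or not os.path.exists(MODEL_PATH):
        print("llama-cli or model file not found; would run:", " ".join(cmd))
    else:
        subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single shell string avoids quoting problems when the prompt contains spaces or punctuation.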

Model Details

Format: GGUF
Model size: 362M params
Architecture: llama
Quantization: 16-bit