Download and Compile LLaMA.cpp

Follow the quick-start instructions in the llama.cpp README:
https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md#quick-start
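
As a rough sketch of what the download-and-compile step can look like, the snippet below clones the repository and builds it with CMake. It assumes git, CMake, and a C/C++ toolchain are already installed. The function name build_llama_cpp is only an illustrative wrapper so that nothing runs until you call it; the linked README remains the authoritative reference for build options.

```shell
# Sketch only: clone and build llama.cpp with CMake.
# Assumption: git, cmake, and a C/C++ compiler are available on PATH.
# build_llama_cpp is a hypothetical helper name, not part of llama.cpp.
build_llama_cpp() {
    git clone https://github.com/ggerganov/llama.cpp &&
    cd llama.cpp &&
    cmake -B build &&
    cmake --build build --config Release
}
```

Call build_llama_cpp to run the steps. A CMake build typically places the binary at build/bin/llama-cli, whereas the commands in this section assume llama-cli is in the current directory, so adjust the path (or copy the binary) to match your setup.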

Unix-based systems (Linux, macOS, etc.):

Input prompt (one-shot):

./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --prompt "Once upon a time"

Conversation mode (allows continuous interaction with the model):

./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin -cnv --chat-template gemma

Infinite text from a starting prompt (you can use Ctrl-C to stop it):

./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --ignore-eos -n -1
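
If you switch between these three modes often, a small wrapper can save retyping. The following is a minimal sketch, not part of llama.cpp: run_llama, LLAMA_CLI, and MODEL are hypothetical names chosen for illustration, and the flags are exactly those used in the commands above.

```shell
# Hypothetical helper (not part of llama.cpp): select one of the three modes.
# LLAMA_CLI and MODEL are illustrative variables; override them as needed.
LLAMA_CLI="${LLAMA_CLI:-./llama-cli}"
MODEL="${MODEL:-LLaMA-O1-Supervised-1129-Q2_K.bin}"

run_llama() {
    case "${1:-}" in
        # One-shot completion; the second argument is the prompt text.
        prompt)   "$LLAMA_CLI" -m "$MODEL" --prompt "${2:-Once upon a time}" ;;
        # Interactive conversation.
        chat)     "$LLAMA_CLI" -m "$MODEL" -cnv --chat-template gemma ;;
        # Unbounded generation; stop it with Ctrl-C.
        infinite) "$LLAMA_CLI" -m "$MODEL" --ignore-eos -n -1 ;;
        *)        echo "usage: run_llama prompt [TEXT] | chat | infinite" >&2
                  return 2 ;;
    esac
}
```

For example, run_llama prompt "Once upon a time" runs the one-shot command, and run_llama chat starts conversation mode. Setting LLAMA_CLI to another path (such as build/bin/llama-cli) points the wrapper at a binary elsewhere.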