import os

from llama_cpp import Llama

# Path to the merged GGUF model file (F16 precision)
model_path = "llama-3.2-1B-it-Ecommerce-ChatBot-merged-F16.gguf"

# Use all available CPU cores for inference
n_threads = os.cpu_count()

llm = Llama(
    model_path=model_path,
    n_ctx=512,             # context window size in tokens
    n_batch=64,            # prompt tokens processed per batch
    n_threads=n_threads,   # CPU threads used for generation
    n_gpu_layers=-1,       # offload all layers to the GPU if one is available
    chat_format="llama-3"  # apply the Llama 3 chat prompt template
)
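
# With the model loaded, it can be queried through the OpenAI-style chat API
# that llama-cpp-python exposes. A minimal sketch follows; the system prompt,
# user message, and sampling settings are illustrative assumptions, not values
# taken from the model card.
messages = [
    {"role": "system", "content": "You are a helpful e-commerce support assistant."},
    {"role": "user", "content": "Where is my order?"},
]

response = llm.create_chat_completion(
    messages=messages,
    max_tokens=256,   # cap the length of the generated reply
    temperature=0.7,  # moderate sampling randomness
)

# The response mirrors the OpenAI chat-completion schema
print(response["choices"][0]["message"]["content"])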