alexgusevski/anemll-Llama-3.2-1B-Instruct-ctx4096_0.1.2

Apple Neural Engine


anemll-Llama-3.2-1B-Instruct-ctx4096_0.1.2 / meta.yaml

Upload model

9b5ae4d verified 6 days ago

450 Bytes

model_info:
  name: anemll-Llama-3.2-1B-Instruct-ctx4096
  version: 0.1.1
  description: |
    Demonstrates running Llama-3.2-1B-Instruct on Apple Neural Engine
    Context length: 4096
    Batch size: 64
    Chunks: 2
  license: MIT
  author: Anemll
  framework: Core ML
  language: Python
  parameters:
    context_length: 4096
    batch_size: 64
    lut_embeddings: none
    lut_ffn: 4
    lut_lmhead: 6
    num_chunks: 2
    model_prefix: llama