alexgusevski's picture
Upload model
9b5ae4d verified
raw
history blame contribute delete
450 Bytes
model_info:
name: anemll-Llama-3.2-1B-Instruct-ctx4096
version: 0.1.1
description: |
Demonstarates running Llama-3.2-1B-Instruct on Apple Neural Engine
Context length: 4096
Batch size: 64
Chunks: 2
license: MIT
author: Anemll
framework: Core ML
language: Python
parameters:
context_length: 4096
batch_size: 64
lut_embeddings: none
lut_ffn: 4
lut_lmhead: 6
num_chunks: 2
model_prefix: llama