Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
14
Follow
AWS Inferentia and Trainium
84
License:
apache-2.0
Model card
Files
Files and versions
Community
335
main
optimum-neuron-cache
/
neuronxcc-2.15.128.0+56dc5a86
/
0_REGISTRY
/
0.0.25
/
inference
/
llama
/
princeton-nlp
Commit History
Synchronizing local compiler cache.
a74c2af
verified
dacorvo
HF staff
commited on
Oct 1, 2024
Synchronizing local compiler cache.
1fe9ce0
verified
dacorvo
HF staff
commited on
Oct 1, 2024