Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ISTA-DASLab
/
Llama-2-7b-AQLM-2Bit-1x16-hf
like
5
Follow
IST Austria Distributed Algorithms and Systems Lab
52
Text Generation
Transformers
Safetensors
llama
text-generation-inference
Inference Endpoints
aqlm
arxiv:
2401.06118
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
main
Llama-2-7b-AQLM-2Bit-1x16-hf
/
config.json
Commit History
new dispatch
cffb465
BlackSamorez
commited on
Feb 21
try except flash-attn
f48478c
Andrei Panferov
commited on
Feb 6
newer inference
115e749
Andrei Panferov
commited on
Jan 20
new code
dfb8eb3
Andrei Panferov
commited on
Jan 18
inference and autoloading
5c0d7ef
Andrei Panferov
commited on
Jan 18
config
d1f8951
Andrei Panferov
commited on
Jan 18