Dragan Stoll
dragstoll
AI & ML interests
NLP, Prediction
Recent Activity
upvoted an article 19 days ago
Open-R1: a fully open reproduction of DeepSeek-R1
liked a model 7 months ago
unsloth/Mistral-Large-Instruct-2407-bnb-4bit
new activity 9 months ago
mistralai/Mixtral-8x22B-v0.1: Support for quantized cache
Organizations
None yet
dragstoll's activity
Support for quantized cache
#5 opened 9 months ago by dragstoll
AutoModelForCausalLM does not seem to work for Mixtral
8
#8 opened about 1 year ago by Mauceric