Adding `safetensors` variant of this model
#3 opened 9 days ago
by
SFconvertbot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/635fd4cc14657fb8cff2a081/GDkyDwAcuqDBpaOvQgJuq.png)
Adding `safetensors` variant of this model
#2 opened almost 2 years ago
by
SFconvertbot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/635fd4cc14657fb8cff2a081/GDkyDwAcuqDBpaOvQgJuq.png)
Mismatch in attention weights for causal masked tokens vs attention masked tokens
#1 opened about 2 years ago
by
LakshyAAAgrawal