Adding `safetensors` variant of this model
#4 opened about 2 months ago by SFconvertbot
Adding `safetensors` variant of this model
#3 opened 2 months ago by SFconvertbot
Allow for attention weights to be extracted.
#2 opened 2 months ago by FJFehr
Included gradient checkpointing
#1 opened 2 months ago by FJFehr