Adding `safetensors` variant of this model
#3 opened about 2 months ago
by
SFconvertbot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/635fd4cc14657fb8cff2a081/GDkyDwAcuqDBpaOvQgJuq.png)
Cannot set sequence length higher than 2048 & doesn't support the optimized triton implementation of FlashAttention
#2 opened over 1 year ago
by
t83714
Would it work well with sequence length > 2048?
2
#1 opened over 1 year ago
by
SamuelAzran