Adding `safetensors` variant of this model
#4 opened 27 days ago by SFconvertbot
Adding `safetensors` variant of this model
#3 opened about 1 month ago by SFconvertbot
Allow for attention weights to be extracted.
#2 opened about 2 months ago by FJFehr
Included gradient checkpointing
#1 opened about 2 months ago by FJFehr