Adding `safetensors` variant of this model
#4 opened 24 days ago by SFconvertbot
Adding `safetensors` variant of this model
#3 opened about 1 month ago by SFconvertbot
Allow for attention weights to be extracted.
#2 opened about 1 month ago by FJFehr
Included gradient checkpointing
#1 opened about 1 month ago by FJFehr