Not distilled!
1
#5 opened about 4 hours ago
by
supercharge19
Do the distilled models also have 128K context?
1
#4 opened about 11 hours ago
by
Troyanovsky
How was this quantized?
#3 opened about 19 hours ago
by
imq
missing special_tokens_map.json file
#2 opened about 22 hours ago
by
vince62s