Not distilled!
3
#5 opened about 15 hours ago
by
supercharge19
Do the distilled models also have 128K context?
1
#4 opened about 22 hours ago
by
Troyanovsky
How was this quantized?
#3 opened 1 day ago
by
imq
missing special_tokens_map.json file
#2 opened 1 day ago
by
vince62s