halfmoon
halfmoon039
AI & ML interests
None yet
Organizations
None yet
halfmoon039's activity
What is the max sequence length that model can compute if I use flash attention?
1
#20 opened 9 months ago
by
halfmoon039