Oleg Dmitriev
qilowoq
AI & ML interests
NLP (mainly in Russian)
Recent Activity
liked
a dataset
3 days ago
ChicagoHAI/CaseSumm
liked
a dataset
11 days ago
google/FACTS-grounding-public
new activity
14 days ago
google/gemma-2-9b-it:Sliding window vs. Global Attention
Organizations
qilowoq's activity
Sliding window vs. Global Attention
6
#41 opened 5 months ago
by
tanliboy
Adding `safetensors` variant of this model
#4 opened about 2 months ago
by
SFconvertbot
Adding `safetensors` variant of this model
#1 opened about 2 months ago
by
SFconvertbot
How can we access the logits from this model output?
5
#3 opened over 1 year ago
by
vishwasprabhub
Methodology questions
2
#2 opened over 1 year ago
by
justinbarton
Different size between tokenizer vocab and embedding
2
#1 opened over 1 year ago
by
demharters
Different size between tokenizer vocab and embedding
2
#1 opened over 1 year ago
by
demharters