Pretraining Using HF Tokenizers and Transformers
#36 opened 1 day ago
by
akhooli
Update README.md
1
#35 opened 2 days ago
by
solankibhargav
Unpadding and Sequence Packing inference example?
2
#34 opened 3 days ago
by
denti
Interview Request: Thoughts on Model Documentation
#33 opened 3 days ago
by
evatang
Training Data?
#32 opened 5 days ago
by
binarymax
What is the position of this model in MTEB leaderboard?
2
#31 opened 5 days ago
by
deepak-banka
tokenizer
1
#24 opened 6 days ago
by
ulasarikaya
RuntimeError: Failed to import transformers.models.modernbert.modeling_modernbert
2
#21 opened 7 days ago
by
SantoshHF
Pretraining data cutoff?
#17 opened 7 days ago
by
ytsaig
How to use ModernBERT with the AutoModelForQuestionAnswering class?
1
#15 opened 9 days ago
by
sraj
Is ModernBERT already fine-tuned for IR tasks?
3
#13 opened 9 days ago
by
belerico
Question about output embedding vector of ModernBERT
#12 opened 9 days ago
by
Youm9602
ModernBert for multi-vector embeddings
#11 opened 9 days ago
by
admarcosai
Inference fails on CPU: `ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)`
6
#10 opened 10 days ago
by
umarbutler
How to use ModernBERT as a sentence transformer?
30
#9 opened 11 days ago
by
hungrybiker
multilingual
2
#8 opened 11 days ago
by
ale-volpe
Is this model meant for full bfloat16, AMP bfloat16 or no bfloat16?
#7 opened 11 days ago
by
umarbutler
# Fine-tuning ModernBERT on a Large Dataset with Masked Language Modelling
1
#6 opened 11 days ago
by
ssmits
Precisions about the config properties wrt the paper
1
#5 opened 12 days ago
by
TomSchelsen
bug: model output logits have detached gradient
#4 opened 12 days ago
by
andersonbcdefg
How to see which version of Transformers library is needed to get access to this model
16
#3 opened 12 days ago
by
aero-artem