Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published 3 days ago • 23
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 5 days ago • 105
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 11
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated about 1 month ago • 124