Article 🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 1 day ago • 24
GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated 25 days ago • 9
mrm8488/ModernBERT-base-ft-fineweb-edu-annotations Text Classification • Updated 10 days ago • 364 • 10
Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 5 days ago • 17
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 6 days ago • 10
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 17 days ago • 45