view article Article 🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 1 day ago • 24
GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated 25 days ago • 9
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 5 days ago • 17
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 6 days ago • 10
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 17 days ago • 45
Smol but mighty Collection A collection of smoll but mighty models • 10 items • Updated 16 days ago • 4
LLaMat Collection Foundational Large Language Models for Materials Research • 6 items • Updated 22 days ago • 3
Bad Data Toolbox Collection PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18, 2024 • 15
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated 30 days ago • 28
view article Article Accelerating Embedding & Reranking Models on AMD Using Infinity By michaelfeil • Dec 3, 2024 • 4
view article Article The case for specialized pre-training: ultra-fast foundation models for dedicated tasks By Pclanglais • Aug 4, 2024 • 27