ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated 15 days ago β’ 112
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. β’ 8 items β’ Updated 17 days ago β’ 45
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 16 days ago β’ 75
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper β’ 2412.10360 β’ Published 21 days ago β’ 136
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. β’ 10 items β’ Updated 25 days ago β’ 84
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ about 1 month ago β’ 75
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 23 items β’ Updated 21 days ago β’ 122
view article Article Ultimate Guide to Website Crawling for Offline Use: Top 20 Methods By luigi12345 β’ Nov 24, 2024 β’ 2
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb β’ Nov 28, 2024 β’ 127
Qwen2-VL Collection Vision-language model series based on Qwen2 β’ 16 items β’ Updated 29 days ago β’ 186
OLMo 2 Collection Artifacts for the second set of OLMo models. β’ 20 items β’ Updated about 12 hours ago β’ 62
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct β’ 5 items β’ Updated 12 days ago β’ 30
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 7 items β’ Updated Nov 27, 2024 β’ 31