HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 10 days ago • 82
view article Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • about 1 month ago • 75
view article Article Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline: By Omartificial-Intelligence-Space • Nov 30, 2024 • 6
view article Article To what extent are we responsible for our content and how to create safer Spaces? By davidberenstein1957 • Aug 30, 2024 • 3
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 127
view article Article Let’s make a generation of amazing image generation models By burtenshaw • Nov 26, 2024 • 34
view article Article Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models By mikelabs • Nov 21, 2024 • 2
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • Nov 21, 2024 • 35
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13, 2024 • 98
Marqo-Ecommerce-Embeddings Collection State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated Nov 14, 2024 • 17
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7, 2024 • 49
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 183
view article Article Recipe: Preparing Multilingual Speech Datasets for TTS Training By PHBJT • Nov 4, 2024 • 14
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated Oct 31, 2024 • 17