view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • Nov 19, 2024 • 99
Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation Paper • 2407.10817 • Published Jul 15, 2024 • 13
Flow Judge v0.1 held-out test datasets Collection This collection contains held-out splits for testing Flow-Judge-v0.1. • 4 items • Updated Sep 14, 2024 • 2
Flow-Judge-v0.1 out-of-domain evaluation datasets Collection This collection contains out-of-domain datasets used to evaluate the generalization capabilities of Flow-Judge-v0.1 • 5 items • Updated Sep 13, 2024 • 1
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 14 days ago • 206
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 28
Model Merging Papers Collection Collection of relevant papers about model merging • 13 items • Updated Apr 2, 2024 • 5