Magpie-Llama3.3 Datasets Collection Dataset built with Meta Llama 3.3 70B. • 3 items • Updated 4 days ago • 1
GRAPE: Generalizing Robot Policy via Preference Alignment Paper • 2411.19309 • Published Nov 28, 2024 • 42
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11, 2024 • 35
view article Article 🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦⬛ By anakin87 • Oct 21, 2024 • 18
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 136
MagpieLM Collection Aligning LMs with Fully Open Recipe + Synthetic Data Generated from Open-Source LMs. • 9 items • Updated 2 days ago • 15
Magpie Open Recipes Collection Open-aligned models using Magpie datasets. • 11 items • Updated 4 days ago • 1
Magpie-Llama3.1 Datasets Collection Dataset built with Meta Llama 3.1 70B. • 6 items • Updated 4 days ago • 3
Zebra Logic Bench Collection ZebraLogic Bench: Testing the Limits of LLMs in Logical Reasoning • 4 items • Updated Nov 27, 2024 • 5
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates Paper • 2406.12935 • Published Jun 17, 2024 • 1
synthetic-data-generation-demos Collection A collection of demos for various approaches to synthetic data generation • 4 items • Updated Jun 25, 2024 • 14
Magpie-Qwen2 Datasets Collection Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated 4 days ago • 10
view article Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation By davanstrien • Jun 20, 2024 • 12
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 61
Magpie-Air Datasets (Llama-3) Collection Built with Meta Llama 3 8B • 6 items • Updated 4 days ago • 1
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs Paper • 2402.11753 • Published Feb 19, 2024 • 5
Magpie-Pro Datasets (Llama-3) Collection Dataset built with Meta Llama 3 70B. Models are fine-tuned from Llama 3 8B. • 6 items • Updated 4 days ago • 16
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12, 2024 • 65
SimPO Collection This collections contains a list of SimPO and baseline models. • 49 items • Updated Nov 7, 2024 • 17