Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. β’ 6 items β’ Updated about 11 hours ago β’ 149
view article Article Using π€ to Train a GPT-2 Model for Music Generation By juancopi81 β’ Oct 5, 2023 β’ 8
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. β’ 4 items β’ Updated about 11 hours ago β’ 161
argilla/distilabel-capybara-dpo-7k-binarized Viewer β’ Updated Jul 16, 2024 β’ 7.56k β’ 3.71k β’ 179