--- datasets: - HuggingFaceFW/fineweb - PleIAs/YouTube-Commons - allenai/WildChat-1M - mlabonne/orpo-dpo-mix-40k - HuggingFaceM4/the_cauldron - Anthropic/persuasion - H-D-T/Buzz - PleIAs/Post-OCR-Correction language: - en metrics: - accuracy - bertscore ---