datasets: | |
- HuggingFaceFW/fineweb | |
- PleIAs/YouTube-Commons | |
- allenai/WildChat-1M | |
- mlabonne/orpo-dpo-mix-40k | |
- HuggingFaceM4/the_cauldron | |
- Anthropic/persuasion | |
- H-D-T/Buzz | |
- PleIAs/Post-OCR-Correction | |
language: | |
- en | |
metrics: | |
- accuracy | |
- bertscore | |