Multilingual Web datasets
Occiglot
community
AI & ML interests
Open Source Language Models for Europe
Organization Card
Occiglot is an ongoing open research project for multilingual language models.
If you want to train a model for your own language or are working on evaluations, please contact us or join our Discord server. We are actively seeking collaborations!
Models (10)
occiglot/occiglot-7b-es-en-instruct • Text Generation • 306 • 2
occiglot/occiglot-7b-eu5 • Text Generation • 76 • 27
occiglot/occiglot-7b-de-en-instruct • Text Generation • 211 • 23
occiglot/occiglot-7b-eu5-instruct • Text Generation • 358 • 8
occiglot/occiglot-7b-it-en-instruct • Text Generation • 5.2k • 5
occiglot/occiglot-7b-fr-en-instruct • Text Generation • 33 • 3
occiglot/occiglot-7b-it-en • Text Generation • 29 • 6
occiglot/occiglot-7b-fr-en • Text Generation • 373 • 2
occiglot/occiglot-7b-de-en • Text Generation • 921 • 8
occiglot/occiglot-7b-es-en • Text Generation • 2.96k • 4
Datasets (6)
occiglot/euro-llm-leaderboard-requests • 2.95k • 2
occiglot/arcX • 1.17k • 38
occiglot/hellaswagX • 9.98k • 40
occiglot/occiglot-fineweb-v1.0 • 106 • 3
occiglot/occiglot-fineweb-v0.5 • 226M • 31 • 15
occiglot/tokenizer-wiki-bench • 84.4M • 4.77k • 4