Florian Zimmermeister
flozi00
AI & ML interests
ASR,
German LLM
Recent Activity
updated
a dataset
about 18 hours ago
flozi00/Fineweb2-German-Eduscore-4andMore
new activity
3 days ago
flozi00/Fineweb2-German-Eduscore-4andMore:Librarian Bot: Add language metadata for dataset
replied to
their
post
4 days ago
π Progress in the German FineWeb edu reproduction π
We're delighted to share the launch of our new Data Quality Classification Model, designed specifically for evaluating educational content in German. This tool uses advanced machine learning techniques to assess texts across all educational levels, from primary school to university.
π Inspired by Huggingface's fine web edu dataset, we've worked hard to refine data classification methods ensuring educators and learners access top-quality resources.
We're excited about the future as we continue improving our models and expanding our datasets.
Access the model here: https://huggingface.co/pL-Community/GermanEduScorer-Qwen2-1.5b
π A huge thank you to David and Daryoush from Vago Solutions; BjΓΆrn and Jan from Ellamind / DiscoResearch for their expert insights throughout this project. Your support has been crucial.
This project was made possible by the support of PrimeLine AI.
Organizations
flozi00's activity
Librarian Bot: Add language metadata for dataset
#2 opened 3 days ago
by
librarian-bot
Librarian Bot: Add language metadata for dataset
#1 opened 16 days ago
by
librarian-bot
Convert to .bin?
4
#4 opened 2 months ago
by
Artmart23
german or swiss-german
2
#5 opened 2 months ago
by
jschoene
Comparison with the distilled model
4
#3 opened 3 months ago
by
eustlb
Evaluations are a bit disingenuous
9
#1 opened 3 months ago
by
Laurin-myreha
Nur ca. die erste Minute wird transkribiert
3
#6 opened 4 months ago
by
DonatusOrth
Update config.json
#1 opened 3 months ago
by
flozi00
Update config.json
#1 opened 3 months ago
by
flozi00
Update config.json
#1 opened 3 months ago
by
flozi00
Update config.json
#1 opened 3 months ago
by
flozi00
Update config.json
#1 opened 3 months ago
by
flozi00
Upload tokenizer.json
#2 opened 4 months ago
by
erikinfo
Compatibility with other Clients
3
#5 opened 6 months ago
by
Narbs
Adding `safetensors` variant of this model
#2 opened 7 months ago
by
SFconvertbot
Adding ONNX file of this model
#1 opened 8 months ago
by
rost01
Adding `safetensors` variant of this model
#1 opened 9 months ago
by
SFconvertbot
Base Model or Finetuned Version?
13
#2 opened 9 months ago
by
jphme
Adding `safetensors` variant of this model
#1 opened 10 months ago
by
SFconvertbot
Upload tokenizer.json
1
#3 opened 10 months ago
by
aementio