Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
de-RodrigoΒ 
posted an update 1 day ago
Post
301
MERIT Dataset πŸŽ’πŸ“ƒπŸ† Updates: The Token Classification Version is Now Live on the Hub!

This new version extends the previous dataset by providing richer labels that include word bounding boxes alongside the already available images. πŸš€

We can't wait to see how you use this update! Give it a try, and let us know your thoughts, questions, or any cool projects you build with it. πŸ’‘

Resources:

- Dataset: de-Rodrigo/merit
- Code and generation pipeline: https://github.com/nachoDRT/MERIT-Dataset
- Paper: The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts (2409.00447)
In this post