OCRBench-v2-leaderboard / OCRBench.csv
ling99's picture
Update OCRBench.csv
9f35b6d verified
raw
history blame
3.66 kB
Model,Language Model,Open Source,Text Recognition,Text Referring,Text Spotting,Relation Extraction,Element Parsing,Mathematical Calculation,Visual Text Understanding,Knowledge Reasoning,Average Score,Link
Qwen-VL,,,34.6,7.5,0,18.2,20,8.1,57.2,41.1,23.3,https://arxiv.org/abs/2308.12966
Qwen-VL-chat,,,34.5,4.1,0,25.9,14,13.8,55.7,39.5,23.4,https://arxiv.org/abs/2308.12966
Qwen2-VL-8B,,Yes,72.1,47.9,17.5,82.5,25.5,25.4,78.4,61.5,51.4,https://arxiv.org/abs/2409.12191
InternVL2-8B,,,49.9,23.1,0.5,65.2,24.8,26.7,73.5,52.9,39.6,https://arxiv.org/abs/2312.14238
InternVL2-26B,,,63.4,26.1,0,76.8,37.8,32.3,79.4,58.9,46.8,https://arxiv.org/abs/2312.14238
InternVL2.5-8B,,,59.0 ,25.0 ,1.4,77.5,35.1,29.4,75.3,57.2,45.0 ,https://arxiv.org/abs/2412.05271
InternVL2.5-26B,,,65.6,26.1,1.6,86.9,36.2,37.4,78.3,62.9,49.4,https://arxiv.org/abs/2412.05271
TextMonkey,,,39.1,0.7,0,19.0 ,12.2,19.0 ,61.1,40.2,23.9,https://arxiv.org/abs/2403.04473
LLaVA-Next-8B,,,41.3,18.8,0,49.5,21.2,17.3,55.2,48.9,31.5,https://github.com/Darren-greenhand/LLaVA-Next
Monkey,,,35.2,0,0,16.6,16.3,14.4,59.8,42.3,23.1,https://arxiv.org/abs/2311.06607
XComposer2-4KHD,,,45.1,21.8,0.1,15.9,11.7,15.7,66.8,45.9,27.9,https://arxiv.org/abs/2404.06512
Molmo-7B,,,52.4,21.3,0.1,45.5,7.6,28.5,65.3,55.0 ,34.5,https://arxiv.org/abs/2409.17146
EMU2-chat,,,42.1,0.2,0,12.5,8.1,11.2,42.7,33.4,18.8,https://arxiv.org/abs/2312.13286
mPLUG-Owl3,,,41.6,14,0.6,24.4,10.9,11.1,52.2,46.0 ,25.1,https://arxiv.org/abs/2408.04840
CogVLM-chat,,,50.9,0,0,0.2,8.4,15.0 ,58.1,41.7,21.8,https://arxiv.org/abs/2311.03079
Deepseek-VL-7B,,,37.1,15.4,0,23.5,14.6,20.8,53.3,52.9,27.2,https://arxiv.org/abs/2403.05525
GLM-4V-9B,,,61.8,22.6,0,71.7,31.6,22.6,72.1,58.4,42.6,https://arxiv.org/abs/2406.12793
MiniCPM-V-2.6,,,66.8,6,0.8,62.0 ,28.8,32.4,73.7,52.1,40.3,https://arxiv.org/abs/2408.01800
TextHarmony,,,25.8,2.5,0,1.8,8.5,10.4,46.1,33.1,16.0 ,https://arxiv.org/abs/2407.16364
VILA1.5-8B,,,35.3,15.5,0,21.1,12.7,17.3,46.3,40.3,23.6,https://arxiv.org/abs/2412.04468
LLaVAR,,,37.3,0,0,1.0 ,9.9,12.3,34.6,27.0 ,15.3,https://arxiv.org/abs/2306.17107
DocOwl2,,,24.0 ,9.7,0,13.4,13.5,8.8,53.7,32.0 ,19.4,https://arxiv.org/abs/2409.03420
UReader,,,22.4,0.1,0,0,9.2,7.9,41.0 ,29.1,13.7,https://arxiv.org/abs/2310.05126
Yi-VL-6B,,,28.9,2.9,0,9.7,12.9,15.8,36.1,32.0 ,17.3,https://arxiv.org/abs/2403.04652
Janus-1.3B,,,46.1,0,0,0.2,14.5,13.5,36.0 ,39.1,18.7,https://arxiv.org/abs/2410.13848
Cambrian-1-8B,,,45.3,21.5,0,53.6,19.2,19.5,63.5,55.5,34.7,https://arxiv.org/abs/2406.16860
LLaVA-OV-7B,,,46.0 ,20.8,0.1,58.3,25.3,23.3,64.4,53.0 ,36.4,https://arxiv.org/abs/2408.03326
Eagle-X5-7B,,,34.7,17.8,0,21.7,20.6,21.5,61.0 ,42.6,27.5,https://arxiv.org/abs/2408.15998
Idefics3-8B,,,23.8,13.2,0,63.2,23.8,23.0 ,65.8,44.9,32.2,https://arxiv.org/abs/2408.12637
Ovis1.6-3B,,,59.2,14.3,0,65.0 ,32.1,29.0 ,69.8,56.8,40.8,https://arxiv.org/abs/2405.20797
Pixtral-12B,,,48.9,21.6,0,66.3,35.5,29.8,66.9,53.7,40.3,https://arxiv.org/abs/2410.07073
GLM-4V-Plus,,,60.3,25.2,0,74.7,37.6,26.4,61.4,57.2,42.9,https://arxiv.org/abs/2406.12793
GPT-4V,,,69.7,26.9,0.3,75.6,36.7,42.9,71.5,57.9,47.7,https://openai.com/index/gpt-4v-system-card/
GPT-4o,,,61.2,26.7,0,77.5,36.3,43.4,71.1,55.5,46.5,https://arxiv.org/abs/2303.08774
GPT-4o-mini,,,57.9,23.3,0.6,70.8,31.5,38.8,65.9,55.1,43,https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/
Gemini-Pro,,No,61.2,39.5,13.5,79.3,39.2,47.7,75.5,59.3,51.9,https://arxiv.org/abs/2312.11805
Claude3.5-sonnet,,,62.2,28.4,1.3,56.6,37.8,40.8,73.5,60.9,45.2,https://www.anthropic.com/news/claude-3-5-sonnet
Step-1V,,,67.8,31.3,7.2,73.6,37.2,27.8,69.8,58.6,46.7,https://www.stepfun.com/#step1v