OCRBench-v2-leaderboard / OCRBench_cn.csv
Model,Open Source,Text Recognition,Relation Extraction,Element Parsing,Visual Text Understanding,Knowledge Reasoning,Average Score,Link
InternVL2.5-8B,Yes,52.8,52.8,28.6,56.4,40.5,46.2,https://arxiv.org/abs/2412.05271
InternVL2.5-26B,Yes,32.4,56.1,32.6,56.3,43.6,44.2,https://arxiv.org/abs/2412.05271
Gemini-Pro,No,52.5,47.3,30.9,51.5,33.4,43.1,https://arxiv.org/abs/2312.11805
Qwen2-VL-8B,Yes,51.3,51.4,21.6,52.5,37.5,42.9,https://arxiv.org/abs/2409.12191
Step-1V,No,56.7,41.1,37.6,38.3,39.2,42.6,https://www.stepfun.com/#step1v
GPT-4V,No,49.9,52.2,34.6,40.8,22.9,40.1,https://openai.com/index/gpt-4v-system-card/
Claude3.5-sonnet,No,21,56.2,35.2,55,30.5,39.6,https://www.anthropic.com/news/claude-3-5-sonnet
GLM-4V-Plus,No,34.5,60.6,23.9,49.8,28.2,39.4,https://arxiv.org/abs/2406.12793
InternVL2-26B,Yes,21.9,46,34.8,50.9,34.8,37.7,https://arxiv.org/abs/2312.14238
GLM-4V-9B,Yes,24.4,60.6,20.4,52.8,25.2,36.6,https://arxiv.org/abs/2406.12793
InternVL2-8B,Yes,20.6,45.2,23.2,54.4,38.1,36.3,https://arxiv.org/abs/2312.14238
MiniCPM-V-2.6,Yes,51,29.9,21.2,34,33.6,33.9,https://arxiv.org/abs/2408.01800
GPT-4o,No,21.6,53,29.8,38.5,18.2,32.2,https://arxiv.org/abs/2303.08774
GPT-4o-mini,No,13.1,38.9,27.2,28.8,16.9,25,https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/
Ovis1.6-3B,Yes,11.5,23.7,22.8,28.8,18.9,21.1,https://arxiv.org/abs/2405.20797
LLaVA-OV-7B,Yes,14.8,15.7,13.7,16,28.7,17.8,https://arxiv.org/abs/2408.03326
TextMonkey,Yes,23.5,14.8,8.4,19.9,12.2,15.8,https://arxiv.org/abs/2403.04473
XComposer2-4KHD,Yes,16.7,18.8,12.1,27.5,2.3,15.5,https://arxiv.org/abs/2404.06512
Pixtral-12B,Yes,13.4,10.9,21,7,20.7,14.6,https://arxiv.org/abs/2410.07073
mPLUG-Owl3,Yes,6.6,17.9,9.7,6,26.1,13.3,https://arxiv.org/abs/2408.04840
Monkey,Yes,4.6,11.2,8.4,21.5,20,13.1,https://arxiv.org/abs/2311.06607
Idefics3-8B,Yes,7,15.5,15.9,9,18.1,13.1,https://arxiv.org/abs/2408.12637
Molmo-7B,Yes,7.1,15,9.2,9,23.7,12.8,https://arxiv.org/abs/2409.17146
Deepseek-VL-7B,Yes,8,13.3,15.7,5.5,18.5,12.2,https://arxiv.org/abs/2403.05525
Qwen-VL-chat,Yes,9.5,8.2,9.3,11,21.1,11.8,https://arxiv.org/abs/2308.12966
Eagle-X5-7B,Yes,7.5,12,11.6,5,19.2,11.1,https://arxiv.org/abs/2408.15998
Cambrian-1-8B,Yes,5.3,14.9,12.6,8.5,8.1,9.9,https://arxiv.org/abs/2406.16860
Yi-VL-6B,Yes,4.8,4.4,8.5,4,25,9.4,https://arxiv.org/abs/2403.04652
Qwen-VL,Yes,7.2,5.3,10.7,11.5,11.2,9.2,https://arxiv.org/abs/2308.12966
LLaVA-Next-8B,Yes,5.7,2.9,12.2,7.5,17.2,9.1,https://github.com/Darren-greenhand/LLaVA-Next
Janus-1.3B,Yes,7.6,8.7,11.4,4.5,10.7,8.6,https://arxiv.org/abs/2410.13848
ViLA1.5-8B,Yes,5.4,8.8,8.5,3,15.5,8.2,https://arxiv.org/abs/2412.04468
DocOwl2,Yes,4.2,10.3,8.6,4,9.6,7.3,https://arxiv.org/abs/2409.03420
CogVLM-chat,Yes,5.5,10,9.8,1.5,2.5,5.9,https://arxiv.org/abs/2311.03079
TextHarmony,Yes,1.8,4.5,8.2,1.5,11.9,5.6,https://arxiv.org/abs/2407.16364
UReader,Yes,6.8,2.7,8.4,2.5,7.2,5.5,https://arxiv.org/abs/2310.05126
EMU2-chat,Yes,2.3,0.5,8.5,1,7.3,3.9,https://arxiv.org/abs/2312.13286
LLaVAR,Yes,2.3,1.7,8.9,0,2.5,3.1,https://arxiv.org/abs/2306.17107