OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper • 2412.07626 • Published 27 days ago • 21
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 121
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 75
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale Paper • 2410.03115 • Published Oct 4, 2024 • 1
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis Paper • 2409.20059 • Published Sep 30, 2024 • 15
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper • 2409.12568 • Published Sep 19, 2024 • 48
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 136
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20, 2024 • 46
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Paper • 2404.16821 • Published Apr 25, 2024 • 55
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14, 2024 • 541
Transformers Can Represent n-gram Language Models Paper • 2404.14994 • Published Apr 23, 2024 • 18
TextSquare: Scaling up Text-Centric Visual Instruction Tuning Paper • 2404.12803 • Published Apr 19, 2024 • 29
BRAVE: Broadening the visual encoding of vision-language models Paper • 2404.07204 • Published Apr 10, 2024 • 18
The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26, 2024 • 78