EuroLLM: Multilingual Language Models for Europe Paper • 2409.16235 • Published Sep 24, 2024 • 26
Contextual Position Encoding: Learning to Count What's Important Paper • 2405.18719 • Published May 29, 2024 • 5
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 124
Harvesting Textual and Structured Data from the HAL Publication Repository Paper • 2407.20595 • Published Jul 30, 2024 • 22
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17, 2024 • 21
Article Docmatix - a huge dataset for Document Visual Question Answering Jul 18, 2024 • 72
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5, 2024 • 188
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published Jul 3, 2024 • 93
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems Paper • 2407.01370 • Published Jul 1, 2024 • 86
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 43
Adam-mini: Use Fewer Learning Rates To Gain More Paper • 2406.16793 • Published Jun 24, 2024 • 68
Article An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct By leonardlin • Jun 11, 2024 • 50