Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published 21 days ago • 49
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 12 days ago • 86
Reasoning Datasets Collection Reasoning datasets that are trending 🔥 • 10 items • Updated 3 days ago • 17
OLMo 2 Collection Artifacts for the second set of OLMo models. • 20 items • Updated 3 days ago • 66
Article 🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 4 days ago • 30
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 75
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Paper • 2310.08659 • Published Oct 12, 2023 • 25
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 19 days ago • 117
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations Paper • 2412.13171 • Published 20 days ago • 31
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published 19 days ago • 48
Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published 24 days ago • 31
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 24 days ago • 136
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 24 days ago • 83