HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Paper • 2412.21199 • Published 4 days ago • 9
SelfCodeAlign: Self-Alignment for Code Generation Paper • 2410.24198 • Published Oct 31, 2024 • 23
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 53
Evaluating and Aligning CodeLLMs on Human Preference Paper • 2412.05210 • Published 29 days ago • 47
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 57
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation Paper • 2410.23090 • Published Oct 30, 2024 • 54
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 18 days ago • 30
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization Paper • 2410.08815 • Published Oct 11, 2024 • 44
Law of the Weakest Link: Cross Capabilities of Large Language Models Paper • 2409.19951 • Published Sep 30, 2024 • 54
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 104
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 29 days ago • 551
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 12 days ago • 93
EuroLLM: Multilingual Language Models for Europe Paper • 2409.16235 • Published Sep 24, 2024 • 25
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated Nov 27, 2024 • 29
view article Article Synthetic dataset generation techniques: generating custom sentence similarity data By davanstrien • May 23, 2024 • 16