O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published Nov 25, 2024 • 41
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale Paper • 2409.17115 • Published Sep 25, 2024 • 61
OpenResearcher: Unleashing AI for Accelerated Scientific Research Paper • 2408.06941 • Published Aug 13, 2024 • 31
Data Contamination Report from the 2024 CONDA Shared Task Paper • 2407.21530 • Published Jul 31, 2024 • 10
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Paper • 2407.06135 • Published Jul 8, 2024 • 21
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far? Paper • 2406.16772 • Published Jun 24, 2024 • 2
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Paper • 2406.12753 • Published Jun 18, 2024 • 14
SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization Paper • 2106.01890 • Published Jun 3, 2021
Towards a Unified Multi-Dimensional Evaluator for Text Generation Paper • 2210.07197 • Published Oct 13, 2022
BARTScore: Evaluating Generated Text as Text Generation Paper • 2106.11520 • Published Jun 22, 2021 • 1
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios Paper • 2307.13528 • Published Jul 25, 2023
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing Paper • 2107.13586 • Published Jul 28, 2021
On Learning to Summarize with Large Language Models as References Paper • 2305.14239 • Published May 23, 2023
FELM: Benchmarking Factuality Evaluation of Large Language Models Paper • 2310.00741 • Published Oct 1, 2023
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization Paper • 2311.09184 • Published Nov 15, 2023 • 1
InFoBench: Evaluating Instruction Following Ability in Large Language Models Paper • 2401.03601 • Published Jan 7, 2024 • 7