CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean Paper • 2403.06412 • Published Mar 11, 2024 • 3
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks Paper • 2402.13482 • Published Feb 21, 2024
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4, 2024 • 37
FEVER: a large-scale dataset for Fact Extraction and VERification Paper • 1803.05355 • Published Mar 14, 2018
FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information Paper • 2106.05707 • Published Jun 10, 2021
Can Large Language Models Infer and Disagree Like Humans? Paper • 2305.13788 • Published May 23, 2023
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 64
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Paper • 2310.08491 • Published Oct 12, 2023 • 53
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets Paper • 2307.10928 • Published Jul 20, 2023 • 12