-
Holistic Evaluation of Text-To-Image Models
Paper • 2311.04287 • Published • 11 -
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Paper • 2311.07463 • Published • 13 -
Trusted Source Alignment in Large Language Models
Paper • 2311.06697 • Published • 10 -
DiLoCo: Distributed Low-Communication Training of Language Models
Paper • 2311.08105 • Published • 14
Collections
Discover the best community collections!
Collections including paper arxiv:2311.12983
-
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 187 -
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models
Paper • 2404.02575 • Published • 48 -
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Paper • 2404.05719 • Published • 83 -
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Paper • 2409.02897 • Published • 45