Automatic Evaluation of Attribution by Large Language Models Paper • 2305.06311 • Published May 10, 2023
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data Paper • 2402.08831 • Published Feb 13, 2024
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator Paper • 2402.10890 • Published Feb 16, 2024
Bootstrapping a User-Centered Task-Oriented Dialogue System Paper • 2207.05223 • Published Jul 11, 2022
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery Paper • 2410.05080 • Published Oct 7, 2024 • 20
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents Paper • 2410.05243 • Published Oct 7, 2024 • 18
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery Paper • 2410.05080 • Published Oct 7, 2024 • 20