arxiv:2412.14161
Graham Neubig
gneubig
AI & ML interests
NLP
Recent Activity
updated
a dataset
11 days ago
gneubig/aime-1983-2024
authored
a paper
13 days ago
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
Tasks
upvoted
a
paper
13 days ago
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
Tasks
Organizations
Papers
20
models
None public yet