MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published Oct 17, 2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Paper • 2404.07972 • Published Apr 11, 2024
Mobile-Env: An Evaluation Platform and Benchmark for Interactive Agents in LLM Era Paper • 2305.08144 • Published May 14, 2023