OpenHands

community

https://github.com/All-Hands-AI/OpenHands

Recent Activity

huybery authored a paper about 16 hours ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

xingyaoww authored a paper 2 days ago

Training Software Engineering Agents and Verifiers with SWE-Gym

huybery authored a paper 12 days ago

Iterative Forward Tuning Boosts In-Context Learning in Language Models

spaces 1

OpenHands Evaluation Benchmark

models 1

OpenHands/CodeQwen1.5-7B-OpenDevin

Text Generation • Updated May 25, 2024 • 26 • 15

datasets 7

OpenHands/eval-output-webarena

Updated Jul 20, 2024 • 11

OpenHands/eval-browsing-instructions

Viewer • Updated Jul 15, 2024 • 933 • 2

OpenHands/eval-output-miniwob

Updated Jun 10, 2024 • 7

OpenHands/SWE-bench-devin-passed

Viewer • Updated Apr 9, 2024 • 79 • 32

OpenHands/SWE-bench-devin-full-filtered

Viewer • Updated Apr 9, 2024 • 450 • 39 • 1

OpenHands/SWE-bench-devin-full

Viewer • Updated Apr 9, 2024 • 570 • 43

OpenHands/Devin-SWE-bench-output

Viewer • Updated Mar 21, 2024 • 1.14k • 43