Commit History

Added reproducibility journal metadata
52facf3
Running

meghsn commited on

Added readme, visualwebarena
90d6776

meghsn commited on

Update results/GenericAgent-Claude-3.5-Sonnet/README.md (#3)
2a1e680
verified

meghsn recursix commited on

Update results/GenericAgent-GPT-4o-mini/README.md (#4)
68ac77b
verified

meghsn commited on

Result updates
d5581cc

meghsn commited on

Added new benchmarks
97d7e59

meghsn commited on

Updated latest results
51b9b31

meghsn commited on

Cosmetic changes, update results
f4d95d8

meghsn commited on

test-agent (#1)
e80279f
verified

meghsn commited on

Updated readme for PR
3d7a66f

meghsn commited on

Readme details
92c92ae

meghsn commited on

Security checks
b667dc2

meghsn commited on

Removed old results files
2705446

meghsn commited on

Update app.py
ea81237
verified

meghsn commited on

Init leaderboard
c71b246

meghsn commited on

Init leaderboard
8627a70

meghsn commited on

initial commit
cc74085
verified

meghsn commited on