Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
agent-evals
/
leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
leaderboard
1 contributor
History:
36 commits
benediktstroebl
fixed tooltip error
2c6a894
about 2 months ago
agent_monitor
minor tweaks
3 months ago
utils
fixed tooltip error
about 2 months ago
.gitattributes
Safe
1.62 kB
Update .gitattributes
2 months ago
.gitignore
Safe
138 Bytes
added trace download links
about 2 months ago
README.md
Safe
236 Bytes
init v1
3 months ago
about.md
Safe
5.39 kB
init v1
3 months ago
agent_performance_analysis.json
Safe
5.08 kB
init v1
3 months ago
agent_submission.md
Safe
766 Bytes
init v1
3 months ago
agent_submission_core.md
Safe
2.77 kB
init v1
3 months ago
agents_metadata.yaml
Safe
12.1 kB
added o1 core hard
2 months ago
app.py
Safe
61.4 kB
added creator section
about 2 months ago
benchmark_submission.md
Safe
496 Bytes
init v1
3 months ago
config.py
Safe
3.8 kB
big update with dynamic pricing, agent metadata, about page on top, and new benchmarks
2 months ago
cost_explanation.md
Safe
703 Bytes
added cost and heatmap explanation
about 2 months ago
creators.md
Safe
1.53 kB
added creator section
about 2 months ago
css.css
Safe
2.35 kB
update fontsize
about 2 months ago
envs.py
Safe
191 Bytes
init v1
3 months ago
hal.ico
Safe
15.4 kB
init v1
3 months ago
hal.png
Safe
1.03 kB
init v1
3 months ago
heatmap_explanation.md
Safe
379 Bytes
added cost and heatmap explanation
about 2 months ago
process.py
Safe
142 Bytes
added creator section
about 2 months ago
requirements.txt
Safe
1.84 kB
init v1
3 months ago