xiangyi-li/webarena
mirrored 18 minutes ago
__init__.py  181 B
evaluators.py  12.3 kB
helper_functions.py  6.05 kB
fix type errors
2 years ago
fix type errors
2 years ago
Shuyan Zhou
Update README.md
3e5c8f9
release commit
3 years ago
shuyanzhou-patch-1
evaluation_harness