Hub
Docs
Try for Free
xiangyi-li
/
webarena
mirrored 13 minutes ago
Benchmark Card
Files and versions
Leaderboard
like
1
configs
-
test_evaluators.py
10.3 kB
test_helper_functions.py
946 B
shuyanzhou-patch-1
/
tests
test_evaluation_harness
remove exact from evalutor names
3 years ago
update test example due to html escape
3 years ago
Shuyan Zhou
Update README.md
3e5c8f9
remove beartype for efficency purpose
3 years ago