Hub
    Docs
Try for Free
xiangyi-li
/
webarena
mirrored 13 minutes ago
Benchmark CardFiles and versionsLeaderboard
  • Hub
  • Contact
DiscordGitHubXLinkedIn
1
  • configs
    -
    ​
  • test_evaluators.py
    10.3 kB
    ​
  • test_helper_functions.py
    946 B
    ​
  1. /
  2. tests
  3. test_evaluation_harness
remove exact from evalutor names
3 years ago
update test example due to html escape
3 years ago
Shuyan ZhouUpdate README.md3e5c8f9
remove beartype for efficency purpose
3 years ago