Refactor examples; Start to load examples into benchmark; vlc initialization
2 years ago
Dunjie Lu: Merge pull request #452 from xlang-ai/dev_djlu/gpt54_agent
optimize gpt5.4 prompt (cda933f)
feat: add gpt-5.4 computer-use runner (#450)
* feat: add gpt-5.4 computer-use runner
* fix: trim unrelated gpt54 debug logging
* chore: default gpt54 reasoning effort to xhigh
2 days ago
Updated misc:get_rule_relativeTime to support list in relativeRules[expected][time] (#447)
5 days ago
Add Claude Sonnet 4.5 support and improve action handling (#362)
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-authored-by: Claude <noreply@anthropic.com>
4 months ago
fix(os_symphony_evaluation) (#410)
* fix(os_symphony)
* Update desktop_env_os_symphony.py
* fix(os_symphony_desktop)
* fix(os_symphony_start)
* Add docstring to run_multienv_os_symphony.py
Added documentation header for the evaluation script.
2 months ago
Fix #347: Fix NameError in open_file timeout message (#351)
- Fix undefined 'timeout' variable in error message
- Use defined TIMEOUT constant instead of undefined timeout variable
- Prevents NameError when LibreOffice crashes during file opening
5 months ago
Refactoring VMware Integration and Implementing AWS Support (#44)
* Initialize aws support
* Add README for the VM server
* Refactor OSWorld for supporting more cloud services.
* Initialize vmware and aws implementation v1, waiting for verification
* Initialize files for azure, gcp and virtualbox support
* Debug on the VMware provider
* Fix on aws interface mapping
* Fix instance type
* Refactor
* Clean
* hk region; debug
* Fix lock
* Remove print
* Remove key_name requirements when allocating aws vm
* Clean README
---------
Co-authored-by: XinyuanWangCS <xywang626@gmail.com>
2 years ago