OSWorld-Verified
Benchmark ComputerAgent on OSWorld tasks using HUD
OSWorld-Verified is a curated subset of OSWorld tasks that can be run using the HUD framework.
Use ComputerAgent with HUD to benchmark on these tasks.
Benchmark ComputerAgent on OSWorld tasks using HUD
OSWorld-Verified is a curated subset of OSWorld tasks that can be run using the HUD framework.
Use ComputerAgent with HUD to benchmark on these tasks.