GUI Agent Harness
Date:
A vision-based GUI agent that observes the screen, plans, clicks, and verifies by vision, driving desktop applications and OSWorld VMs. It runs on macOS, Windows, and Linux, with perception tuned for macOS. On the OSWorld Multi-Apps benchmark it reaches 79.8% (72.6 / 91).
