LogoCua Documentation

Interactive Tool

Real-time testing and visualization tool for GUI grounding models

This tool allows you to test multiple models interactively by providing natural language instructions. It automatically captures screenshots and tests all configured models sequentially, providing immediate feedback and visual results.

Usage

# Start the interactive tool
cd libs/python/agent/benchmarks
python interactive.py

Commands

  • Type instruction: Screenshot + test all models
  • screenshot: Take screenshot without prediction
  • models: List available models
  • quit/exit: Exit the tool

On this page