Quickstart (CLI)
Get started with the cua Agent CLI in 4 steps
Get up and running with the cua Agent CLI in 4 simple steps.
Introduction
cua combines Computer (interface) + Agent (AI) for automating desktop apps. The Agent CLI provides a clean terminal interface to control your remote computer using natural language commands.
Set Up Your Computer Environment
Choose how you want to run your cua computer. Cloud containers are recommended for the easiest setup:
Easiest & safest way to get started
- Go to trycua.com/signin
- Navigate to Dashboard > Containers > Create Instance
- Create a Medium, Ubuntu 22 container
- Note your container name and API key
Your cloud container will be automatically configured and ready to use.
- Install lume cli
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/trycua/cua/main/libs/lume/scripts/install.sh)"
- Start a local cua container
lume run macos-sequoia-cua:latest
- Enable Windows Sandbox (requires Windows 10 Pro/Enterprise or Windows 11)
- Install pywinsandbox dependency
pip install -U git+git://github.com/karkason/pywinsandbox.git
- Windows Sandbox will be automatically configured when you run the CLI
-
Install Docker Desktop or Docker Engine
-
Pull the CUA Ubuntu container
docker pull --platform=linux/amd64 trycua/cua-ubuntu:latest
Install cua
Install uv
# Use curl to download the script and execute it with sh:
curl -LsSf https://astral.sh/uv/install.sh | sh
# If your system doesn't have curl, you can use wget:
# wget -qO- https://astral.sh/uv/install.sh | sh
# Use irm to download the script and execute it with iex:
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
Install Python 3.12
uv python install 3.12
# uv will install cua dependencies automatically when you use --with "cua-agent[cli]"
Run cua CLI
Choose your preferred AI model:
OpenAI Computer Use Preview
uv run --with "cua-agent[cli]" -m agent.cli openai/computer-use-preview
python -m agent.cli openai/computer-use-preview
Anthropic Claude
uv run --with "cua-agent[cli]" -m agent.cli anthropic/claude-opus-4-20250514
uv run --with "cua-agent[cli]" -m agent.cli anthropic/claude-opus-4-1-20250805
uv run --with "cua-agent[cli]" -m agent.cli anthropic/claude-sonnet-4-20250514
uv run --with "cua-agent[cli]" -m agent.cli anthropic/claude-3-5-sonnet-20241022
python -m agent.cli anthropic/claude-opus-4-1-20250805
python -m agent.cli anthropic/claude-opus-4-20250514
python -m agent.cli anthropic/claude-sonnet-4-20250514
python -m agent.cli anthropic/claude-3-5-sonnet-20241022
Omniparser + LLMs
uv run --with "cua-agent[cli]" -m agent.cli omniparser+anthropic/claude-3-5-sonnet-20241022
uv run --with "cua-agent[cli]" -m agent.cli omniparser+openai/gpt-4o
uv run --with "cua-agent[cli]" -m agent.cli omniparser+vertex_ai/gemini-pro
python -m agent.cli omniparser+anthropic/claude-3-5-sonnet-20241022
python -m agent.cli omniparser+openai/gpt-4o
python -m agent.cli omniparser+vertex_ai/gemini-pro
Local Models
# Hugging Face models (local)
uv run --with "cua-agent[cli]" -m agent.cli huggingface-local/ByteDance-Seed/UI-TARS-1.5-7B
# MLX models (Apple Silicon)
uv run --with "cua-agent[cli]" -m agent.cli mlx/mlx-community/UI-TARS-1.5-7B-6bit
# Ollama models
uv run --with "cua-agent[cli]" -m agent.cli omniparser+ollama_chat/llama3.2:latest
# Hugging Face models (local)
python -m agent.cli huggingface-local/ByteDance-Seed/UI-TARS-1.5-7B
# MLX models (Apple Silicon)
python -m agent.cli mlx/mlx-community/UI-TARS-1.5-7B-6bit
# Ollama models
python -m agent.cli omniparser+ollama_chat/llama3.2:latest
Interactive Setup
If you haven't set up environment variables, the CLI will guide you through the setup:
- Container Name: Enter your cua container name (or get one at trycua.com)
- CUA API Key: Enter your cua API key
- Provider API Key: Enter your AI provider API key (OpenAI, Anthropic, etc.)
Start Chatting
Once connected, you'll see:
💻 Connected to your-container-name (model, agent_loop)
Type 'exit' to quit.
>
You can ask your agent to perform actions like:
- "Take a screenshot and tell me what's on the screen"
- "Open Firefox and go to github.com"
- "Type 'Hello world' into the terminal"
- "Close the current window"
- "Click on the search button"
For advanced Python usage and GUI interface, see the Quickstart (GUI) and Quickstart for Developers.
For running models locally, see Running Models Locally.