Quick Reference
| Tool | Agent | Default Resolution |
|---|---|---|
AnthropicComputerTool | Claude | 1280×720 |
OpenAIComputerTool | OpenAI / Operator | 1920×1080 |
GeminiComputerTool | Gemini | 1280×720 |
GLMComputerTool | GLM-V | 1024×768 |
HudComputerTool | Any (function calling) | 1280×720 |
AnthropicComputerTool
For Claude. Usescomputer_20250124 native API.
screenshot, click, double_click, mouse_move, left_click_drag, type, key, scroll, wait
OpenAIComputerTool
For OpenAI and Operator. Usescomputer_use_preview native API.
screenshot, click, double_click, scroll, type, wait, move, keypress, drag
GLMComputerTool
For GLM-4.6V and later. Uses normalized 0–999 coordinates automatically rescaled to screen pixels.left_click, right_click, middle_click, hover, left_double_click, left_drag, key, type, scroll, screenshot, WAIT, DONE, FAIL
HudComputerTool
Generic computer tool for any agent via function calling. Use when you need provider-agnostic control.screenshot, click, write, press, scroll, drag, move, wait
Executors
Computer tools use executors for the actual system interaction:| Executor | Platform | Notes |
|---|---|---|
PyAutoGUIExecutor | Cross-platform | Default on Windows/macOS |
XDOExecutor | Linux/X11 | Faster, uses xdotool |
Coordinate Scaling
Agent coordinates don’t have to match actual display resolution. The tool handles scaling:rescale_images=True resizes screenshots to agent dimensions. rescale_images=False keeps full resolution (coordinates still scale).