computer-control-mcp
An MCP server that enables desktop automation, including mouse control, keyboard input, screenshots, OCR, and window management, allowing LLMs to interact with graphical user interfaces.
About this tool
computer-control-mcp
Description
computer-control-mcp is an MCP server that enables desktop automation, including mouse control, keyboard input, screenshots, OCR, and window management. It allows large language models (LLMs) or other agents to interact with graphical user interfaces. The server is implemented using PyAutoGUI, RapidOCR, and ONNXRuntime, and is designed to have zero external dependencies. It is similar in functionality to 'computer-use' by Anthropic.
Features
- Mouse control (move, click, etc.)
- Keyboard control (type, press keys, etc.)
- Screen management (taking screenshots)
- Window management
- Optical Character Recognition (OCR) using RapidOCR and ONNXRuntime
- Zero external dependencies (all required packages are included)
- API for programmatic interaction
Category
code-execution-automation-mcp-servers
Tags
mcp, automation, desktop, gui
Pricing
No pricing information provided. The project is open source and released under the MIT License.
Loading more......
Information
Categories
Tags
Similar Products
6 result(s)An MCP server providing control over Android devices via ADB, including screenshot capture, UI analysis, package management, and command execution.
A secure MCP server for executing shell commands with robust security controls, enabling protected command-line operations. Directly relevant as a widely-used MCP server for secure CLI automation.
MCP servers for automating and controlling the user's browser using existing browser profiles, ideal for tasks requiring logged-in sessions and avoiding bot detection. Includes implementations by browsermcp and modelcontext.
An MCP server that integrates with 3D printer management systems (OctoPrint, Klipper, Duet, etc.) enabling remote control, STL file manipulation, slicing, and advanced print job workflows, highlighting MCP servers in IoT and device automation.
Integrates Playwright with MCP to enable automated web interactions, scraping, and testing across browsers.
An MCP server paired with a browser extension that allows LLMs to control Firefox browsers.