A CDP-direct browser automation MCP server with 27 intelligent subsystems for AI agent constraint, supporting multi-tab parallelism, token compression, and anti-detection.
OpenChrome is an open-source browser automation MCP (Model Context Protocol) server that enables AI agents to control real Chrome browsers directly via Chrome DevTools Protocol with zero middleware layers. Its core differentiator is the "Harness Engineering" system — 27 intelligent subsystems across 7 categories (Guidance, Resilience, Protection, Feedback, Learning, Optimization, Detection) that prevent error cascades, auto-recover from failures, and stop token consumption runaway.
OpenChrome exposes 46 MCP tools covering navigation/interaction, page reading (accessibility tree / dom / css modes), environment control, storage/debugging, parallel workflows, and cross-session memory. DOM Mode achieves 15x compression to significantly reduce token costs. Anti-detection uses real Chrome fingerprints with a 3-level auto-fallback chain and built-in Cloudflare Turnstile/CDN-WAF bypass. It supports 1-process N-tab parallelism (claimed ~300MB for 20 sessions) with session persistence maintaining login state across restarts. Built with TypeScript + Node.js (≥18), CDP communication via rebrowser-puppeteer-core, Webpack bundling, and Jest testing. A cross-platform desktop app (Beta) is available alongside the CLI (macOS/Windows/Linux).
Harness Engineering Subsystems#
| Category | Key Subsystems | Purpose |
|---|---|---|
| Guidance | Hint Engine (30+ rules, 9 types), Progress Tracker, Usage Guide | Prevent before error cascades |
| Resilience | Ralph Engine (7-strategy waterfall), Auto-Reconnect, Ref Self-Healing | Auto-recover from failures |
| Protection | 3-level Circuit Breaker, Rate Limiter, Domain Guard | Stop token consumption runaway |
| Feedback | Outcome Classifier, DOM Delta, Visual Summary, Hit Detection | Report actual operation results |
| Learning | Pattern Learner, Strategy Learner, Domain Memory | Cross-session intelligence improvement |
| Optimization | DOM Mode (15x compression), Adaptive Screenshot, Snapshot Delta | Minimize token consumption |
| Detection | Auth Redirect Detection, Blocking Page, Pagination Detector | Early identification of anomalous page states |
MCP Tool Set (46 tools, 6 categories)#
- Navigation & Interaction:
navigate,interact,fill_form,find,computer - Reading & Extraction:
read_page(accessibility tree / dom / css modes),page_content,javascript_tool,selector_query,xpath_query - Environment Control:
emulate_device,geolocation,user_agent,network - Storage & Debugging:
cookies,storage,console_capture,performance_metrics,request_intercept - Parallel Workflows:
workflow_init,workflow_collect,worker_create,batch_execute - Memory:
memory_record,memory_query,memory_validate
Quick Start#
npx openchrome-mcp setup # Auto-configure Claude Code
npx openchrome-mcp setup --client codex # Auto-configure Codex CLI
oc serve --auto-launch # Start server
oc serve --headless-shell # Headless mode
oc doctor # Diagnose issues
Unconfirmed Information#
- npm package
openchrome-mcppublication status not directly verified - Desktop app framework (Tauri or Electron) unconfirmed
- Performance benchmark data ("80x faster", "5x cheaper", etc.) are project claims, not independently verified
- Full list of 49 reliability mechanisms not completely documented in README