How to Control Chrome with AI: The Complete Guide to Browserbase MCP Server

Learn how to use Browserbase MCP Server to let AI models control Chrome browsers for web automation. Step-by-step setup for developers, plus real-world examples and troubleshooting. Integrate with Apidog for streamlined workflows.

Ashley Goolam

Ashley Goolam

29 January 2026

How to Control Chrome with AI: The Complete Guide to Browserbase MCP Server

Ever wanted to empower your AI to interact with the web like a real user—navigating sites, clicking buttons, filling out forms, and extracting data? With the Browserbase MCP Server, this is not just possible—it's surprisingly straightforward. In this developer-focused tutorial, you'll learn how to connect AI models like Claude, GPT, or Gemini to real browser sessions and automate web tasks efficiently.

Whether you're building intelligent bots, automating QA workflows, or prototyping new data-driven tools, this guide will show you why Browserbase MCP Server is essential and how to set it up seamlessly.


💡 Looking for an API testing platform that creates beautiful API documentation and powers collaborative developer workflows? Apidog combines powerful testing, streamlined docs, and team productivity—all at a fraction of Postman's cost.

button

What Is Browserbase MCP Server?

The Browserbase MCP Server is an open-source tool that connects large language models (LLMs) to web browsers using the Model Context Protocol (MCP). Think of it as a secure, programmable bridge between AI and real-world browser automation.

With Browserbase MCP Server, an AI can:

This capability unlocks advanced automation and data extraction without manual intervention—ideal for API-driven teams, backend engineers, and QA automation specialists.

Compatible with: OpenAI GPT, Anthropic Claude, Google Gemini, and other MCP-supported models.


Why Use Browserbase MCP Server for AI-Driven Automation?

For developers and teams, Browserbase MCP Server stands out for several reasons:

Whether you're automating repetitive web tasks, building data pipelines, or integrating browser automation into CI/CD flows, Browserbase MCP Server saves engineering hours while enabling robust and reliable testing.


How to Install and Configure Browserbase MCP Server

You can start using the Browserbase MCP Server in three ways:

  1. Hosted remote server (easiest)
  2. Via npm (recommended)
  3. Local installation (full control)

Prerequisites

Before setup, ensure you have:

browserbase official website

For local setup, also install:


Option 1: Hosted Remote Server (Fastest Setup)

Skip installation and use Browserbase’s cloud-hosted MCP Server:

  1. Get Your Credentials:
    Go to smithery.ai to obtain your API keys and MCP server URL.

  2. Configure Your AI Client:
    Add the URL to your AI client's MCP settings.
    For Claude Desktop, update your claude_desktop_config.json:

    {
      "mcpServers": {
        "browserbase": {
          "url": "https://your-smithery-url.com"
        }
      }
    }
    

    For the mcp-remote CLI:

    {
      "mcpServers": {
        "browserbase": {
          "command": "npx",
          "args": ["mcp-remote", "https://your-smithery-url.com"]
        }
      }
    }
    
  3. Restart Your Client to activate the new settings.

Your AI client now has full browser automation via Browserbase MCP Server.

smithery ai


Run the server directly via npm for maximum compatibility:

  1. In your MCP config JSON, add:

    {
      "mcpServers": {
        "browserbase": {
          "command": "npx",
          "args": ["@browserbasehq/mcp"],
          "env": {
            "BROWSERBASE_API_KEY": "",
            "BROWSERBASE_PROJECT_ID": "",
            "GEMINI_API_KEY": ""
          }
        }
      }
    }
    
  2. Insert your actual API keys and project ID.

  3. Restart your AI client.


Option 3: Local Installation for Advanced Users

For teams needing full control or integration into custom pipelines, install and run locally:

  1. Clone the Repo:

    git clone https://github.com/browserbase/mcp-server-browserbase.git
    cd mcp-server-browserbase
    
  2. Install Dependencies:

    pnpm install && pnpm build
    
  3. Start the Server:

    • STDIO (local process):
      Configure your MCP client as:

      {
        "mcpServers": {
          "browserbase": {
            "command": "node",
            "args": ["path/to/mcp-server-browserbase/cli.js"],
            "env": {
              "BROWSERBASE_API_KEY": "your_api_key",
              "BROWSERBASE_PROJECT_ID": "your_project_id",
              "GEMINI_API_KEY": "your_gemini_key"
            }
          }
        }
      }
      
    • HTTP Transport:
      Start with:

      node cli.js --port 3000
      

      Then, point your MCP client to http://localhost:3000.

  4. Add API Keys:
    Replace placeholders with your actual Browserbase and model API keys.

  5. Restart Your Client to enable the server.


Customizing the Server

Browserbase MCP Server supports several flags for fine-tuning:

Example:

"args": ["path/to/cli.js", "--proxies", "--browserWidth=1920"]

Using Browserbase MCP Server to Automate Chrome

Once setup is complete, you can issue natural language commands through your AI client to control Chrome. Here’s how:

1. Connect Your AI Client

Ensure your client (e.g., Claude Desktop, Cursor) is linked to Browserbase MCP Server using one of the above methods.

2. Issue Browser Automation Commands

In the AI chat, you can instruct the browser with plain language, such as:

The AI interprets your instructions and executes browser actions live.

3. Review the Results

You’ll receive real-time feedback, such as:


Test your setup with this workflow:

  1. Open your AI client (e.g., Claude Desktop)
  2. Issue command:
    “Go to https://google.com, search for 'Browserbase MCP Server', and click the first result”
  3. Watch Chrome automate the actions—visible if running with local HTTP transport
  4. Review the response—the AI should confirm navigation and may extract page content

Install claude


Troubleshooting Tips


Conclusion

Browserbase MCP Server bridges the gap between powerful AI models and real-world browser automation, unlocking new possibilities for API development, QA, and workflow automation. Whether you choose the hosted, npm, or local approach, you can rapidly automate web tasks and integrate them into your team's toolchain.

Ready to streamline your API or QA workflows? Consider combining Browserbase automation with Apidog’s robust API testing and documentation features for best-in-class productivity.

💡 Discover how Apidog offers beautiful API documentation, streamlined team collaboration, and a smarter alternative to Postman.

button

Explore more

Apidog AI Agent Debugger: Availability, Coverage, and Setup

Apidog AI Agent Debugger: Availability, Coverage, and Setup

Apidog's AI Agent Debugger is available on all plans. This article covers supported model providers, MCP transports, what the debugger covers, what stays the same, and how to enable it.

20 May 2026

Google Antigravity 2.0: Agent-First Dev Platform Has Landed

Google Antigravity 2.0: Agent-First Dev Platform Has Landed

Google Antigravity 2.0 ships at I/O 2026: standalone desktop app, CLI, SDK, Managed Agents API, and Gemini Enterprise tier. Pricing, features, and Apidog setup.

20 May 2026

Google Information Agent: Find Anything With AI Search

Google Information Agent: Find Anything With AI Search

Google's new Information Agent runs 24/7, scans web and Google data, notifies you on matches. Launch date, pricing, Gemini 3.5 Flash, and developer angle.

20 May 2026

Practice API Design-first in Apidog

Discover an easier way to build and use APIs