How to Control Chrome with AI: The Complete Guide to Browserbase MCP Server

Learn how to use Browserbase MCP Server to let AI models control Chrome browsers for web automation. Step-by-step setup for developers, plus real-world examples and troubleshooting. Integrate with Apidog for streamlined workflows.

Ashley Goolam

Ashley Goolam

29 January 2026

How to Control Chrome with AI: The Complete Guide to Browserbase MCP Server

Ever wanted to empower your AI to interact with the web like a real user—navigating sites, clicking buttons, filling out forms, and extracting data? With the Browserbase MCP Server, this is not just possible—it's surprisingly straightforward. In this developer-focused tutorial, you'll learn how to connect AI models like Claude, GPT, or Gemini to real browser sessions and automate web tasks efficiently.

Whether you're building intelligent bots, automating QA workflows, or prototyping new data-driven tools, this guide will show you why Browserbase MCP Server is essential and how to set it up seamlessly.


💡 Looking for an API testing platform that creates beautiful API documentation and powers collaborative developer workflows? Apidog combines powerful testing, streamlined docs, and team productivity—all at a fraction of Postman's cost.

button

What Is Browserbase MCP Server?

The Browserbase MCP Server is an open-source tool that connects large language models (LLMs) to web browsers using the Model Context Protocol (MCP). Think of it as a secure, programmable bridge between AI and real-world browser automation.

With Browserbase MCP Server, an AI can:

This capability unlocks advanced automation and data extraction without manual intervention—ideal for API-driven teams, backend engineers, and QA automation specialists.

Compatible with: OpenAI GPT, Anthropic Claude, Google Gemini, and other MCP-supported models.


Why Use Browserbase MCP Server for AI-Driven Automation?

For developers and teams, Browserbase MCP Server stands out for several reasons:

Whether you're automating repetitive web tasks, building data pipelines, or integrating browser automation into CI/CD flows, Browserbase MCP Server saves engineering hours while enabling robust and reliable testing.


How to Install and Configure Browserbase MCP Server

You can start using the Browserbase MCP Server in three ways:

  1. Hosted remote server (easiest)
  2. Via npm (recommended)
  3. Local installation (full control)

Prerequisites

Before setup, ensure you have:

browserbase official website

For local setup, also install:


Option 1: Hosted Remote Server (Fastest Setup)

Skip installation and use Browserbase’s cloud-hosted MCP Server:

  1. Get Your Credentials:
    Go to smithery.ai to obtain your API keys and MCP server URL.

  2. Configure Your AI Client:
    Add the URL to your AI client's MCP settings.
    For Claude Desktop, update your claude_desktop_config.json:

    {
      "mcpServers": {
        "browserbase": {
          "url": "https://your-smithery-url.com"
        }
      }
    }
    

    For the mcp-remote CLI:

    {
      "mcpServers": {
        "browserbase": {
          "command": "npx",
          "args": ["mcp-remote", "https://your-smithery-url.com"]
        }
      }
    }
    
  3. Restart Your Client to activate the new settings.

Your AI client now has full browser automation via Browserbase MCP Server.

smithery ai


Run the server directly via npm for maximum compatibility:

  1. In your MCP config JSON, add:

    {
      "mcpServers": {
        "browserbase": {
          "command": "npx",
          "args": ["@browserbasehq/mcp"],
          "env": {
            "BROWSERBASE_API_KEY": "",
            "BROWSERBASE_PROJECT_ID": "",
            "GEMINI_API_KEY": ""
          }
        }
      }
    }
    
  2. Insert your actual API keys and project ID.

  3. Restart your AI client.


Option 3: Local Installation for Advanced Users

For teams needing full control or integration into custom pipelines, install and run locally:

  1. Clone the Repo:

    git clone https://github.com/browserbase/mcp-server-browserbase.git
    cd mcp-server-browserbase
    
  2. Install Dependencies:

    pnpm install && pnpm build
    
  3. Start the Server:

    • STDIO (local process):
      Configure your MCP client as:

      {
        "mcpServers": {
          "browserbase": {
            "command": "node",
            "args": ["path/to/mcp-server-browserbase/cli.js"],
            "env": {
              "BROWSERBASE_API_KEY": "your_api_key",
              "BROWSERBASE_PROJECT_ID": "your_project_id",
              "GEMINI_API_KEY": "your_gemini_key"
            }
          }
        }
      }
      
    • HTTP Transport:
      Start with:

      node cli.js --port 3000
      

      Then, point your MCP client to http://localhost:3000.

  4. Add API Keys:
    Replace placeholders with your actual Browserbase and model API keys.

  5. Restart Your Client to enable the server.


Customizing the Server

Browserbase MCP Server supports several flags for fine-tuning:

Example:

"args": ["path/to/cli.js", "--proxies", "--browserWidth=1920"]

Using Browserbase MCP Server to Automate Chrome

Once setup is complete, you can issue natural language commands through your AI client to control Chrome. Here’s how:

1. Connect Your AI Client

Ensure your client (e.g., Claude Desktop, Cursor) is linked to Browserbase MCP Server using one of the above methods.

2. Issue Browser Automation Commands

In the AI chat, you can instruct the browser with plain language, such as:

The AI interprets your instructions and executes browser actions live.

3. Review the Results

You’ll receive real-time feedback, such as:


Test your setup with this workflow:

  1. Open your AI client (e.g., Claude Desktop)
  2. Issue command:
    “Go to https://google.com, search for 'Browserbase MCP Server', and click the first result”
  3. Watch Chrome automate the actions—visible if running with local HTTP transport
  4. Review the response—the AI should confirm navigation and may extract page content

Install claude


Troubleshooting Tips


Conclusion

Browserbase MCP Server bridges the gap between powerful AI models and real-world browser automation, unlocking new possibilities for API development, QA, and workflow automation. Whether you choose the hosted, npm, or local approach, you can rapidly automate web tasks and integrate them into your team's toolchain.

Ready to streamline your API or QA workflows? Consider combining Browserbase automation with Apidog’s robust API testing and documentation features for best-in-class productivity.

💡 Discover how Apidog offers beautiful API documentation, streamlined team collaboration, and a smarter alternative to Postman.

button

Explore more

Claude vs Claude Code vs Claude Cowork: Which One Should You Use?

Claude vs Claude Code vs Claude Cowork: Which One Should You Use?

Understand the differences between Claude, Claude Code, and Claude Cowork. Find the right Anthropic AI product for your workflow - coding, chat, or agentic tasks

28 February 2026

Why Stripe's API is the Gold Standard: Design Patterns That Every API Builder Should Steal

Why Stripe's API is the Gold Standard: Design Patterns That Every API Builder Should Steal

A deep dive into the architectural decisions that made Stripe the most beloved API among developers.

28 February 2026

Nano Banana 1 vs Nano Banana 2: The Only Comparison You Need

Nano Banana 1 vs Nano Banana 2: The Only Comparison You Need

Complete comparison of Nano Banana 1 vs Nano Banana 2: resolution, text rendering, prompt understanding, and features. Find out which AI image generator is right for you.

27 February 2026

Practice API Design-first in Apidog

Discover an easier way to build and use APIs