# MastraBrowser class

The `MastraBrowser` class is the abstract base class for browser automation providers. It defines the common interface for launching browsers, managing thread isolation, streaming screencasts, and handling input events.

You don't instantiate `MastraBrowser` directly. Instead, use a provider implementation:

- [`AgentBrowser`](https://mastra.ai/reference/browser/agent-browser): Deterministic browser automation using refs
- [`StagehandBrowser`](https://mastra.ai/reference/browser/stagehand-browser): AI-powered browser automation using natural language

## Usage example

```typescript
import { Agent } from '@mastra/core/agent'
import { AgentBrowser } from '@mastra/agent-browser'

const browser = new AgentBrowser({
  headless: true,
  viewport: { width: 1280, height: 720 },
  scope: 'thread',
})

export const browserAgent = new Agent({
  name: 'browser-agent',
  instructions: 'You can browse the web to find information.',
  model: 'openai/gpt-5.4',
  browser,
})
```

## Constructor parameters

**headless** (`boolean`): Whether to run the browser in headless mode (no visible UI). (Default: `true`)

**viewport** (`{ width: number; height: number }`): Browser viewport dimensions. Controls the size of the browser window. (Default: `{ width: 1280, height: 720 }`)

**timeout** (`number`): Default timeout in milliseconds for browser operations. (Default: `10000`)

**cdpUrl** (`string | (() => string | Promise<string>)`): CDP WebSocket URL, HTTP endpoint, or sync/async provider function. When provided, connects to an existing browser instead of launching a new one. HTTP endpoints are resolved to WebSocket internally. Can't be used with scope: 'thread' (automatically uses shared scope).

**scope** (`'shared' | 'thread'`): Browser instance scope across threads. 'shared' means all threads share a single browser instance. 'thread' means each thread gets its own browser instance (full isolation). (Default: `'thread' (or 'shared' when cdpUrl is provided)`)

**onLaunch** (`(args: { browser: MastraBrowser }) => void | Promise<void>`): Callback invoked after the browser reaches 'ready' status.

**onClose** (`(args: { browser: MastraBrowser }) => void | Promise<void>`): Callback invoked before the browser is closed.

**screencast** (`ScreencastOptions`): Configuration for streaming browser frames.

**screencast.format** (`'jpeg' | 'png'`): Image format for screencast frames.

**screencast.quality** (`number`): Image quality (1-100). Only applies to JPEG format.

**screencast.maxWidth** (`number`): Maximum width for screencast frames.

**screencast.maxHeight** (`number`): Maximum height for screencast frames.

**screencast.everyNthFrame** (`number`): Capture every Nth frame to reduce bandwidth.

## Properties

The following properties (`id`, `name`, `provider`) are abstract and must be defined by concrete provider implementations:

**id** (`string`): Unique identifier for this browser instance. Abstract - defined by provider.

**name** (`string`): Human-readable name of the browser provider (e.g., 'AgentBrowser', 'StagehandBrowser'). Abstract - defined by provider.

**provider** (`string`): Provider identifier (e.g., 'vercel-labs/agent-browser', 'browserbase/stagehand'). Abstract - defined by provider.

**headless** (`boolean`): Whether the browser is running in headless mode.

**status** (`BrowserStatus`): Current browser status: 'pending', 'launching', 'ready', 'error', 'closing', or 'closed'.

## Methods

### Lifecycle

#### `ensureReady()`

Ensures the browser is launched and ready for use. Automatically called before tool execution. Implemented in the base class.

```typescript
await browser.ensureReady()
```

#### `close()`

Closes the browser and cleans up all resources. Implemented in the base class with race-condition-safe handling.

```typescript
await browser.close()
```

#### `isBrowserRunning()`

Checks if the browser is currently running.

```typescript
const isRunning = browser.isBrowserRunning()
```

**Returns:** `boolean`

### Thread management

#### `setCurrentThread(threadId)`

Sets the current thread ID for browser operations. Used internally by the agent runtime.

```typescript
browser.setCurrentThread('thread-123')
```

#### `getCurrentThread()`

Gets the current thread ID.

```typescript
const threadId = browser.getCurrentThread()
```

**Returns:** `string`

#### `hasThreadSession(threadId)`

Checks if a thread has an active browser session.

```typescript
const hasSession = browser.hasThreadSession('thread-123')
```

**Returns:** `boolean`

#### `closeThreadSession(threadId)`

Closes a specific thread's browser session. For 'thread' scope, this closes that thread's browser instance. For 'shared' scope, this clears the thread's state.

```typescript
await browser.closeThreadSession('thread-123')
```

### Tools

#### `getTools()`

Returns the browser tools for use with agents. Each provider returns different tools based on its paradigm.

```typescript
const tools = browser.getTools()
```

**Returns:** `Record<string, Tool>`

### Screencast

#### `startScreencast(options?, threadId?)`

Starts streaming browser frames. Returns a `ScreencastStream` that emits frame events.

```typescript
const stream = await browser.startScreencast({ format: 'jpeg', quality: 80 }, 'thread-123')

stream.on('frame', frame => {
  console.log('Frame received:', frame.data.length, 'bytes')
})

stream.on('stop', reason => {
  console.log('Screencast stopped:', reason)
})
```

**Returns:** `Promise<ScreencastStream>`

### Input injection

#### `injectMouseEvent(params, threadId?)`

Injects a mouse event into the browser. Used by Studio for live interaction.

```typescript
await browser.injectMouseEvent({
  type: 'mousePressed',
  x: 100,
  y: 200,
  button: 'left',
  clickCount: 1,
})
```

#### `injectKeyboardEvent(params, threadId?)`

Injects a keyboard event into the browser. Used by Studio for live interaction.

```typescript
await browser.injectKeyboardEvent({
  type: 'keyDown',
  key: 'Enter',
  code: 'Enter',
})
```

### State

#### `getState(threadId?)`

Gets the current browser state including URL and tabs.

```typescript
const state = await browser.getState('thread-123')
console.log('Current URL:', state.currentUrl)
console.log('Tabs:', state.tabs)
```

**Returns:** `Promise<BrowserState>`

```typescript
interface BrowserState {
  currentUrl: string | null
  tabs: BrowserTabState[]
  activeTabIndex: number
}

interface BrowserTabState {
  id: string
  url: string
  title: string
}
```

#### `getCurrentUrl(threadId?)`

Gets the current page URL.

```typescript
const url = await browser.getCurrentUrl()
```

**Returns:** `Promise<string | null>`

## Browser scope

The `scope` option controls how browser instances are shared across conversation threads:

| Scope      | Description                                 | Use case                                 |
| ---------- | ------------------------------------------- | ---------------------------------------- |
| `'shared'` | All threads share a single browser instance | Cost-efficient for non-conflicting tasks |
| `'thread'` | Each thread gets its own browser instance   | Full isolation for concurrent users      |

```typescript
// Shared browser for all threads
const sharedBrowser = new AgentBrowser({
  scope: 'shared',
})

// Isolated browser per thread
const isolatedBrowser = new AgentBrowser({
  scope: 'thread',
})
```

When using `cdpUrl` to connect to an external browser, the scope automatically falls back to `'shared'` since you can't spawn new browser instances.

## Cloud browser providers

Connect to cloud browser services using the `cdpUrl` option:

```typescript
// Static CDP URL
const browser = new AgentBrowser({
  cdpUrl: 'wss://browser.example.com/ws',
})

// Dynamic CDP URL (e.g., session-based)
const browser = new AgentBrowser({
  cdpUrl: async () => {
    const session = await createBrowserSession()
    return session.wsUrl
  },
})
```

## Related

- [AgentBrowser](https://mastra.ai/reference/browser/agent-browser): Deterministic browser automation
- [StagehandBrowser](https://mastra.ai/reference/browser/stagehand-browser): AI-powered browser automation
- [Browser overview](https://mastra.ai/docs/browser/overview): Conceptual guide to browser automation