Browser
Deploy a Browser instance
BrowserInstance
The BrowserInstance is a lightweight Chromium instance that supports interactive streaming, computer actions, Playwright CDP control, and saving and loading auth states. We recommend this instance type if your task is confined to the browser.
- Fastest startup time
- 1x compute cost
Start a browser instance
Python
TypeScript
Available actions
get_cdp_url
Get the Playwright CDP URL
Python
TypeScript
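As a rough sketch of what the CDP URL is for: Playwright can attach to an already running Chromium through `connect_over_cdp`. The helper name `page_title_over_cdp` and both of its arguments are placeholders for illustration. How the URL is read from the `get_cdp_url` response is not shown here and depends on the SDK.

```python
def page_title_over_cdp(cdp_url: str, target_url: str) -> str:
    """Attach to a running Chromium over CDP, open target_url, return its title.

    Hypothetical helper: cdp_url is assumed to be the value obtained
    from get_cdp_url.
    """
    # Imported lazily so this module still loads where Playwright is absent.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.connect_over_cdp(cdp_url)
        # Reuse the existing context when there is one, so cookies and any
        # loaded auth state remain in effect.
        context = browser.contexts[0] if browser.contexts else browser.new_context()
        page = context.new_page()
        page.goto(target_url)
        title = page.title()
        page.close()
        return title
```

Reusing the first existing context rather than creating a new one is a design choice in this sketch. A fresh context would start without the session's cookies.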
save_auth
Save the browser auth state
Python
TypeScript
authenticate
Authenticate the browser using a saved auth state
Python
TypeScript
screenshot
Take a screenshot of the current desktop, returned as a base64-encoded image
Python
TypeScript
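Because the screenshot arrives as a base64 string, it has to be decoded before it can be viewed or saved. A minimal sketch using only the standard library follows. The helper name `save_base64_image` is hypothetical, and the handling of a `data:` URI prefix is a defensive assumption, not documented behavior.

```python
import base64
from pathlib import Path


def save_base64_image(encoded: str, path: str) -> int:
    """Decode a base64 string, write the bytes to path, return the byte count."""
    # Assumption: tolerate an optional "data:image/png;base64," style prefix.
    if encoded.startswith("data:") and "," in encoded:
        encoded = encoded.split(",", 1)[1]
    # validate=True rejects characters outside the base64 alphabet
    # instead of silently discarding them.
    data = base64.b64decode(encoded, validate=True)
    Path(path).write_bytes(data)
    return len(data)
```

A call such as `save_base64_image(image_string, "desktop.png")` would write the decoded image to disk, where `image_string` stands for whatever field of the response carries the image.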
get_stream_url
Get the interactive stream URL
Python
TypeScript
computer
Perform computer actions with the mouse and keyboard
key
Press a key or combination of keys
text
Python
TypeScript
type
Type text into the active window
text
Python
TypeScript
mouse_move
Move mouse cursor to specific coordinates
coordinate: [x, y]
Python
TypeScript
left_click_drag
Click and drag from current position to specified coordinates
coordinate: [x, y]
Python
TypeScript
scroll
Scroll horizontally and/or vertically (pixels converted to clicks)
coordinate: [x, y]
Python
TypeScript
left_click
Perform a left mouse click at current position
Python
TypeScript
right_click
Perform a right mouse click at current position
Python
TypeScript
middle_click
Perform a middle mouse click at current position
Python
TypeScript
double_click
Perform a double left click at current position
Python
TypeScript
screenshot
Take a screenshot of the desktop
Python
TypeScript
cursor_position
Get current mouse cursor coordinates
Python
TypeScript
wait
Wait for 3 seconds
Python
TypeScript
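The computer actions above fall into three groups: `key` and `type` take `text`; `mouse_move`, `left_click_drag`, and `scroll` take `coordinate`; the rest take nothing. The sketch below checks an action against that table before it is sent. The `build_action` and `click_at` helpers and the dictionary payload shape are hypothetical and are not the SDK's own API. Only the action names and parameter names come from the list above.

```python
from typing import Any, Dict, List, Optional, Sequence

# Parameter each action expects, taken from the list above; None means none.
ACTION_PARAMS: Dict[str, Optional[str]] = {
    "key": "text",
    "type": "text",
    "mouse_move": "coordinate",
    "left_click_drag": "coordinate",
    "scroll": "coordinate",
    "left_click": None,
    "right_click": None,
    "middle_click": None,
    "double_click": None,
    "screenshot": None,
    "cursor_position": None,
    "wait": None,
}


def build_action(
    action: str,
    text: Optional[str] = None,
    coordinate: Optional[Sequence[int]] = None,
) -> Dict[str, Any]:
    """Return a validated payload dict (hypothetical shape) for one action."""
    if action not in ACTION_PARAMS:
        raise ValueError(f"unknown action: {action!r}")
    required = ACTION_PARAMS[action]
    payload: Dict[str, Any] = {"action": action}
    for name, value in (("text", text), ("coordinate", coordinate)):
        if name == required:
            if value is None:
                raise ValueError(f"{action!r} requires {name!r}")
            payload[name] = list(value) if name == "coordinate" else value
        elif value is not None:
            raise ValueError(f"{action!r} does not take {name!r}")
    if "coordinate" in payload and len(payload["coordinate"]) != 2:
        raise ValueError("coordinate must be [x, y]")
    return payload


def click_at(x: int, y: int) -> List[Dict[str, Any]]:
    """Clicks act at the current position, so move the cursor first."""
    return [build_action("mouse_move", coordinate=[x, y]), build_action("left_click")]
```

Since the click actions operate at the current cursor position, clicking a specific point is a two-step sequence: `click_at(100, 200)` yields a `mouse_move` payload followed by a `left_click` payload.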
stop
Stop the instance
Python
TypeScript
pause
Pause the instance
Python
TypeScript
resume
Resume the instance
Python
TypeScript
Compatible tools
ComputerTool
BrowserTool