Agent skill

agent-browser

Automates browser interactions for form filling and web page interaction. Used by the request-website command to submit website indexing requests.

Stars 1,464
Forks 102

Install this agent skill to your Project

npx add-skill https://github.com/actionbook/actionbook/tree/main/playground/actionbook-scraper/skills/agent-browser

SKILL.md

Browser Automation with agent-browser

Quick start

bash
agent-browser open <url>        # Navigate to page
agent-browser snapshot -i       # Get interactive elements with refs
agent-browser click @e1         # Click element by ref
agent-browser fill @e2 "text"   # Fill input by ref
agent-browser close             # Close browser

Core workflow

  1. Navigate: agent-browser open <url>
  2. Snapshot: agent-browser snapshot -i (returns elements with refs like @e1, @e2)
  3. Interact using refs from the snapshot
  4. Re-snapshot after navigation or significant DOM changes
  5. Always close: agent-browser close

Commands

Navigation

bash
agent-browser open <url>      # Navigate to URL
agent-browser close           # Close browser (ALWAYS do this)

Snapshot (page analysis)

bash
agent-browser snapshot        # Full accessibility tree
agent-browser snapshot -i     # Interactive elements only (recommended)

Interactions (use @refs from snapshot)

bash
agent-browser click @e1           # Click
agent-browser fill @e2 "text"     # Clear and type
agent-browser type @e2 "text"     # Type without clearing
agent-browser press Enter         # Press key
agent-browser scroll down 500     # Scroll page

Get information

bash
agent-browser get text @e1        # Get element text
agent-browser get title           # Get page title
agent-browser get url             # Get current URL

Wait

bash
agent-browser wait @e1                     # Wait for element
agent-browser wait 2000                    # Wait milliseconds
agent-browser wait --load networkidle      # Wait for network idle

Example: Form submission (request-website)

bash
# Open Actionbook request page
agent-browser open "https://actionbook.dev/request-website"

# Get form elements
agent-browser snapshot -i
# Output shows: textbox "Site URL" [ref=e1], textbox "Email" [ref=e2], textbox "Use Case" [ref=e3], button "Submit" [ref=e4]

# Fill form
agent-browser fill @e1 "https://example.com/products"
agent-browser fill @e2 "user@example.com"
agent-browser fill @e3 "Scraping product catalog"

# Submit
agent-browser click @e4
agent-browser wait --load networkidle

# Verify submission
agent-browser snapshot -i

# Close browser
agent-browser close

Permission Required

Add to .claude/settings.local.json:

json
{
  "permissions": {
    "allow": [
      "Bash(agent-browser *)"
    ]
  }
}

Or run ./setup.sh to configure automatically.

Expand your agent's capabilities with these related and highly-rated skills.

actionbook/actionbook

actionbook

Activate when the user needs to interact with any website — browser automation, web scraping, screenshots, form filling, UI testing, monitoring, or building AI agents. Provides pre-verified page actions with step-by-step instructions and tested selectors.

1,464 102
Explore
actionbook/actionbook

article-exporter

Export any web article to a local Obsidian-ready Markdown directory. Fetches page content via actionbook CLI, downloads images locally, rewrites image references to relative paths, and optionally translates the article using AI. Produces a self-contained folder with README.md, images/, and an index.md navigation file.

1,464 102
Explore
actionbook/actionbook

deep-research

Deep research and analysis tool. Generates comprehensive HTML reports on any topic, domain, paper, or technology. Use when user asks to research, analyze, investigate, deep-dive, or generate a report on any subject. Supports academic papers (arXiv), technologies, trends, comparisons, and general topics.

1,464 102
Explore
actionbook/actionbook

actionbook-web-test

Run browser-based web tests against websites using Actionbook CLI. Activate when the user wants to test a website workflow, run smoke tests, verify a user flow, check if a web application works, run regression tests, or validate browser-based interactions. Supports test definition, execution, assertion, reporting, and json-ui visual report generation.

1,464 102
Explore
actionbook/actionbook

arxiv-viewer

View, search, and download academic papers from arXiv. Supports API queries, web scraping via Actionbook, and HTML paper reading via ar5iv. Use when user asks about arxiv papers, academic papers, research papers, paper summaries, latest papers, or wants to search/download/read papers.

1,464 102
Explore
actionbook/actionbook

actionbook-scraper

Generate and verify web scraper scripts using Actionbook's verified selectors. Auto-validates generated scripts and fixes errors.

1,464 102
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results