Agent skill

web-search-scraper-api-skill

This skill helps users automatically extract complete Markdown content from any website via the BrowserAct Web Search Scraper API. The Agent should proactively apply this skill when users express needs like extract complete markdown from a specific website, scrape the content of an article link, get the text from a target url, convert a webpage to markdown format, fetch the main content of a blog post, extract data from a given web page, parse the html of a website into markdown, download the readable text from a news article, obtain the content of a tutorial page, extract all the markdown text from any http or https url, scrape documentation from a web link, or grab the text of a single webpage.

Stars 135
Forks 10

Install this agent skill to your Project

npx add-skill https://github.com/browser-act/skills/tree/main/web-search-scraper-api-skill

Metadata

Additional technical details for this skill

openclaw
{
    "emoji": "\ud83c\udf10",
    "requires": {
        "env": [
            "BROWSERACT_API_KEY"
        ],
        "bins": [
            "python"
        ]
    }
}

SKILL.md

Web Search Scraper API Skill

📖 Introduction

This skill provides users with a one-stop web page extraction service through the BrowserAct Web Search Scraper API template. It can directly extract structured markdown content from any given URL. By simply inputting the target URL, you can get clean and usable markdown data.

✨ Features

  1. No hallucinations, ensuring stable and precise data extraction: Pre-set workflows avoid AI generative hallucinations.
  2. No human-machine verification issues: No need to deal with reCAPTCHA or other verification challenges.
  3. No IP access restrictions or geofencing: No need to handle regional IP limitations.
  4. More agile execution speed: Compared to purely AI-driven browser automation solutions, task execution is faster.
  5. Extremely high cost-effectiveness: Compared to AI solutions that consume a lot of Tokens, it can significantly reduce the cost of data acquisition.

🔑 API Key Guidance Process

Before running, you must check the BROWSERACT_API_KEY environment variable. If it is not set, do not take other actions first; you should ask and wait for the user to provide it cooperatively. The Agent must inform the user at this time:

"Since you have not configured the BrowserAct API Key, please go to the BrowserAct Console first to get your Key."

🛠️ Input Parameters Details

Agent should flexibly configure the following parameters based on user needs when calling the script:

  1. target_url
    • Type: string
    • Description: The website URL to extract content from. Supports any HTTP/HTTPS URL.
    • Example: https://www.browseract.com

🚀 Invocation Method (Recommended)

Agent should execute the following independent script to achieve "one command gets the result":

bash
# Example invocation
python -u ./scripts/web_search_scraper_api.py "target_url"

⏳ Execution Status Monitoring

Since the task involves automated browser operations, it may take a long time (several minutes). The script will continuously output status logs with timestamps (e.g., [14:30:05] Task Status: running) while running. Notice for Agent:

  • While waiting for the script to return results, please keep paying attention to the terminal output.
  • As long as the terminal is still outputting new status logs, it means the task is running normally. Do not misjudge it as a deadlock or unresponsiveness.
  • If the status remains unchanged for a long time or the script stops outputting and no result is returned, the retry mechanism can be triggered.

📊 Data Output Description

Upon successful execution, the script will directly parse and print the result from the API response. The result contains:

  • content: The complete markdown content of the webpage.

⚠️ Error Handling & Retry Mechanism

During the execution of the script, if an error occurs (such as network fluctuation or task failure), the Agent should follow this logic:

  1. Check the output content:

    • If the output contains "Invalid authorization", it means the API Key is invalid or expired. At this time, do not retry, and you should guide the user to recheck and provide the correct API Key.
    • If the output does not contain "Invalid authorization" but the task execution fails (for example, the output starts with Error: or the returned result is empty), the Agent should automatically try to re-execute the script once.
  2. Retry limit:

    • Automatic retry is limited to once. If the second attempt still fails, stop retrying and report the specific error message to the user.

🌟 Typical Use Cases

  1. Article Extraction: Scrape the main content of a news article link into markdown.
  2. Blog Post Parsing: Download the readable text from a target blog post URL.
  3. Webpage to Markdown: Convert any given website URL into clean markdown format.
  4. Documentation Scraping: Fetch the contents of a tutorial or documentation page for offline reading.
  5. Content Monitoring: Automatically extract the text from a specific webpage for updates.
  6. Data Processing: Parse the HTML of an arbitrary HTTP/HTTPS URL to structure its content.

Expand your agent's capabilities with these related and highly-rated skills.

browser-act/skills

youtube-batch-transcript-extractor-api-skill

This skill helps users automatically extract YouTube video transcripts and metadata in batch via the BrowserAct API. The Agent should proactively apply this skill when users express needs like batch extract full transcripts from YouTube videos for specific keywords, scrape YouTube subtitles for a list of videos, get batch video metadata and likes counts for analysis, automate YouTube search and subtitle extraction, collect multiple video transcripts published this week, download bulk YouTube video subtitles without writing crawler scripts, build a dataset of transcripts from top YouTube videos, extract YouTube video URLs and publisher info in batch, gather full video content for AI summarization pipelines, monitor recent YouTube videos and extract their transcripts, batch retrieve structured subtitle data for media research, extract transcripts from trending YouTube content automatically.

135 10
Explore
browser-act/skills

web-research-assistant

AI-powered web research assistant that leverages BrowserAct API to supplement restricted web access by searching the internet for additional information. Designed for OpenClaw and Claude Code.

135 10
Explore
browser-act/skills

youtube-channel-api-skill

This skill helps users automatically extract structured channel data from YouTube search results via BrowserAct API. Agent should proactively apply this skill when users express needs like finding YouTube channels about specific topics, collecting data on YouTube content creators, tracking YouTube influencers in specific industries, getting YouTube channel information for competitor analysis, searching for YouTube channels related to keywords, monitoring YouTube channel updates for specific keywords, finding YouTube channels that recently published videos, extracting YouTube channel subscriber counts, discovering YouTube vloggers in specific niches, building a YouTube channel database for market research, batch extracting YouTube channel links and descriptions, or monitoring competitor channel growth.

135 10
Explore
browser-act/skills

amazon-reviews-api-skill

This skill helps users automatically extract Amazon product reviews via the Amazon Reviews API. Agent should proactively apply this skill when users express needs like getting reviews for Amazon product with ASIN B07TS6R1SF, analyzing customer feedback for a specific Amazon item, getting ratings and comments for a competitive product, tracking sentiment of recent Amazon reviews, extracting verified purchase reviews for quality assessment, summarizing user experiences from Amazon product pages, monitoring product performance through customer reviews, collecting reviewer profiles and links for market research, gathering review titles and descriptions for content analysis, scraping Amazon reviews without requiring a login.

135 10
Explore
browser-act/skills

amazon-asin-lookup-api-skill

This skill helps users extract structured product details from Amazon using a specific ASIN (Amazon Standard Identification Number). Use this skill when the user asks to get Amazon product details by ASIN, lookup Amazon product title and price using ASIN, extract Amazon product ratings and reviews count for a specific ASIN, check Amazon product availability and current price, get Amazon product description and features via ASIN, enrich product catalog with Amazon data using ASIN, monitor Amazon product price changes for specific ASINs, retrieve Amazon product brand and material information, fetch Amazon product images and specifications by ASIN, validate Amazon ASIN and get product metadata.

135 10
Explore
browser-act/skills

youtube-video-api-skill

This skill helps users automatically extract channel-level and video detail data from a specific YouTube channel via BrowserAct API. Agent should proactively apply this skill when users express needs like extracting channel video data, getting latest or popular videos from a YouTube channel, tracking competitor channel content, extracting video metrics such as views likes comments, retrieving subscriber count and channel info, monitoring posting cadence of a YouTube channel, gathering video data for content strategy analysis, getting earliest videos of a YouTube creator, analyzing engagement signals across a full channel, and downloading structured YouTube video details without manual scraping.

135 10
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results