Agent skill
dev-browser
Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.
Install this agent skill to your Project
npx add-skill https://github.com/SawyerHood/dev-browser/tree/main/skills/dev-browser
SKILL.md
Dev Browser
A CLI for controlling browsers with sandboxed JavaScript scripts.
Installation
npm install -g dev-browser
dev-browser install
Usage
Run dev-browser --help to learn more.
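As a minimal sketch, the snippet below checks that the CLI is on PATH after installation before asking it for help. It uses only the commands named above; the `check_cli` helper is a hypothetical name introduced for illustration and is not part of dev-browser.

```shell
# Hypothetical helper (not part of dev-browser): report whether a command is on PATH.
check_cli() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: not found"
    return 1
  fi
}

# Only ask for help output once the binary is actually reachable.
if check_cli dev-browser; then
  dev-browser --help
fi
```

If the check reports "not found", re-running the two installation commands above is the first thing to try.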