ClawExplorer logo

ClawExplorer

OpenClaw skill

browse

An OpenClaw skill named "browse" that enables agents to interact with web pages. It supports capabilities such as navigating to URLs, extracting text, taking screenshots, filling forms, clicking elements, and scrolling. Usage involves the `browse_page` tool with required `url` parameter and optional `instructions`.

Files

Review the files below to add this skill to your agents.

SKILL.md content

Unable to load SKILL.md content from source.

How this skill works

  • The skill defines metadata including name 'Browse', namespace 'pkiv', description, inputs (url: string required, action: string optional), outputs (content: string, screenshot: string, title: string), and configuration (headless: boolean default true)
  • It launches a headless Chromium browser using Playwright
  • Creates a new browser page
  • Navigates the page to the input URL
  • Waits for the load state to be 'networkidle'
  • Extracts page content as HTML or text based on action input
  • Captures full-page screenshot as base64 if requested
  • Retrieves the page title
  • Closes the browser page and context
  • Returns extracted content, screenshot, and title as outputs

When to use it

  • When needing to launch a browser instance to visit and interact with web pages
  • When instructed to navigate to a specific URL and extract information using natural language directives
  • When required to perform actions like scrolling, clicking elements, or filling forms on a webpage
  • When capturing screenshots or page content for further analysis in a task

Example use cases

  • Extract main heading and first paragraph from a webpage: Navigate to https://example.com and use instructions 'Extract the main heading and first paragraph.' to retrieve specified content.

FAQs

More similar skills to explore

  • achurch

    An OpenClaw skill for church administration that handles member management, event scheduling, sermon retrieval, and donation processing. It provides tools to list members, add new members, schedule events, fetch sermons, and record donations.

  • agent-config

    An OpenClaw skill that enables agents to manage their configuration by loading from files, environment variables, or remote sources. It supports retrieving, setting, and validating configuration values. The skill allows for hot-reloading of configurations.

  • agent-council

    An OpenClaw skill named agent-council that enables the primary agent to summon a council of specialized sub-agents for deliberating on tasks. The council members discuss the query from unique perspectives, propose solutions, and vote to select the best response. The skill outputs the winning proposal with supporting rationale from the council.

  • agent-identity-kit

    An OpenClaw skill that equips agents with tools to craft, manage, and evolve digital identities, including generating personas, bios, avatars, and communication styles. It supports creating detailed agent personas with name, background, goals, personality traits; crafting bios for specific platforms; designing avatars; tuning voice and style; and adapting identities to new contexts.

  • agenticflow-skill

    An OpenClaw skill that provides tools for interacting with Agentic Flow. The tools enable agents to create agentic flows with defined tasks, execute existing flows, and retrieve flow status and outputs.

  • agentlens

    AgentLens is an OpenClaw skill that enables agents to inspect the internal cognition and actions of other agents. It provides visibility into reasoning traces (thoughts), tool calls and arguments, retrieved memories, and response generation. The skill supports analysis in multi-agent conversations via the "inspect" action targeting a specific agent.