Web Scraper turns any website into structured, readable data. It handles JavaScript-heavy pages, anti-bot protection, geographic targeting, and AI-powered data extraction, all without writing any code. Whether you need the text of a single page, a full site crawl, or a structured dataset extracted to a schema, one tool covers it.
Standard web scrapers break on modern sites that require JavaScript to render. This one renders pages fully, bypasses bot detection, supports authentication headers, and can extract specific data fields using AI, so you get exactly the fields you asked for rather than raw HTML.
What you can do
- scrape_page — scrape a single page with JS rendering; returns markdown, HTML, or filtered content; handles JSON APIs too
- crawl_site — recursively crawl a site up to a depth and page limit, collecting all content
- map_site — fast URL discovery without content, useful before deciding what to crawl
- extract_data — AI-powered structured extraction: describe what you want in plain English or pass a JSON schema
- search_web — search the web and optionally scrape the top results in one call
- stealth_scrape / stealth_crawl — enhanced anti-bot bypass for heavily protected pages
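For developers wiring extract_data into a pipeline, the schema option can be sketched as follows. This is a minimal illustration, not the skill's documented interface: the schema itself is standard JSON Schema, but the argument names "url" and "schema", the field names, and the example URL are assumptions made for the example.

```python
import json

# Standard JSON Schema describing the fields we want back.
# The field names (name, price, in_stock) are illustrative only.
product_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "price": {"type": "number"},
        "in_stock": {"type": "boolean"},
    },
    "required": ["name", "price"],
}

# Hypothetical argument names ("url", "schema"); check the skill's
# actual parameters before relying on them.
extract_request = {
    "url": "https://example.com/products/widget",
    "schema": product_schema,
}

# Serialize to JSON, the form in which such arguments are typically sent.
payload = json.dumps(extract_request, indent=2)
print(payload)
```

Marking only the essential fields as required leaves room for pages where optional data, such as stock status, is missing.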
Who it's for
Researchers aggregating content from multiple sites. Data teams building datasets from web sources. Developers integrating web content into pipelines. Marketers monitoring competitor pricing and messaging. Analysts tracking content changes over time.
How to use it
- Start with scrape_page for a single URL — add onlyMainContent to strip navigation and footers
- Use map_site to discover URLs on a domain before deciding what to crawl
- Use crawl_site for recursive multi-page collection with depth and path filters
- Use extract_data when you want specific fields — pass a schema or describe what to extract in plain English
- If you get 403 errors, switch to stealth_scrape for enhanced proxy routing
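The 403 fallback in the last step could be automated along these lines. This is a sketch under stated assumptions: call_skill is a hypothetical callable standing in for however your environment invokes a skill, "status" is an assumed name for the HTTP status field in the result, and it is assumed that stealth_scrape accepts the same arguments as scrape_page.

```python
from typing import Any, Callable, Dict

# Hypothetical signature: call_skill(skill_name, arguments) -> result dict.
SkillCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def scrape_with_fallback(call_skill: SkillCaller, url: str) -> Dict[str, Any]:
    """Try scrape_page first; retry once with stealth_scrape on a 403."""
    args = {"url": url, "onlyMainContent": True}
    result = call_skill("scrape_page", args)
    # "status" is an assumed field name for the HTTP status of the fetch.
    if result.get("status") == 403:
        result = call_skill("stealth_scrape", args)
    return result


def fake_call_skill(skill: str, arguments: Dict[str, Any]) -> Dict[str, Any]:
    """Stand-in for a real skill call: scrape_page is blocked, stealth works."""
    if skill == "scrape_page":
        return {"status": 403, "content": ""}
    return {"status": 200, "content": "# Example page"}


result = scrape_with_fallback(fake_call_skill, "https://example.com/pricing")
print(result["status"])  # prints 200
```

Passing the caller in as a parameter keeps the fallback logic independent of any particular client and easy to exercise with a stub like fake_call_skill.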
Getting started
All skills work without configuration. For geo-targeted results, add a country code. For authenticated pages, pass custom headers with your credentials.
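As a rough sketch of the two optional settings, the helper below builds an argument set with a country code and custom headers. The argument names "country" and "headers", the two-letter country code format, and the placeholder token are assumptions for illustration; only the Authorization header itself is standard HTTP.

```python
from typing import Any, Dict, Optional


def build_scrape_args(
    url: str,
    country: Optional[str] = None,
    headers: Optional[Dict[str, str]] = None,
) -> Dict[str, Any]:
    """Build arguments for a scrape call; optional settings are omitted if unset."""
    args: Dict[str, Any] = {"url": url}
    if country:
        # Assumed format: two-letter country code, normalized to upper case.
        args["country"] = country.upper()
    if headers:
        # Copy so later changes to the caller's dict do not leak in.
        args["headers"] = dict(headers)
    return args


# No configuration: only the URL is sent.
basic = build_scrape_args("https://example.com")

# Geo-targeted and authenticated; replace the placeholder with a real credential.
targeted = build_scrape_args(
    "https://example.com/account",
    country="de",
    headers={"Authorization": "Bearer YOUR_TOKEN_HERE"},
)
print(basic)
print(targeted)
```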