Skip to content
Tools / Site Crawler
Site Crawler icon

Site Crawler

Map pages, links, and errors

Site Crawler walks any website and collects per-page metadata — titles, descriptions, heading structure, word counts, response times, and link counts. Give it a URL and get back a structured inventory of everything it finds.

It's the fastest way to audit a site's content landscape without manually clicking through pages. Use it to find thin content, spot broken links, map internal structure, or build a content inventory before an SEO project.

What you can do

  • Crawl up to 100 pages from any starting URL
  • Collect title, description, heading counts, word count, link count, and response time per page
  • Detect 404s, timeouts, and other errors
  • Control crawl scope — same-origin only, or extend to subdomains and external links
  • Adjust timeout per page for slow-loading sites

Who it's for

SEO professionals running content audits. Developers mapping site structure before a migration. Content teams identifying thin or duplicate pages. Anyone who needs a fast overview of what's actually on a site.

How to use it

  1. Use crawl_site with your starting URL — set max_pages to 10 for a quick overview or up to 100 for full coverage
  2. Check the errors array for 404s and timeouts
  3. Sort by word_count to find thin content pages
  4. Set include_subdomains to true for multi-subdomain sites like docs.example.com

Getting started

Save your primary domain as a credential so it's used as the default — or just pass any URL directly to crawl_site.

Crawl Site

Crawl pages from a starting URL and collect metadata including title, description, heading counts, link counts, response time, and crawl diagnostics.

Returns: Bounded crawl results with per-page SEO metadata, link graph hints, response timing, and failure diagnostics
Loading reviews...

Loading activity...

v0.022026-03-22
  • Added subtitle, expanded description, and agent instructions
v0.012026-03-20
  • Initial release

Site Crawler Use Cases(8)

Browse all 8 Site Crawlerguides →
Open Build a Full Site Inventory

Build a Full Site Inventory

Crawl your entire website to build a complete inventory of every page, its status, and its metadata.

Site Crawler icon
Site Crawler
4 agent guides
Open Find 404 Errors Across Your Site

Find 404 Errors Across Your Site

Crawl your website to discover all pages returning 404 errors and the internal links pointing to them.

Site Crawler icon
Site Crawler
4 agent guides
Open Search Papers by Topic

Search Papers by Topic

Find relevant academic papers on any research topic across millions of scholarly publications.

Academic Research icon
Academic Research
4 agent guides
Open Analyze App Review Sentiment

Analyze App Review Sentiment

Understand how users feel about your app by analyzing sentiment patterns across hundreds or thousands of reviews.

App Review Analysis icon
App Review Analysis
4 agent guides
See every Site Crawleruse case (Claude, ChatGPT, Copilot, OpenClaw guides) →

Related Tools

Frequently Asked Questions

Can it crawl a whole site or just one page?

It walks a website from a starting URL and gathers data page by page, so it works for both single-page and full-site audits.

What page data does it collect?

It collects titles, descriptions, heading counts, word counts, response times, and link counts.

Is it useful for broken-link or content inventory work?

Yes. Those are two of the most practical uses for the crawler.

Do I need to start from the homepage?

No. You can start from any URL and let the crawl expand from there.