Build a Full Site Inventory
Crawl your entire website to build a complete inventory of every page, its status, and its metadata.
Find pages with duplicate or near-duplicate content that confuse search engines and dilute rankings.
ToolSite CrawlerDuplicate content is one of the most misunderstood SEO problems. It does not cause penalties, but it does cause confusion -- when multiple pages have the same content, search engines must choose which one to index and rank. Often they choose wrong, ranking a less important version while ignoring your preferred page.
Duplicate content commonly arises from URL parameters creating multiple versions of the same page, HTTP/HTTPS and www/non-www variations, print-friendly page versions, paginated content, and CMS-generated category and tag pages that overlap with main content.
ToolRouter's crawl_site skill detects pages with matching or near-matching titles, descriptions, and content hashes. It identifies URL patterns likely to create duplicates and checks for proper canonical tag implementation across all variations.