This page tests robots.txt compliance checking functionality
This page demonstrates robots.txt blocking. Some subpages are blocked by robots.txt rules:
/robots-blocking/product/2, /robots-blocking/product/4)/robots-blocking/category/blockedCheck /robots.txt to see the blocking rules.
This page is designed to test the robots.txt compliance checking feature. The robots.txt file blocks specific paths:
Disallow: /robots-blocking/product/2Disallow: /robots-blocking/product/4Disallow: /robots-blocking/category/blockedWhen robots.txt compliance checking is enabled, workflows attempting to scrape these blocked URLs should fail with a ROBOTS_TXT_BLOCKED error.