    Meta Robots

    Meta robots controls whether a page is indexed (index/noindex) and whether links are followed (follow/nofollow).

    Definition

    Meta robots is an HTML <meta name="robots" content="..."> directive in the <head>. It affects indexing and link-following behavior. For example, noindex tells search engines not to show the page in SERPs, while follow allows link discovery.
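
    To make the directive concrete, here is a small hypothetical helper (the `robotsMeta` function and its flags are illustrative, not part of any library) that builds the tag from index/follow flags:

    ```typescript
    // Hypothetical helper: build a robots meta tag string from two flags.
    function robotsMeta(opts: { index: boolean; follow: boolean }): string {
      const content = [
        opts.index ? 'index' : 'noindex',
        opts.follow ? 'follow' : 'nofollow',
      ].join(', ');
      return `<meta name="robots" content="${content}" />`;
    }

    // Example: keep an internal search results page out of the index,
    // but still let crawlers discover the links it contains.
    robotsMeta({ index: false, follow: true });
    // → '<meta name="robots" content="noindex, follow" />'
    ```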

    Why it matters

    • Keep low-value pages (internal search, test pages) out of SERPs
    • Avoid index bloat and diluted quality signals
    • Control index scope alongside canonicals and sitemaps
    • Protect unfinished pages — use noindex on dev/staging/test pages to prevent indexing
    • Manage pagination strategy — some sites noindex page 2+ to keep thin paginated pages out of results (note that search engines may eventually stop following links on long-term noindex pages)
    • Control link equity flow — nofollow signals that link equity should not pass to low-trust pages (Google now treats it as a hint rather than a strict directive)
    • Fine-grained crawler control — set different rules per bot (googlebot, bingbot)

    How to implement

    • To remove a page from SERPs: use noindex, and don't block crawling via robots.txt, or the bot never sees the directive
    • If you still want link discovery: use noindex, follow (follow is the default, so noindex alone behaves the same)
    • Don't list noindex URLs in sitemaps, and serve correct HTTP status codes
    • Use noindex on faceted/filtered pages to prevent index bloat
    • Use X-Robots-Tag HTTP header for non-HTML resources (PDFs, images)
    • Protect staging environments with a site-wide noindex (e.g. an X-Robots-Tag header); robots.txt Disallow blocks crawling but does not guarantee deindexing
    • Verify meta robots with Search Console URL Inspection tool
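
    The rules above can also be checked programmatically. A minimal sketch (the `parseRobots` helper is illustrative, not a standard API) that interprets a robots content string the way crawlers do — absent directives default to the permissive index/follow behavior:

    ```typescript
    // Illustrative parser for a meta robots content string.
    // Absent directives default to the permissive behavior (index, follow);
    // "none" is shorthand for "noindex, nofollow".
    function parseRobots(content: string): { index: boolean; follow: boolean } {
      const directives = content
        .toLowerCase()
        .split(',')
        .map((d) => d.trim());
      return {
        index: !directives.includes('noindex') && !directives.includes('none'),
        follow: !directives.includes('nofollow') && !directives.includes('none'),
      };
    }

    parseRobots('noindex, follow'); // { index: false, follow: true }
    parseRobots('');                // defaults: { index: true, follow: true }
    ```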

    Examples

    ```html
    <!-- Default (can be omitted): allow indexing and link following -->
    <meta name="robots" content="index, follow" />

    <!-- Don't index but follow links (common for pagination, filters) -->
    <meta name="robots" content="noindex, follow" />

    <!-- Index but don't follow links (for external link pages) -->
    <meta name="robots" content="index, nofollow" />

    <!-- Block completely -->
    <meta name="robots" content="noindex, nofollow" />

    <!-- Additional directives: no cached copy, no snippet in results -->
    <meta name="robots" content="noarchive, nosnippet" />

    <!-- Target specific crawlers -->
    <meta name="googlebot" content="noindex" />
    <meta name="bingbot" content="noindex" />
    ```
    ```typescript
    // Next.js App Router - layout.tsx or page.tsx
    import type { Metadata } from 'next';

    type Props = {
      params: { [key: string]: string };
      searchParams: { [key: string]: string | string[] | undefined };
    };

    // Set robots rules based on page type
    export function generateMetadata({ searchParams }: Props): Metadata {
      const isFilterPage = Object.keys(searchParams).length > 0;
      const isPaginatedPage = Number(searchParams.page) > 1; // NaN > 1 is false

      return {
        robots: {
          index: !isFilterPage && !isPaginatedPage,
          follow: true,
          nocache: false,
          googleBot: {
            index: !isFilterPage && !isPaginatedPage,
            follow: true,
            'max-video-preview': -1,
            'max-image-preview': 'large',
            'max-snippet': -1,
          },
        },
      };
    }

    // Staging site-wide noindex
    // next.config.js
    // Note: staging deployments often build with NODE_ENV=production,
    // so a dedicated environment variable is safer in practice.
    const isProduction = process.env.NODE_ENV === 'production';
    module.exports = {
      async headers() {
        if (!isProduction) {
          return [{
            source: '/:path*',
            headers: [{ key: 'X-Robots-Tag', value: 'noindex, nofollow' }],
          }];
        }
        return [];
      },
    };
    ```
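
    Outside of Next.js, the X-Robots-Tag header for non-HTML resources can be set at the server level. A minimal sketch using Node's built-in http module; the path-matching rule (`xRobotsTagFor`) is an assumption for illustration, not a standard API:

    ```typescript
    import { createServer } from 'node:http';

    // Illustrative rule: keep PDFs and raw images out of the index via the
    // X-Robots-Tag response header (meta tags can't be added to these files).
    function xRobotsTagFor(path: string): string | null {
      return /\.(pdf|jpe?g|png|gif)$/i.test(path) ? 'noindex' : null;
    }

    const server = createServer((req, res) => {
      const tag = xRobotsTagFor(req.url ?? '/');
      if (tag) res.setHeader('X-Robots-Tag', tag);
      res.end('ok');
    });

    // server.listen(3000);
    ```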
