Noindex
A meta robots directive that forbids search engines from indexing a page.
Noindex is a directive for search robots that forbids adding a page to the index. It is implemented via the meta robots tag or the X‑Robots‑Tag HTTP header. Used for duplicates, cart pages, and utility pages (user account, search results).
What is noindex
Noindex is a directive that tells search engines: 'do not add this page to the index'. The page may remain crawlable, but it will not appear in search results.
Implementation methods
- <meta name="robots" content="noindex"> — in the <head> section of an HTML page
- X-Robots-Tag: noindex — HTTP header (for PDFs, images, non‑HTML files)
When to use noindex
- Duplicate pages (e.g., print versions, sorting parameters)
- Shopping cart page
- Utility pages (user account, search results, captcha)
- Temporary campaign pages (after the campaign ends)
Removal time
Noindex does not remove a page from the index instantly. You must wait for the next bot visit, which will see the directive and drop the page from the index. In Google, this typically takes several days to weeks.
Relationship with other directives
- canonical — if the page is already indexed, canonical is better to consolidate link equity
- robots.txt — blocking via robots.txt prevents crawling, so Google won’t see the noindex directive. Do not combine both.
Common questions
Discuss your project?
Share your goals and website context — I will suggest a practical next step.