This Confluence instance is now read-only, please head over to the Algolia Confluence instance for the same more up-to-date information

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 12 Next »

Problem

I want to prevent certain pages from being indexed and appear in search results.

Solution

You can add data-sj-noindex anywhere in a page and it will not be indexed. Most commonly this will be defined in the header of an HTML page as follows:

  1. Locate the <header> tag of the page you want to prevent from being crawled.

  2. Add the following code within the header:
    <meta name="robots" content="noindex" data-sj-noindex />

  3. Save the changes. The crawler will ignore this page next time it comes across it.

Additionally you can use crawling rules to programmatically exclude sections or certain pages of your web site. You can also set individual pages to not be indexed from the data sources tab of the admin Console.


Documentation

Advanced crawler documentation

  • No labels