How does crawling and indexing work?



Answer

Our crawler visits the webpages of the domains you add to your collection. Read “How the crawler works” for more information.

Once the collection is indexed, the crawler periodically revisits every webpage in the index, every 3 to 7 days.
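To make the revisit interval concrete, here is a minimal sketch that computes the window in which the next regular crawl of a page could be expected. The function name `revisit_window` and the sample date are hypothetical, and the sketch assumes the 3 to 7 days are counted from the time of the last crawl, which the text does not state explicitly.

```python
from datetime import datetime, timedelta
from typing import Tuple

# Revisit interval stated in the text: 3 to 7 days.
# Assumption: the interval is measured from the last crawl of the page.
MIN_REVISIT = timedelta(days=3)
MAX_REVISIT = timedelta(days=7)

def revisit_window(last_crawled: datetime) -> Tuple[datetime, datetime]:
    """Return the (earliest, latest) expected time of the next regular crawl."""
    return last_crawled + MIN_REVISIT, last_crawled + MAX_REVISIT

# Hypothetical example: a page last crawled at noon on 1 January 2024.
earliest, latest = revisit_window(datetime(2024, 1, 1, 12, 0))
print(earliest.date(), latest.date())  # 2024-01-04 2024-01-08
```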

If you have our Instant Indexing ping-back code installed on your website, a new or updated webpage is refreshed in the collection when any of the following fields changes and the page is visited within 30 minutes of the change:

  1. title

  2. description

  3. canonical

  4. robots

  5. og:title

  6. og:image

  7. og:description
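The field check above can be illustrated with a small sketch that compares two snapshots of a page's metadata and reports which of the listed fields differ. This is not the ping-back code itself. The function `changed_fields` and the dictionary layout are assumptions made for illustration, and the 30-minute visit condition is not modeled.

```python
# The seven fields listed above; a change to any of them qualifies the page
# for an Instant Indexing update.
TRACKED_FIELDS = (
    "title", "description", "canonical", "robots",
    "og:title", "og:image", "og:description",
)

def changed_fields(previous: dict, current: dict) -> list:
    """Return the tracked fields whose values differ between two snapshots.

    Hypothetical helper: a missing field is treated as None, so adding or
    removing a field also counts as a change.
    """
    return [f for f in TRACKED_FIELDS if previous.get(f) != current.get(f)]

# Hypothetical snapshots of the same page before and after an edit.
old = {"title": "Pricing", "description": "Our plans", "robots": "index,follow"}
new = {"title": "Pricing 2024", "description": "Our plans", "robots": "index,follow"}

print(changed_fields(old, new))  # ['title']
```

An empty result means none of the listed fields changed, so under the rule described above the page would wait for the regular crawl cycle.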

Instant Indexing does not remove records from the index when a webpage’s status code changes to 404, 403, 301, or 302. The regular crawl cycle, however, does take the status code into account and removes the page from the index if it returns any of these codes.
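The difference between the two paths can be summarized in a short sketch. The function `should_remove_record` and its parameters are hypothetical names used only to restate the rule above as code. They do not describe the crawler's actual implementation.

```python
# Status codes that lead to removal during the regular crawl cycle,
# as listed in the text.
REMOVAL_STATUS_CODES = {404, 403, 301, 302}

def should_remove_record(status_code: int, via_instant_indexing: bool) -> bool:
    """Return True if the record would be removed from the index.

    Instant Indexing never removes records. Only the regular crawl
    cycle acts on the status code.
    """
    if via_instant_indexing:
        return False
    return status_code in REMOVAL_STATUS_CODES

print(should_remove_record(404, via_instant_indexing=True))   # False
print(should_remove_record(404, via_instant_indexing=False))  # True
print(should_remove_record(200, via_instant_indexing=False))  # False
```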

 
