Google To Officially Stop Supporting NoIndex Directives from Robots.txt


The search engine giant will officially stop obeying the robots.txt noindex directive on 1 September 2019. Publishers still relying on the robots.txt noindex directive will need to use an alternative to keep Google from indexing their pages. According to the company, Google never documented robots.txt rules such as nofollow, crawl-delay, and noindex. Although Googlebot honored noindex unofficially in the past, it will no longer do so.

Google announced the withdrawal of support for the directive on 2 July 2019 in a post on the Google Webmasters Blog. The company confirmed that robots.txt noindex was never officially supported and that its crawlers will stop honoring it from 1 September 2019. Google also shared the post on Twitter, bidding goodbye to the unsupported robots.txt rules.

Alternative Ways From Google to Control Indexing

Google published five alternative methods to control indexing on its official blog:

  • Noindex in robots meta tags: While robots.txt noindex is no longer supported, the noindex directive remains supported both in HTML (as a robots meta tag) and in HTTP response headers. It is the most effective way to keep URLs out of the index when crawling is allowed.
  • 404 and 410 HTTP status codes: Both status codes tell search engines that the page at the given URL does not exist. Google drops such URLs from its index once they have been crawled and processed.
  • Password protection: If a page sits behind a login, Google will generally drop it from the index, unless markup is used to indicate subscription or paywalled content.
  • Disallow in robots.txt: Search engines can only index pages they know about, so blocking a page from being crawled usually means its content won’t be indexed. A search engine may still index the URL if other pages link to it, but Google aims to make such pages less visible when its crawlers cannot see the content.
  • Search Console Remove URL tool: This tool offers a quick way to temporarily remove a URL from Google’s search results.
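
To make the first two methods concrete, here is a minimal sketch of a Python WSGI application that sends a noindex signal for some pages and a 410 status for others. The paths and the names NOINDEX_PATHS, REMOVED_PATHS and app are hypothetical and chosen only for illustration; the X-Robots-Tag header and the robots meta tag are the standard ways to express noindex. A real site would usually set these in its web server or CMS configuration instead.

```python
import html
from http import HTTPStatus

# Hypothetical paths, used only for illustration.
NOINDEX_PATHS = {"/internal-report"}   # crawlable, but should stay out of the index
REMOVED_PATHS = {"/old-campaign"}      # permanently removed pages

# Standard robots meta tag carrying the noindex directive.
NOINDEX_META = '<meta name="robots" content="noindex">'


def app(environ, start_response):
    """Minimal WSGI app showing noindex (header + meta tag) and 410 responses."""
    path = environ.get("PATH_INFO", "/")

    if path in REMOVED_PATHS:
        # 410 Gone tells crawlers the page no longer exists.
        status = HTTPStatus.GONE
        start_response(
            f"{status.value} {status.phrase}",
            [("Content-Type", "text/plain; charset=utf-8")],
        )
        return [b"This page has been removed.\n"]

    headers = [("Content-Type", "text/html; charset=utf-8")]
    head = ""
    if path in NOINDEX_PATHS:
        # Either signal is enough on its own; both are shown for comparison.
        headers.append(("X-Robots-Tag", "noindex"))
        head = NOINDEX_META

    body = (
        f"<!doctype html><html><head>{head}</head>"
        f"<body>Page at {html.escape(path)}</body></html>"
    )
    start_response("200 OK", headers)
    return [body.encode("utf-8")]
```

To try it locally, the app can be served with the standard library, for example wsgiref.simple_server.make_server("", 8000, app).serve_forever(). Note that a noindex page must not also be disallowed in robots.txt, because the crawler has to fetch the page to see the directive.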


Make sure you are not using robots.txt noindex on any pages. If you are, we recommend switching immediately to one of the methods above to keep those pages out of the index.
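
One way to run that check is a small script that scans a robots.txt file for the unsupported rules named earlier (noindex, nofollow, crawl-delay). The sketch below is only an illustration: the function name find_unsupported_rules and the sample file contents are assumptions, and it uses simple pattern matching, not Google's own parser.

```python
import re

# Rules the article lists as unsupported in robots.txt.
UNSUPPORTED = re.compile(r"^\s*(noindex|nofollow|crawl-delay)\s*:", re.IGNORECASE)


def find_unsupported_rules(robots_txt: str):
    """Return (line_number, rule_name, rule_text) for each unsupported rule found."""
    hits = []
    for number, line in enumerate(robots_txt.splitlines(), start=1):
        rule = line.split("#", 1)[0]  # ignore anything after a comment marker
        match = UNSUPPORTED.match(rule)
        if match:
            hits.append((number, match.group(1).lower(), rule.strip()))
    return hits


# Hypothetical robots.txt contents for demonstration.
sample = """User-agent: *
Disallow: /private/
Noindex: /drafts/
Crawl-delay: 10
"""

for number, name, rule in find_unsupported_rules(sample):
    print(f"line {number}: unsupported rule '{name}' -> {rule}")
```

Any line the script reports is a candidate for replacement with one of the supported methods listed above.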
 

© 2024 Leading Edge Info Solutions (P) Ltd. All Rights Reserved.
