Google To Officially Stop Supporting NoIndex Directives from Robots.txt


Listen to this article

The search engine giant is officially going to stop obeying robots.txt noindex directive starting 1 Sep 2019. Publishers still using robots.txt noindex directive will be required to use an alternative to stop Google from crawling or indexing their pages. According to the company, Google never documented unsupported robots.txt rules such as no follow, crawl-delay, and no index. And since Google Bots used to support it unofficially in the past, it will not be the case hereafter.

Google announced to withdraw robots.txt directive support on 2 July 2019, through a post on Google Webmasters Blog. The company confirmed that robtos.txt noindex was never officially supported by Google and starting today, the crawlers will also stop supporting it. Google shared the blog through twitter bidding its goodbyes to the unsupported robots.txt rules.

Alternate Ways From Google to Control Indexing

Google published 5 alternative methods to control crawling on its official blog:

  • Noindex in robots meta tags: While robots.txt no index is no more supported, meta tags with no index directive in either HTML header codes or HTTP response headers have become the most effective way to stop URLs from indexing. 
  • 404 and 410 HTTP status codes: These status codes inform the search engines that the page on respective URL does not exist. Google automatically drops these URLs after crawling them once.
  • Password protection: If a page is hiding behind a login, Google will drop it from the index unless the password protection indicates either paywalled content or subscription.
  • Disallow in robots.txt: If the search engines don’t know about a page because they were blocked from being crawled, it implies that their content won’t be indexed. Search engines may index a URL if other pages have links to it but Google aims to make pages less visible if the crawlers cannot see the content.
  • Search Console Remove URL tool: The tool is widely known for temporarily removing a URL from Google’s search results.


Make sure you are not using robots.txt no index on any pages. If you are, we recommend you to immediately use one of the above-mentioned methods to avoid your pages from indexing.
 

Related Post

blog
Link Building Strategy: Link Building Trends Ruling The Digital Space

Listen to this article Anyone pondering the fact that links still have the power to affect rankings should understand the significance of backlinks on their webpages. Though Google considers many factors while ranking a webpage on SERPs, the authority of your webpages and domain are

...
blog
Updates From Google On Shopping Insights (Product Search Data And Trends)

Listen to this article The marketers and retailers can get more comparative brand data with the latest update of Google   Google announced a latest version of the Google insights for the shopping insights. It provides you with the data on what people are searching

...
blog
Google My Business Post: How We Helped Our Client Increasing Website Traffic

Listen to this article Since Google has allowed to publish posts on its popular platform, Google My Business (GMB) using website or any of its app version for Android or iOS devices. It gives industries, organizations, & non-government services a platform to mainstream their businesses

...

© 2024 Leading Edge Info Solutions (P) Ltd. All Rights Reserved.

To Top
Book a Slot