Google To Officially Stop Supporting NoIndex Directives from Robots.txt


The search engine giant is officially going to stop obeying robots.txt noindex directive starting 1 Sep 2019. Publishers still using robots.txt noindex directive will be required to use an alternative to stop Google from crawling or indexing their pages. According to the company, Google never documented unsupported robots.txt rules such as nofollow, crawl-delay, and noindex. And since Google Bots used to support it unofficially in the past, it will not be the case hereafter.

Google announced to withdraw robots.txt directive support on 2 July 2019, through a post on Google Webmasters Blog. The company confirmed that robtos.txt noindex was never officially supported by Google and starting today, the crawlers will also stop supporting it. Google shared the blog through twitter bidding its goodbyes to the unsupported robots.txt rules.

Alternate Ways From Google to Control Indexing

Google published 5 alternative methods to control crawling on its official blog:

  • Noindex in robots meta tags: While robots.txt noindex is no more supported, meta tags with noindex directive in either HTML header codes or HTTP response headers have become the most effective way to stop URLs from indexing. 
  • 404 and 410 HTTP status codes: These status codes inform the search engines that the page on respective URL does not exist. Google automatically drops these URLs after crawling them once.
  • Password protection: If a page is hiding behind a login, Google will drop it from the index unless the password protection indicates either paywalled content or subscription.
  • Disallow in robots.txt: If the search engines don’t know about a page because they were blocked from being crawled, it implies that their content won’t be indexed. Search engines may index a URL if other pages have links to it but Google aims to make pages less visible if the crawlers cannot see the content.
  • Search Console Remove URL tool: The tool is widely known for temporarily removing a URL from Google’s search results.


Make sure you are not using robots.txt noindex on any pages. If you are, we recommend you to immediately use one of the above-mentioned methods to avoid your pages from indexing.
 

Related Post

blog
Leading Edge Info Solutions (P)Ltd. Featured as a Most Reviewed Company in India

It’s been nine years since we’ve been in business. Driven by a passion for service, Leading Edge Info Solutions Pvt. Ltd. has helped multiple businesses with their advertising. We’re always excited to see our partners grow, which is why we’re glad to be on The

...
blog
Top 5 Digital Marketing Strategies for Dentists

Are you a dentist who recently opened a new dental clinic and is searching for quality leads? Are you facing slow traffic and feeling clueless about how to attract more patients to your dental clinic? As a dentist, establishing a strong online presence is essential

...
blog
Digital Marketing Trends That You Can’t Ignore in 2021

2020 successfully upended how most businesses operate, thanks to COVID-19. We witnessed more services and products offered online, with many employees working remotely. While customer responses to this change were varying, we knew that the upcoming marketing trends would likely make the situation normal.  Engaging

...

© 2024 Leading Edge Info Solutions (P) Ltd. All Rights Reserved.

To Top
Book a Slot