Sunday, July 27, 2014

Testing robots.txt files made easier

To crawl, or not to crawl, that is the robots.txt question.

Making and maintaining correct robots.txt files can sometimes be difficult. While most sites have it easy (tip: they often don't even need a robots.txt file!), finding the directives within a large robots.txt file that are or were blocking individual URLs can be quite tricky. To make that easier, we're now announcing an updated robots.txt testing tool in Webmaster Tools.

You can find the updated testing tool in Webmaster Tools within the Crawl section:

Here you'll see the current robots.txt file, and can test new URLs to see whether they're disallowed for crawling. To guide your way through complicated directives, it will highlight the specific one that led to the final decision. You can make changes in the file and test those too, you'll just need to upload the new version of the file to your server afterwards to make the changes take effect. Our developers site has more about robots.txt directives and how the files are processed

Additionally, you'll be able to review older versions of your robots.txt file, and see when access issues block us from crawling. For example, if Googlebot sees a 500 server error for the robots.txt file, we'll generally pause further crawling of the website.

Since there may be some errors or warnings shown for your existing sites, we recommend double-checking their robots.txt files. You can also combine it with other parts of Webmaster Tools: for example, you might use the updated Fetch as Google tool to render important pages on your website. If any blocked URLs are reported, you can use this robots.txt tester to find the directive that's blocking them, and, of course, then improve that. A common problem we've seen comes from old robots.txt files that block CSS, JavaScript, or mobile content — fixing that is often trivial once you've seen it.

We hope this updated tool makes it easier for you to test & maintain the robots.txt file. Should you have any questions, or need help with crafting a good set of directives, feel free to drop by our webmaster's help forum! Know more here at

http://googlewebmastercentral.blogspot.in/2014/07/testing-robotstxt-files-made-easier.html

11 comments:

  1. Online Kundli matching for marriage by rashi & nakshatra. You can enter the birth rashi and nakshatra of the boy and girl.
    Thanks
    BinduBhatia

    ReplyDelete
  2. Main purpose of adding robots file in our website is let the Google to not crawl the particular page and URL which is mentioned in Robots file.
    Web Development Company | Website Design Companies

    ReplyDelete
  3. This blog is really helpful to generate the robots file for the website.I hope these step would helpful for beginners to make the robots file.
    Web Designing Companies Bangalore | Web Development Company Bangalore

    ReplyDelete
  4. Its very interesting blog and there are very useful information. I study your complete blog and I Find some important points which are very useful for us.
    Thank you,
    Website Redesign Services

    ReplyDelete
  5. Robots.txt file is the important factors in ON page optimization of SEO process.
    Website Design Companies London | Web Design Companies London

    ReplyDelete
  6. Your blog article is very intersting and fanstic,at the same time the blog theme is unique and perfect,great job. Bangalore Web Services

    ReplyDelete
  7. Great article. I like your post. This is very useful. Thanks for sharing this post.
    SEO Training Institute in Chennai

    ReplyDelete