Google Webmaster Tools has a new tool that would certainly be useful to site owners in making their site Googlebot friendly – the Robots.txt generator. Basically, what this translator does is to automatically generate robot.txt file. With a few clicks of the mouse, you can easily have your site’s robots.txt file. Thus it eliminates the technicalities of creating Robots.txt that has haunted novice webmasters like me before.
Using the Robots.txt generator, webmasters can easily instructs any robots which files or folders in your site’s root directory should be crawled by the Googlebot. You can even choose which specific robot you want to have access to your site’s index and restrict other robots from doing the same thing. Similarly, you can further refine this crawling activities by specifying which robot should access certain files in your root directory and which robot should access another file.

Although this might seem a pretty useful tool, there are still some limitations to its application. Some robots may ignore instruction in the robots.txt and continue to craw your site including those files or folders that have restrictions. So for highly sensitive files or documents, it would still be wise to put them behind password protection.
Although it is guaranteed that Googlebots will recognize your robots.txt based on the guidelines that you indicated when you were generating your robots.txt, other major robots may actually ignore it.
But still, the robots.txt generator is a great addition to the Webmaster Tools, as it will definitely make the lives of webmasters a little bit easier than before.








Comments
5 responses so far ↓
bluerank on Mar 28, 2008 at 10:49 am
useful info, thx
Terry Reeves - Memphis Seo on Mar 28, 2008 at 1:44 pm
Now there is no excuse at all for not having this file.
Loren Baker, Editor on Mar 28, 2008 at 1:57 pm
Agreed, this is an excellent tool for the small publisher who can not figure out Robots.txt.
Danny also did a great write up here :
http://searchengineland.com/080327-173946.php
Chris Blackwell on Mar 29, 2008 at 8:47 am
You shouldn’t rely on this tool for blocking search engines. You should take at least 15 minutes and understand the different robots.txt commands you can use and how to block different bots. Plus, remember that this won’t work with human powered search engines like Mahalo.
CBR on Mar 30, 2008 at 8:20 pm
Chris - I agree with you on this one, in the past I have a made errors with robots.txt and a whole directory got deindexed as a result. It took a while to recover from it. Take the time to learn it and then use webmaster tools to verify your work.
Leave a Comment