Frankly many of us are not even aware that there is something called robots.txt and what it does and what is the importance of using robots.txt in your blogs and websites.
Infact its not a compulsion that you need to have a robots.txt in your blog, in fact you don’t need one but still if you want to pass on any special instructions to search engine spiders then robots.txt is a must have file on your server, in this article we will see how you can effectively utilize this text file for your wordpress blogs when you want to pass on any special instructions to search engines.
First thing a search engine spider looks for when it reaches a site or blog is robots.txt file to see if there is any special instructions, instructions may be like you may not want search engines to crawl some parts of your blogs or some sections of your blog and these kind of instructions can be passed to search engine spiders with this file. But mostly you won’t be able to locate this file through your ftp program, so if you want to access your robots.txt file from your admin panel and alter it then you must use this plugin called KB Robots also this plugin is of any use only if your blog is hosted in the root of your server not to some other directories like say www.yourdamin.com/blog if that is the case create your custom robots.txt file in your machine and upload it to the root of your server most cases under public_html folder
Wordpress comes with a default robots.txt file that looks like this
User-agent: *
Disallow:
What does this mean? it means allow all the search engine bots to crawl through your entire blog without any exception.
To ban all the search engine bots from crawling through your entire blog you can use this code.
User-agent: *
Disallow: /
This will ban all the search engine bots from crawling any parts of your blog and will stay away from your blog which of course we don’t want.
You can object specific bots from crawling your webpages like Googlebot or Msn bot then you can pass that instruction to that specific bot like this.
User-agent: Googlebot
Disallow: /
In the code above we have asked Googlebot to stay away from the site, you can find the list of robots here
To allow only specific bots and ban others then you can do it like this.
User-agent: Googlebot
Disallow:
User-agent: *
Disallow: /
To disallow specific sections of your blog from being crawled then you can pass the instruction by using the code like this.
User-agent: *
Disallow: /archives/
you can substitute the archives with whatever you want, if you don’t want specific category not to be crawled then replace it with that category name and that category will not be indexed and crawled by search engines.
This is a copy of my robots.txt you can access it at http://seoyourblog.com/robots.txt
you can see even google uses a robots.txt file http://google.com/robots.txt almost all the websites on the internet uses robots.txt file and you can access this file by adding a robots.txt at the end of their domain.
Tags: changing robots.txt, wordpress robots.txt
Related posts:



I am Chakkravarthi, full time Blogger, webmaster and wordpress enthusiast . Apart from blogging I like to dwell in to SEO world most of the time.Use wordpress and support the open source community.
{ 2 comments… read them below or add one }
hi, thanks for your post. i faced this problem..I noticed that my traffic only come from links from other blogs/websites..not from search engine..then i change the robots.txt file using PC Robots.txt
how to know whether our blog has been indexed by search engine? Is it by checking it from the google webmaster tools again?
thanks.
Babyblogger´s last blog ..Letak navigation link pada post
You can do a search in Google with site:yoursiteurl.com it will display all the pages that are indexed by Google and yes webmaster tools is another way to know the number of indexed pages.