Using Robots.txt for WordPress Blogs

August 18th, 2009

Robots.txt file is very important for any website and especially for WordPress. Robots.txt helps search engines know what which content, file, directory is to be indexed and which is not. This helps avoid duplicate content on your blog.

Robots.txt for WordPress Self-Hosted blogs. This is just an example, you can customize it according to your blog.

User-agent: *
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /trackback
Disallow: /feed
Disallow: /comments
Disallow: /category/*/*
Disallow: */trackback
Disallow: */feed
Disallow: */comments
Disallow: /*?*
Disallow: /*?
Allow: /wp-content/uploads

# Google Image
User-agent: Googlebot-Image
Disallow:
Allow: /*

# Google AdSense
User-agent: Mediapartners-Google*
Disallow:
Allow: /*

# Internet Archiver Wayback Machine
User-agent: ia_archiver
Disallow: /

# digg mirror
User-agent: duggmirror
Disallow: /

Sitemap: http://www.example.com/sitemap.xml
SocialTwist Tell-a-Friend

Leave A Comment