Aug 6 2008

robots.txt tips for WordPress to prevent Duplicate Content

When you use the Wordpress.org blog script there is a lot of duplicate content created from the various pages and category pages that the WordPress script creates. If you’ve been reading about SEO, you probably know Google doesn’t like duplicate content and therefore your blog will likely be penalized for it.

Thanks to reading a post by Jeremy at ShoeMoney.com and reading about different User-agents, I was able to create a robots.txt file that will help eliminate duplicate content from Google indexing.

Add the following in your robot.txt file to prevent Google from indexing duplicate content in your WordPress blog:

User-agent: Googlebot

Disallow: /trackback/
Disallow: /wp-admin/
Disallow: /feed/
Disallow: /archives/
Disallow: /index.php
Disallow: /*.js$
Disallow: /*.inc$
Disallow: /*.css$
Disallow: */feed/
Disallow: /feed
Disallow: */trackback/
Disallow: /page/
Disallow: /pages/
Disallow: /tag/
Disallow: /category/
Disallow: /*?

User-agent: Googlebot-Image
Disallow: /wp-includes/

User-agent: Mediapartners-Google*
Disallow:

User-agent: ia_archiver
Disallow: /

User-agent: duggmirror
Disallow: /

The above should help with duplicate content on your WordPress blog website.

3 Comments on this post

Trackbacks

  1. Tony said:

    Your site regarding robots.txt tips for WordPress to prevent Duplicate Content looks very interesting to me. I found it doing a search for money tips.

    August 18th, 2008 at 11:55 pm
  2. Money Talk said:

    There is more reason to comment than ever before! Great post! I searched for a while to find the right answer to my questions!

    August 20th, 2008 at 5:32 pm
  3. Egor said:

    Hello webmaster been surfing the net for Seo Tips and found your blog reg ts.txt tips for WordPress to prevent Duplicate Content. You relly know your stuff! I\’d like to see more posts here. Will definitely bookmark this one and come back.

    August 22nd, 2008 at 5:52 am

LEAVE A COMMENT

Subscribe Form

Subscribe to Blog