Need a WordPress website this weekend? Start here...

Refreshing Your Pillar Content By Recycling Blog Posts

(Reading time: 5 – 8 minutes)

You have hundreds of articles buried in your archives, spread over months, possible years of time. Good articles, not recycled pap mindlessly parroting what everyone already hears in the echo chamber.

What to do with these shining pearls of wisdom?

You could:

  • Leave them alone. Anonymous. Neglected. Lonely.
  • Repurpose them for use as guest posts or for publication in article directories.
  • Revise and republish your best work, bringing timeless, accurate and updated information to a fresh generation of readers.

Let’s find our best articles. Let’s revise and republish them and for your latest cohort of readers. Your regular readers need a break anyway. There’s only so much brilliance (or BS) anyone should be expected to tolerate.

Before we get started, pop on over to Site Sketch 101 and read Nicholas Cardot‘s article on Recycling Blog Posts. Let’s recap: You want to recycle old posts to 1. save time; 2. build page rank; 3. build on previous successes. You increase the value of older articles by 1. leaving the permalink alone; 2. retitling if necessary; 3. revising or rewriting for clarity and accuracy; 4. adding new ideas, and 5. removing outdated material.

As it turns out, Nicholas’ in-depth article seems to have been inspired by Blogussion‘s article The Power of Recycling Posts. Alex, the author of the Blogussion article, notes that commentors often provide fresh insight about the article, which is very useful for updating and expanding on your existing article.

Recycling your older articles may or may not be an effective strategy for you. Even if you decide to recycle articles, not all your articles may be appropriate for recycling. Let’s take a closer look.

Is recycling blog posts right for you?

That’s a very good question, but I can’t really answer that question for you. You have to answer it for yourself. Here’s my point of view:

  1. There’s 25 articles in the Website In A Weekend eCourse. These articles need to be kept accurate and up to date. Putting the updated articles back in the feed motivates me to keep them fresh, and provides new readers a hook into the eCourse. This provides a twice-yearly update for each article in the eCourse.
  2. The 25 eCourse articles are part of the WordPress 101, which comprises the entire pillar content on Website In A Weekend. These articles must be maintained. Articles which require extensive revision will be republished.
  3. Many of the older articles on Website In A Weekend attract search engine traffic, but most were written before there was any commenting activity here. There’s a lot of good content with no comments at all. Recycling will put this in the hands of people who will find it valuable.
  4. I’m interested in the actual behavior of readers and search engines, not assumed behavior or “how things ought to be.” Only way to find out is by recycling articles myself and testing the results.
  5. Some people seem adamantly opposed to any form of revision or republication. I’ve carefully considered their opinions. People who aren’t taking “blogging” seriously, as in, making it part of their income or branding efforts, aren’t going to have much influence on my decisions.
  6. No way I’m going to revise 25% or less of some article and repost it as new content. By new content, I mean creating a new, unique permalink. That’s just silly to me.
  7. The capability to revise is one of the most powerful aspects of publishing on the web! For good or for ill, this is true. I prefer to leverage this for good uses. I’ve seen it leveraged for other reasons, but I will not publish such. I can’t prove anything and I don’t have the cash to fight a SLAPP. Shoot some pool with me sometime, I’ll tell you a story of flames and firemen and freeways. But I’m digressing…

    In 1994, I recall thinking: “Wow! I can rewrite stuff that’s incorrect!” So count on it.

  8. I consider my permalinks the most valuable asset I create on Website In A Weekend. Recycling articles is an excellent way to maintain ensure the value of my permalinks increase: better content gets more widely promoted.
  9. The Website In A Weekend editorial policy explicitly states articles on Website In A Weekend are subject to revision. Plans for revising and republishing have been in the works for months. Writing comprehensive articles takes time, a lot of time. Publishing first, then revising as new source material becomes available, or tehcnical changes requires updating, just makes perfect sense.

Should you choose to recycle, I’m going to break ranks and recommend you examine your permalink. Most people advise against changing permalinks, but if you really need to fix a bad one, this is a good time. The key is to have a plan.

If your permalink is decent, leave it alone.

Fix your permalink, if necessary

Take the time to evaluate an article’s permalink. If you have a crappy permalink on your article, now is a really good time to fix it.

It’s not difficult, but you need to take some care to get it right.

How you choose to create permalinks is largely your own business. There’s several methods, each with advantages and disadvantages. I use Godley’s Redirection plugin to handle permanent redirection tasks. The redirection plugin is not difficult to use, as long as you understand what you need to do.

Whether you decide to change permalinks or not, you need to pay close attention once you republish.

Check your RSS feed immediately

When you republish, you need to check your feed directly. Some reader, Google reader for example, only refresh your screen view with items fetched since your last update. These readers check feeds on their own times, and can be an hour or more after you publish before they show your update. This isn’t necessarily a bad thing. I’ve caught major blunders, really crappy drafts accidently published, before they hit the feed.

The problem is it’s difficult for you as Google reader user to force Reader to examine your feed. What you can do is paste your feed directly into your browser URL form, and check it there.

Note: you can’t do this with the same browser you’re logged into Reader. Reader is too smart for that. It checks the cache for an existing subscription, then displays the current state of your subscription. You can check it at Feedburner though. I use Safari or Internet Explorer.

What articles have you revised?

I’d be absolutely delighted to list three people who have revised and republished articles, especially for retaining permalinks while updating content as advocated here.

  1. Holly Jahangiri gets the first slot with Chores: to Pay, or Not to Pay? This article was originally published on Blogger, then moved when Holly self-hosted with WordPress.
  2. Second slot, it could be yours.
  3. Third slot is better than no slot.

Questions? Comments? Leave a note in the comments.

How Much Does Google Punish Duplicate Content On Blogs?

(Reading time: 3 – 5 minutes)

“Duplicate content? What is it and why should I care?”

An excellent question, and a question not easily answered from a technical point of view.

For example, if every one of your blog posts is totally original and published only once, but within two categories which are indexed by a search engine, you may get dinged for having “duplicate content.”

Sounds stupid I know, but there’s reasons, good reasons, which you need to know about.

In a perfect world, everyone sitting down at their computer would write excellent, all-original blog posts and articles. In the real world, search engines hve trouble distinguishing unique content, and can’t figure out the quality of the material.

So the search engines get “gamed” by web pages containing the appearance of useful content.

It’s important that search engines be able to distinguish duplicate content: If they didn’t, the web would disintegrate into pure advertising spam blogs stuffed with keywords, with the same “highly ranked” web page propogating it’s way through the search engine index. Spammers would have every incentive to stuff pages full of keywords, then leverage the traffic to consume higher and higher search results by propogating such pages web-wide. This would suck. The web would become useless. Tragedy of the commons and all that.

Clearly, duplicate content must be dealt with in some way.

Here’s a few of the techniques I’ve read about that are important to understand how duplicate content is handled by search engines.

IP address matching

Google evidently splits the difference by “punishing” duplicate content delivered from the same IP address. This is a pretty good solution. Modern hosting companies allow “addon” domains to be served from subdomains, all of which are (of course) running from the same IP address as the main hosting account. If you want to serve identical content, you need to pay a little more money to get another IP address.

This IP address policy may also reduce the SERPs effectiveness of a group of unrelated websites all served from the same address. I’ve read that you should keep all the content served from a particular IP address “on topic” so that you aren’t “punished” as a spammer, but I don’t know how to evaluate whether this is true or not. Comments, links, more information definitely welcome!

Multiple categories, yes or no?

I recently spent an afternoon discussing this with a business partner, citing Matt Cutts talk at WordCamp. And was really embarrassed when my partner showed me that Matt uses multiple categories on his blog!

Using nofollow tag with multiple categories: Evidently, with clever use of the “nofollow” attribute in your links, you can allow posts to be filed in multiple categories…. so that a post is only indexed by one category and doesn’t suffer duplicate content penalties. My recommendation if you’re just starting out, or you want to keep your thinking confined to what you know best (your blog’s subject matter), pick smart categories defining topics, use only one category for each blog post, and don’t worry about it.

When Google returns a note saying other pages were very similar but not shown, it’s essentially “punishing” the collection of pages.

More information on duplicate content on the web

Here’s a good discussion about duplicate content from Searching Solutions, where the author Justin Smith makes a distinction between “onsite” and “offsite” duplicate content. Onsite duplication is what happens when pages are multiply indexed as discussed above. Unfortunately, when I read the article, I didn’t read any discussion of offsite duplicate content.

In the end, all the discussion in the world cannot answer the question definitively. Only Google can say with certainty exactly how it deals with duplicate content. Everyone else’s opinion is just that, opinion. Find someone whose opinion seems accurate, pay attention to their advice.