7 Excellent Tips for Handling Content Robbers (’cause you cain’t shoot ‘em)

(Reading time: 3 – 5 minutes)

Old Skool Bloggering! We’re taking a trip back in time, to when blogging was new, like last summer. Ok, new to me, but still.

First, a little backstory; context matters.

A week ago I unsubscribed from every one of my 385 RSS feeds.

385 feeds.

The mind boggles… what was I thinking? (we’ve covered some of that ground, more later)

So, Boom! All gone.

I’ve been rebuilding my RSS over the last week, and stopped in to visit one of my favorite pro-bloggers, Gabe Young at Free Blog Help. (Yes, Gabe is a pro, and no, he doesn’t bandy it about.)

Free Blog Help focuses on what I guess you might call “classical” blogging: short, tightly focused articles covering a specific aspect of our craft. In his latest (as of this writing), Gabe found a web scrape thief – and wonders, “now what?”

I was outraged when content theft first happened to me. Adding insult to injury was seeing my article To Digg Or Not To Digg — That is the question posted on Digg, attributed to someone else.

Here’s what I’m doing about it now: not much.

I just don’t care, because I believe this behavior has a limited lifespan preying on unsophisticated users, and my Web Heroes (that’s y’all) are definitely smarter than that. These days, I don’t believe anything on the web that doesn’t have a real face behind it.

Anonymous information has no value.

That being said, there 7 things you can do about it:

  1. Avoid using the letters ESS EEE OHH in the title element, url or any header element. Articles with those letters get fully scraped – apparently – within seconds of hitting your feed. It’s truly amazing. It’s like pr0n or something.
  2. Avoid trending topics in those same elements. I published 2 articles on the EYE PAT this week, and again, both were scraped instantly.
  3. Consider moving to partial feeds. WordPress doesn’t really handle partial feeds very well, and people generally stink at writing teaser copy, but a partial feed will stop a lot of this.
  4. Internal linking is how I catch most of it. If you aren’t doing any internal linking, consider making it a regular practice. I’ve heard it’s good for search results too.
  5. Consider whether fighting it might be a waste of time. I don’t fight it anymore because the last time I tried, I spent hours attempting to determine where the site was actually hosted. Turns out it had some sort of shrouding or something, and none of the hosting companies would own up to it. “Not our problem.”
  6. Posting your article to Digg right after publishing gets you a time stamp, should that ever be necessary. It will certainly help when someone else posts it to Digg with their attribution.
  7. Assume the universe will send you customers who are reading your stuff because you wrote it, not because they found some article in a search engine, which is displayed on a content scraper.

Check out Gabe’s articles if you haven’t already:

I hope you enjoyed this little back-to-basics excursion. Sometimes, we get all caught up in the “bigger picture,” and forget it’s made up of little pieces like this.

Hrm… if you thought that “hardening” your blog posts against content theft might be a part of the Blog Maintenance Challenge Curriculum, you might be right!

Here’s a few questions:

  1. When did you last have something ripped off, and why do you think they wanted your article?
  2. What did you do about it?
  3. How did you find out?

Comments

  1. To anybody considering it: please please don’t move to a partial feed. You’ll lose a considerable proportion of your RSS readers – myself included – and those people generally won’t bother coming back to your blog to read every article. Partial feeds are too much trouble to bother with, generally.

    If you run WordPress, get yourself the “RSS Footer” plugin and set it up so it links back to your original post with your original byline.

    Personally I don’t mind if people spread my articles around, so my RSS footer content is this:

    Share this
    Do you know somebody else who would find this interesting or useful? Please forward it to them.
    
    Did somebody forward this post to you? Visit %%BLOGLINK%% and subscribe to receive the posts for free!
    
    Originally published here: %%POSTLINK%%

    (%%BLOGLINK%% and %%POSTLINK%% are resolved by the plugin to link to my blog and the specific post, respectively)

    You could put a copyright notice there too, and your name if you’re the only one who posts to your blog.

    Now if some scraper pulls your feed, at least you get a backlink out of it :)
    .-= Ricky Buchanan´s last blog ..SpeakingFox: Tell Firefox To Talk =-.

  2. jan geronimo says:

    Reverting to partial feeds? Ugh. Don’t know about you, but finding that an interesting blogger uses partial feeds makes me itch to unsubscribe.

    Have to admit though that I can also find the latest posts on my favorite people on my Twitter lists and Facebook page so partial feeds can still be an acquired taste. :)

  3. Thanks Dave,
    You’ve introduced me (again!) to something I knew zero about. I need to stop hanging around here, you keep exposing my inadequacies ;)

    That said, good to know. Just not even sure what to do with this information, other than maybe try out the footer plugin Ricky mentioned.
    .-= Eleanor Edwards´s last blog ..Dr Egg’s Hair Shaving Adventure =-.

  4. Hmm, inspired by your massive unsubscription mayhem, I think I have to do the same. I’m at 200+ for which only handful I actually read nowadays?

    As for content scraping; I do internal linking, not because of scrapers, but because it’s good for everyone, including SEO as you mentioned. Apart from that, I don’t care. I’ve stopped self-submitting to Digg or any other bookmarking site a while ago.
    .-= Antti Kokkonen´s last blog ..Achieve nothing – Little advice for getting nowhere =-.

    • Dave Doolin says:

      Do it!

      You’ll find your way back to where you need to be over time.

      And the people that matter (like me, heh…) won’t mind at all. We know you’ll be back if we have something for you, and if not, that’s cool too.

      I have a lot more to say about Digg & friends, article worthy.

  5. Just coming back quickly to point 4: “inline” links should really be on your agenda when you write an article (for SEO purposes but also for your readers). Search engine simply LOVE them because they can’t really be manipulated as easily than links in the footer or side bar. Plus the surrounding text give them more context.
    .-= Tom@NetAccountant´s last blog ..10 web design conventions that will make your website as good as Amazon.com =-.

    • Dave Doolin says:

      Absolutely! I do a fair bit of internal linking, but can be time consuming, and sometimes I slack.

      I also use structured linking techniques, where a certain type of article will always have the same kind of links. The Practical WordPress Tips series always links forward and backwards.

  6. Heather says:

    I had one of my first original ‘manga’ drawings ripped off a few years back; created the creature from scratch, coloured it digitally, then one of my friends saw it somewhere else. Lol.

    Never did find the site though, little annoying.

    Reminds me though, I have to go through my articles and do a spot of interlinking again; been a while since the last time I did it.
    .-= Heather´s last blog ..The Elf Blacksmith =-.

  7. Deacon says:

    You mean someone scraped my eye-paddy-pad article? Hrrm. That’s well, what it is.
    .-= Deacon´s last blog ..I’m a Printmaker, Not an Artist =-.

  8. Deacon says:

    Wow. Between posting my comment above and now, my deliberate practice post was scraped by some site.

    On the one hand, they are ripping off my stuff, on the other hand, I am getting a do-follow link out of it, as are you Dave, since I mentioned you in the post.

    I guess I could call up GoDaddy and complain to them.
    .-= Deacon´s last blog ..I’m a Printmaker, Not an Artist =-.

  9. One incident I remember was a site that was aggregating sites with content similar to mine into their own blog. I wasn’t too pleased that my stuff was getting posted as theirs. However for at least one of those posts the author of the site created a unique post responding directly to one of the articles they scraped, with a backlink to my site. So it was probably a bad implementation of good intentions. That site lasted about a month before it disappeared. I wouldn’t doubt if some others who were getting scrapped complained.

    Only other incident was someone taking my post, translating it into Spanish, and posting it to their blog. No big deal though in that case. They credited my original article, and provided a way for my content to get out to non-english readers. I kind of wish that would happen more often actually.
    .-= K. Praslowicz´s last blog ..Alyssa Milano’s Nikon FA =-.

  10. Ralph says:

    I’m probably just clueless but how do you know if your stuff is being swiped? And if you are too clueless to know, are you too clueless to care?

    • Dave Doolin says:

      Thanks, that’s an excellent article all by itself. I’m *crushed* for time ATM… I’m going to put this one out to the Heroes:

      Anyone care to write up an answer for Ralph? Let me know, and I’ll link to it. (or you can post it here, whatever you want)

      I think there is something to being too clueless to care… up to a point… then you need to work out your own strategy. Because it is infuriating.

    • I found mine out because I had some internal linking in my article. When the scraper’s system posted the article on their site, it fired off a trackback to my site.
      .-= K. Praslowicz´s last blog ..Alyssa Milano’s Nikon FA =-.

  11. Kelly Diels says:

    EVERYTHING I write at ProBlogger gets scraped by a multitude of sites (that’s the ProBlogger cred, not mine), immediately.

    My Cleavage stuff – it’s personal and not related to much that makes money (for anyone except me), so not a whole lot of scraping.

    Possibly I’m being a bit Pollyanna, but I just don’t care. Maybe I’d get to caring if I saw my stuff ranking higher under someone else’s name – but I doubt that will happen in my subject matter.

    (Also, that would require that I ranked highly for any of my keywords. Don’t worry – I’m on it.)

    WRT RSS: I’ve abandoned my RSS reader. It is just so accusatory these days.

    I prefer receiving posts direct to my e-mail inbox – and so a partial feed would (and does) irritate me.

    PS I like what Danielle LaPorte and Chris Guillebeau do with their direct-to-inbox posts (using Aweber, I believe). They make them pretty. I always open them.

    • Dave Doolin says:

      When I drop the posting frequency I may send to the list as well. Right now I think it would be too much.

      High ranking on relevant keywords = passive income.

      On my less-inspired days, I wonder whether it’s any use though. I have this nagging suspicion that any keyword yielding even a few pennies of profit will, at some point, be commandeered by outfits with heavy SEO artillery. We’ll see.

  12. Took me a while but I finally figured out how to send the whole post to the RSS. Didn’t know that was an issue but… It will be interesting to see if anyone bothers to steal my stuff, I think the hunt down and shoot method is good to take care of it. too bad there is not a digital version of that. Thanks for keeping us up to date and interested Dave!
    .-= Justin Matthews´s last blog ..My Blog Is Calling And I Must Go! =-.

  13. Valentina says:

    Let those thieve eat nails (or spiders) and get horrible indigestion. Honestly, I knew that this kind of thievery went on but had no idea that it was so prevalent. I think one of the problems would be centered around too much automation as some services will find and deliver content for you to post – hopefully with credit – or for use as research material for your own, original content.

    hmmmm…. it is a conundrum
    .-= Valentina´s last blog ..WordPress Direct Review =-.

  14. Ralph says:

    Did vou update this posts? Because it looks like you linked to the answer to my question. Checking it out now. Thanks again.
    .-= Ralph´s last blog ..Saturday Bonus =-.

  15. Dave Doolin says:

    Excellent!

    Post an update!

  16. Love how we think alike and in this case, even about the same time.

    Thanks for the plug!
    .-= Gabe | freebloghelp.com´s last blog ..Found a web scrape thief – now what? =-.

  17. Bert Padilla says:

    I know it s*cks, but I will consider the 5th item you mentioned. I personally hate content scraping. If you are asking when was the last time it happened to my blog- my answer is almost every time I published a post! Ridiculous…
    .-= Bert Padilla´s last blog ..Selecting the Best Niche Blog =-.

    • Dave Doolin says:

      Bert, try putting in a couple of internal links in each article, the RSS footer, and tracking your hits.

      Then do the opposite. If you get more traffic with scrapers, let ‘em scrape!

      Seriously, the world is changing. I’ll pay – and have many times – twice as much for an ebook which I know is cutting edge and I can email the author, than something out of date on Amazon. Amazon doesn’t care about me.

      People will learn to NOT care about scrapers or to do business with them soon enough. Build your reputation and be ready to help them when they appear on your virtual doorstep.
      .-= Dave Doolin´s last blog ..Why the Apple iPad Will Make Me More Productive =-.

Trackbacks

  1. [...] the internet, as soon as I try to focus, I lose steam. The design community is saturated with content scrapers, lists, and magazines. I desperately want to avoid that. Having a unique voice is what sets one [...]

Speak Your Mind

*

CommentLuv badge