“the sons of the kingdom will be cast out into the outer darkness; in that place there will be weeping and gnashing of teeth” Matthew 8:12

Google Penguin Update

Hello Boys

Google launched their Penguin update on April 24th, and Matt Cutts wrote about how it was another laudable step in “rewarding high quality sites”.    In the post, Matt made the comment that in many cases:

“Sites affected by this change might not be easily recognizable as spamming without deep analysis or expertise.”

He also linked to the original Google Webmaster Central post about high quality websites that included a list of pointers to keep in mind when building content, which focussed very heavily on the idea of trust:

  • Would you be comfortable giving your credit card information to this site?
  • For a health related query, would you trust information from this site?
  • Would you recognize this site as an authoritative source when mentioned by name?

In the same post, there were notes on the depth of content on web pages being a key determinant of quality – points about whether a piece of writing contained a balanced opinion, or had been properly spell checked.  This is pretty much stuff that any serious business should be focusing on anyway.  Think about a restaurant.  If the floor is dirty and the waiters are rude and can’t answer questions about the ingredients, you’re not going to trust them enough to eat there.

Penguin comes hot on the heels of (and I’m guessing is closely related to) the recent updates about the anchor text used for link, but it’s interesting to note that there are only tangential mentions of links in Matt Cutts’ post:

  •  The net result of making a great site is … more people linking to or visiting a site
  • webspam techniques … link schemes that attempt to propel sites higher in rankings
  • a site with unusual linking patterns that is also affected by this change
  •  the outgoing links are completely unrelated to the actual content

The site that was “affected by this change” is described in the same sentence as outgoing links, and spun content.  In a similar vein, we have this statement in Google’s Webmaster Guidelines:

Don’t participate in link schemes designed to increase your site’s ranking or PageRank. In particular, avoid links to web spammers or “bad neighborhoods” on the web, as your own ranking may be affected adversely by those links.

Google explicitly stating that outbound links to sites they consider to be spam will penalise you.

Recently, we had the situation where large link networks like Build My Rank were getting smashed into millions of glittering pieces.  Those networks were obvious spam.  They used spun content, and posted largely unrelated links into the text which didn’t benefit users of the sites in anything other than the regard of getting away from the crap they were looking at.

Google have talked about a notional 3% of queries being affected by this change compared to the 12% or so from Panda.  That’s interesting because it suggests that it’s more targeted to the kinds of terms that are being actively optimised for rather than the broad strokes impact of Panda that seemed to be more aligned to reducing the value assigned to low quality content that targeted long tail variations with large numbers of pages.

From my reading of the situation, it looks as though there’s a kind of cascade effect through the different stages of web spam that is being handled by Penguin.  Websites with what is assessed to be low quality content that is either spun or poorly written (a la Panda) are being penalised actively, and the outbound links from those websites are being wiped out.  This results in the sites which rely on those links seeing an initial  drop in their rankings.  Subsequent to this drop, the sites which are using this technique are being reviewed separately, and having a further ranking penalty based on the % of their activity which is spammy.

From comments on various boards I get the impression that there are a lot of people who don’t think they’re doing anything wrong.  They talk about the quality of their sites and the amount of effort that they put into them.  This could be naïveté.  These webmasters may have read up on some “SOE” techniques and thought that they were a quick route to success.  There are probably people who fall into this category.  But, I also think that there are plenty of people who in an equally naïve way have built content that they consider to be of a very high quality, but which is rushed and derivative, which they’ve inadvertently stuffed with keywords that they want to target.  Stuff like:

If you're looking for cheap

If you're looking for cheap, low quality content, you've come to the right place...

This type of content structure is rife on the SEO friendly web.  It’s easy to write, and as a result lots of people write it. Or use subtle variations of it across their site, which to an algorithmic eye might seem to be spun.

When you add to this the type of blog spam links that get submitted with a little bit of flattery to the writer, you get a situation with a false positive spun content issue, and also outbound links to unrelated and spammy websites that Matt Cutts alludes to in his post.

Google have added a form for people who think they’ve been unfairly penalised by penguin.  It’s here.

Tagged with:

2 Responses to Penguins, Spinning, Spam, and Outbound Links

  1. im seo Alan says:

    Seems like one of the biggest problems is footer links on irrelevant websites normally put there to share page rank, but I get the feeling that sites that produce things like wordpress themes will be hit also as they must have millions of links on irrelevant sites promoting their themes.

    • Quumf says:

      I don’t disagree with you, especially on the footer links in WP themes. A lot of people will be using the themes without necessarily knowing that those links are actually there, and potentially linking out to sites that fall into a spam category.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>