16 Jun 2005: Interview with a Link Spammer

From earlier this year, an interview with a link spammer in The Register. (TB is mentioned as a fallback for when comment-based link spamming becomes too difficult.)

25 May 2005: Bought and sold.

Judging by some of the recent articles on SpamHuntress (another site dedicated to analysis and eradication of spam, including trackback spam), there are indeed lists of vulnerable weblogs floating around the Internet—just like the lists of live addresses that email spammers buy and sell. Update: More SpamHuntress links, including her catalog of TB spam solutions and the new Spamhuntress Wiki, which includes some very interesting spammer profiles.

25 May 2005: Spammer attacks on WordPress.

Found elsewhere: Analysis of one particular attack on WordPress blogs.

Read the rest of this entry »

24 May 2005: How to get spammed, part 1.

Unclear to me at this point is exactly the mechanism by which trackback spammers find their marks. There’s some evidence that there exist Web spiders which look for Movable Type trackback URLs at conventional locations (in particular, the amount of spam can be reduced by obfuscating this URL; see the previous post on best practices).

Presumably there are also more sophisticated spiders which examine pages for the magic bit of RDF XML which identifies the correct Trackback location; obviously obfuscation doesn’t help here. But how do the spiders find weblogs? It’s likely that they follow links from other weblogs that they know about. This would perhaps help account for the power-law distribution in spam: the more popular your site, the more inbound links, the more visible your site to attackers.

Here’s an interesting corollary, gleaned from the WordPress 1.5 announcement:

WordPress 1.5 aims to bring the joy back to comments. […] if you forget about your blog for a little while you won’t come back to find your domain a nest of spam (which begets more spam) […]

The quote is part of a section touting the new whitelisting features in version 1.5, meant to keep the comments flowing from known-good writers. But why would spam beget more spam?

What Matt seems to be saying is that, much like email spammers who use various techniques (Web bugs in spam emails, for example) to determine that an email address is “live”, weblog comment and Trackback spammers may rely on successfully posted spam links to identify vulnerable sites. If a spammer sees your website in his referrer logs (even if none of your readers click on the spam links, Google sure will), he knows your site is ripe for further exploitation.

It would be interesting to observe the increase in comment spam associated with a few deliberate clicks to spam websites (using your weblog URL as the referrer). Is it possible to go from zero to spam in this way?

Let’s find out!

24 May 2005: We hardly knew ye

No contemporary discussion of the viability of Trackback would be complete without a reference to Tom Coates’ fatalistic article last month: Trackback is dead. Are Comments dead too?

24 May 2005: Conventional wisdom.

Rounding up some reasonably authoritative links on current best practices in Trackback spam prevention:

Learning Movable Type: Trackback spam (Feb. 2, 2005). Techniques mentioned:

content-based filtering (the obvious approach, inspired by email anti-spam techniques) obfuscation (juggling the the TB URL as well as nearby text; more examples here) blacklisting (ignoring known spam IPs) expiration (turning off TBs for old posts)

Plenty of additional links at the end of the article.

Matt Mullenweg: Trackback spam (Jan. 5, 2005). The original post isn’t much, but the discussion (in the form of comments, and, yes, trackbacks) offers a pretty good look at current best practices (geared toward WordPress users). Add to the above list of techniques:

moderation (simply involve a human for every comment or TB received) whitelisting (remembering “known good” URLs, as pointed out by Matt in the comments; see the 1.5 announcement for more info)

WordPress wiki: Trackback Spam Tools/Plugins.

23 May 2005: Hello world!

The manual said,

Hello world!

Welcome to WordPress. This is your first post. Edit or delete it, then start blogging!

I took it to heart.

You can see the results.

[ http://www.cdn.coralcdn.org/noredirect.html


You are viewing a mobilized version of this site...
View original page here

Mobilized by Mowser Mowser