Spammers are employ­ing a new tac­tic to attack blogs, and it’s a tac­tic that could bring down anti-spam mea­sures pro­tect­ing not only the Blogosphere, but e-mail too.

Blog spam­mers are attack­ing blogs with their typ­i­cal mes­sages, but they have a new, inge­nious, and poten­tially cat­a­strophic trick. They’re incor­po­rat­ing links to legit­i­mate, respectable domains into those attacks. The net result is that auto­mated spam fil­ters, even so-called “smart fil­ters” like Dr. Dave’s Spam Karma plug-in sys­tem for WordPress-based blogs, are Blacklisting domains like CNN​.com, IMDB​.com, MacCentral​.com, and dozens of others.

The anti-spam logic engine on one of my high search engine rank­ing sites, for exam­ple, Blacklists 15–25 respectable domains daily. Should I actu­ally get a com­ment from some­one at MacCentral​.com, for exam­ple, the anti-spam sys­tem will kill the com­ment before I see it. In the past that site has received com­ments from peo­ple work­ing for, and e-mailing from, MacCentral​.com.

Although some­one might have already coined a dif­fer­ent term, I call this type of spam­ming Whitelist Attacks.

This is only the begin­ning of the Whitelist Attacks. Beginning with one or two per week 90 days ago, Whitelist Attacks are now up to an aver­age of two dozen per day on each of my Web sites and those of sev­eral other pro­fes­sional blog­gers. Whitelist Attacks are effec­tive, and their scope and fre­quency is increasing.

Whitelist Attacks are designed to accom­plish two goals:
1. Exploit Whitelists of respectable domains to sneak past spam fil­ters, and;

2. Cause a suf­fi­cient num­ber of erro­neously Blacklisted domains that blog­gers and e-mail admin­is­tra­tors aban­don auto­mated fil­ters entirely.

While the top spam fil­ter­ing engines are cur­rently too smart for goal #1 to work, #2 is quickly becom­ing a reality.

Follow Whitelist Attacking through to its log­i­cal con­clu­sion: Spam bots record where they attack–let’s say they try to hit YourBlog​.com. Once they’ve found and attacked that site, they can eas­ily incor­po­rate that domain into their future attacks against other sites. Therefore, hun­dreds (and even­tu­ally hun­dreds of thou­sands) of auto­mated spam fil­ters will begin Blacklisting YourBlog​.com. In the end, the blog­ging com­mu­nity will be crip­pled by the fact that we’ve all Blacklisted each other’s domains.

The effects won’t be lim­ited to pre­vent­ing blog­gers from com­ment­ing on each other’s blogs.

The Blacklists gen­er­ated by blog spam fil­ters are fre­quently shared–even among non-bloggers–and often exported for use as e-mail Blacklists. Imagine some, then half, then most of your per­sonal and pro­fes­sional e-mail being unde­liv­er­able because an auto­mated sys­tem has your domain on a Blacklist.

Blacklists are also often pub­lished online–viewable by the pub­lic and indexed by search engines. What would be the dam­age to your rep­u­ta­tion of being pub­licly labeled as a spam­mer? If you work at the Gap, it prob­a­bly wouldn’t bother you too much–it might even raise your street credit. But if you’re a professional…

Three-List Exploits
Current anti-spam sys­tems typ­i­cally eval­u­ate blog com­ment and e-mail con­tent look­ing for bad words, known bad domains, or the incor­po­ra­tion of more than an arbi­trary num­ber of links. If the sys­tems find any one of these con­di­tions, they stop the mes­sage from get­ting through by plac­ing into a mod­er­a­tion queue or mail folder, or by killing it out­right. The more advanced of such sys­tems oper­ate on a three-list principal:

Whitelists are known good domains and terms, and are usu­ally admin­is­tered manually–a human must delib­er­ately add to the list a con­di­tion that, if met, passes the mes­sage through with­out fur­ther challenge.

Blacklists are known bad domains and terms. These are the phar­ma­ceu­ti­cal, adult, and online game terms we all know and despise, and those (whom we despise even more) who ped­dle them through unso­licited e-mail and blog com­ments. Messages con­tain­ing domain names or terms on the Blacklist are stopped and held or deleted before delivery.

In between the two extremes of always deliver (Whitelists) and always stop (Blacklists) are Greylists. Greylists con­tain terms or con­di­tions that may be marks of spam­ming, but may also be innocuous–the eval­u­a­tion of which is too com­pli­cated for cur­rent tech­nol­ogy, and must be left to a human. Once a mes­sage meets Greylist con­di­tions, it is seg­re­gated from the rest of the e-mail or com­ments, and placed into a mod­er­a­tion queue or spe­cial folder for later human eval­u­a­tion. The e-mail admin­is­tra­tor or blog­ger will then man­u­ally enter the queue or folder and review the mes­sage con­tent to make a final deter­mi­na­tion of its fate.

Whitelist Spamming and Whitelist Attacks attempt to use the three-list sys­tem against itself by either slip­ping through on a Whitelist approval con­di­tion, or by caus­ing so many false pos­i­tives that denials and seg­re­ga­tion based on Blacklists and even Greylists become self-defeating and are abandoned.

Because of the con­tent and qual­ity of their mes­sages, spam­mers are often char­ac­ter­ized as une­d­u­cated, stu­pid, or ran­dom and dis­or­ga­nized. Nothing could be fur­ther from the truth.

Spamming is a prof­itable busi­ness, with annual global rev­enues mea­sured in bil­lions of dol­lars. While some spam­mers are the une­d­u­cated morons who believe every get-rich-quick scheme Carlton Sheets tries to sell them on late night tele­vi­sion, they are the not the ones from whom you will typ­i­cally receive spam. The major­ity of spam comes from large, excep­tion­ally orga­nized, and highly moti­vated syn­di­cates whose numer­ous crimes are grounded in the real world con­cerns of drugs, guns, and rack­e­teer­ing. Spam and spam-related activ­i­ties are merely one of their busi­ness inter­ests. These orga­ni­za­tions have vir­tu­ally unlim­ited fund­ing for research and devel­op­ment of new tech­niques and method­olo­gies to defeat anti-spam mea­sures, and they employ some very intel­li­gent peo­ple for that purpose.

Those who per­pe­trate Whitelist Attacks under­stand how com­put­ers, the Internet, and your mind oper­ate. They real­ize the lim­i­ta­tions of three-list anti-spam tech­niques, and, more to the point, they rec­og­nize that admin­is­tra­tors of such sys­tems are too busy to baby sit them. Whitelist spam­mers know that the more time they force us to man­u­ally scru­ti­nize our auto­mated White-, Grey-, and Blacklists, the less use­ful those lists become. Automated sys­tems only work for us so long as they remain auto­mated; the moment we per­ceive admin­is­tra­tion of those auto­mated sys­tems as becom­ing more labor-, time-, or mentally-intensive than our per­cep­tion of deal­ing with spam at the inbox phase, we will aban­don those auto­mated sys­tems entirely–thus open­ing the flood gates to spam once more.

As spam­mers well know, three-list fil­ter­ing is the most effec­tive and acces­si­ble anti-spam method­ol­ogy cur­rently avail­able. In the eyes of the pro­fes­sional spam indus­try, three-list fil­ter­ing on blogs and mail­boxes is the sin­gle largest imped­i­ment to grow­ing their bot­tom line. Beating it is their high­est pri­or­ity. With Whitelist Attacks–simply adding one more URL to their messages–they have indeed found an easy, effec­tive, and low-cost way of defeat­ing three-list spam filtering.

Someone needs to find a way to com­bat Whitelist Attacks–and they must do it swiftly. More advanced algo­rithms need to be devised, algo­rithms that eval­u­ate the style, struc­ture, and ver­biage of blog com­ments and e-mail mes­sages, but that also have the abil­ity to rec­og­nize and extract rep­utable domains. Global Whitelists must be cre­ated to pre­vent the auto­matic addi­tion of all domains ref­er­enced in a spam mes­sage from being added to Blacklists. If an eval­u­ated mes­sage con­tains adult-oriented text and a link to a domain that meets rule def­i­n­i­tions as being unde­sir­able, but just hap­pens to have a spoofed return address of Steve.​Jobs@​Apple.​com, the auto­mated fil­ters pro­tect­ing the mail­box need to be smart enough to add the spam domain to the Blacklist for future match­ing, but to not add Apple​.com to the Blacklist

Whitelist attack­ing is an inge­nious response by pro­fes­sional spam­mers to the most advanced anti-spam sys­tems cur­rently pro­tect­ing blogs and e-mail inboxes. It’s a method­ol­ogy that car­ries grave con­se­quences to hun­dreds of thou­sands of blog­gers, and whose effects will, if left unchecked, crip­ple the Blogosphere. More grave still, the reach and poten­tial dam­age of Whitelist Attacks hits e-mail fil­ter­ing sys­tems equally and threat­ens the Internet far, far beyond blogs.

Blog, Blogs, Blogging, Blogosphere, Spam, Spamming, Whitelist Attack, Whitelist Spam, Whitelist Spamming, Whitelist, Blacklist, Greylist, Graylist, Spam Karma, WordPress, Ant-Spam, Email, E-mail, Inbox, Spam Filter, Spam Fighting, Combating Spam

10 Responses to “WhiteList Spam Attacks Threaten Blogs and Email”

    Matthew Treder
    December 21st, 2005 at 16:15

    Am I being naïve to sug­gest that whitelist domains such as Apple​.com in your exam­ple above sim­ply be given “pro­tected” sta­tus? There are user-defined ways to iden­tify truly worth­while sites (StumbleUpon​.com being one pop­u­lar appli­ca­tion of the tech­nol­ogy) and sep­a­rate out the dri­vel, or worse.

    Pariah S. Burke
    December 21st, 2005 at 17:12

    Hi, Matthew.

    See, that’s just the prob­lem of whitelist spam: If you pro­tect Apple​.com, then any spam mes­sage that includes that domain would auto­mat­i­cally get through. That’s exactly what spam­mers are hop­ing for, which is why they’re includ­ing sites like Apple​.com in their mes­sages. Three-list anti-spam engines aren’t yet smart enough to fig­ure out what to do with a mes­sage con­tain­ing two URLs, one being unknown to it (Apple​.com) and the other known bad (a porn site, for exam­ple). In those cases, the anti-spam engine cre­ates an asso­ci­a­tion between the known bad and the unknown, decid­ing that the unknown must be had and there­fore should be blacklisted.

    While it is fea­si­ble for humans to go in and indi­vid­u­ally whitelist good domains, it’s totally unweildly to whitelist the mil­lions of respectable domains out there. While most sites will never get a frac­tion of those as blog com­ments or e-mail, there are still thou­sands of poten­tial domains from which desired mes­sages may come. One can­not whitelist them all, nor can one real­is­ti­cally inves­ti­gate every black­listed mes­sage or domain on a busy site.

    See the prob­lem now?

    Matthew Treder
    January 6th, 2006 at 20:02

    Yeah. And I def­i­nitely think you’re on to some­thing that’s not get­ting much ink in main­stream press, but prob­a­bly should be. (Or would that only make things expo­nen­tially worse?)

    Pariah S. Burke
    January 7th, 2006 at 01:28

    Spammers tend to share infor­ma­tion like any other pro­fes­sion. I don’t think main­stream press cov­er­age would exac­er­bate the prob­lem. It would, how­ever, get more peo­ple work­ing on a way to com­bat it.

    thinkcreation.net » Blog Archive » Whitelists in Spam Filters Might Become Spam’s Best Friend
    January 9th, 2006 at 09:25

    […] This is a threat that can be suc­cess­fully fought if aware­ness is raised in the net. I invite you to read the rest of his arti­cle to bet­ter under­stand this phenomenon. […]

    Tim
    April 10th, 2006 at 21:03

    I like your site.
    Blogs with com­ments win­dows that you have to click to open keep the dia­logue under wraps. Blogs like this one which string com­ments out in the open are much more proac­tive about spark­ing dialogue.

    derf
    July 7th, 2006 at 08:43

    See the Sender Policy Framework (SPF) at http://​www​.open​spf​.org. If the email says it’s from Apple​.com but the send­ing email server doesn’t match an allowed IP address for one of Apple’s listed email servers, it doesn’t get through. The SPF check needs to be before the whitelist check. It’s not per­fect, and sub­ject to a DNS attack, but it would make the spammer’s job more dif­fi­cult if every email domain had it implemented.

    Pozycjonowanie
    December 16th, 2006 at 06:18

    Someone else below asked this already about anti­spam scripts.
    I am get­ting nailed with Spam on my web­site mails and in our blog web­site — now its offline too

    much spam. Is there any­way to stop this? If not, there really isn’t any point in leav­ing it up

    and active. Any help will be greatly appreciated.

    Thanks for help, Keep up the good work. Greetings from Poland

    Coding Horror
    May 7th, 2007 at 17:00

    Whitelist, Blacklist, Greylist

    I recently got into a spir­ited dis­cus­sion about Akismet. What is Akismet? When a new com­ment, track­back, or ping­back comes to your blog it is sub­mit­ted to the Akismet web ser­vice which runs hun­dreds of tests on the comment…

    Web Marketing Services - Professional Internet Marketing by SEO Prodigy
    June 4th, 2009 at 22:32

    Blogs being spammed?…

    I hate it when peo­ple find ways to spam use­ful tools and con­tent. Blogs are immensely pop­u­lar these days, and word­press and blog­ger are just a few exam­ples of good free blog­ging inter­faces. Due to the pop­u­lar­ity of the blogs, Google and other search en…

Leave a Reply

Navigation

Latest Video

Upcoming Webinars

Testimonials

  • The actual demonstration of how to create epubs, information about which formats are used by which ereaders, and which fonts, I found most useful…I had not realized that it’s that easy to create a (simple) epub.
    —Margaret B., Editor
  • I am very glad I was able to find your seminar, because this workshop came in just in time for my career.
    —Michelle W., Freelance Designer
  • A good introduction to the ePublishing world. I am just starting my own freelance business and needed to see what the newest things are that I should get involved with and this is it!…I have had some offers to illustrate a children’s book and a book of poetry, This gives me a resource to make… children’s books and books of poetry… interactive and reach a whole other audience.
    —Kelly R.