Spam Filter Analysis

Unsolicited bulk email (aka. spam) is a major problem on the Internet. To counter spam, several techniques, ranging from spam filters to mail protocol extensions like hashcash, have been proposed. In this paper we investigate the effectiveness of several spam filtering techniques and technologies. Our analysis was performed by simulating email traffic under different conditions. We show that genetic algorithm based spam filters perform best at server level and naive Bayesian filters are the most appropriate for filtering at user level.

[1]  Georgios Paliouras,et al.  Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach , 2000, ArXiv.

[2]  Georgios Paliouras,et al.  An evaluation of Naive Bayesian anti-spam filtering , 2000, ArXiv.

[3]  Serge Gauthronet,et al.  Unsolicited commercial communications and data protection , 2001 .

[4]  M. Angela Sasse,et al.  Successful multiparty audio communication over the Internet , 1998, CACM.

[5]  Markus Jakobsson,et al.  Curbing Junk E-Mail via Secure Classification , 1998, Financial Cryptography.

[6]  Jon Postel,et al.  On the junk mail problem , 1975, RFC.

[7]  Simson L. Garfinkel,et al.  Stopping spam - stamping out unwanted email and news postings , 1998 .

[8]  Shane Hird Technical Solutions for Controlling Spam , 2002 .

[9]  Gary Robinson,et al.  A statistical approach to the spam problem , 2003 .

[10]  Susan T. Dumais,et al.  A Bayesian Approach to Filtering Junk E-Mail , 1998, AAAI 1998.

[11]  Ted Wobber,et al.  Moderately hard, memory-bound functions , 2005, TOIT.

[12]  Frank E. Grubbs,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[13]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[14]  Sally Hambridge,et al.  DON'T SPEW A Set of Guidelines for Mass Unsolicited Mailings and Postings (spam*) , 1999, RFC.

[15]  Robert J. Hall,et al.  Channels: Avoiding unwanted electronic mail , 1996, Network Threats.

[16]  Gunnar Lindberg,et al.  Anti-Spam Recommendations for SMTP MTAs , 1999, RFC.

[17]  Moni Naor,et al.  Pricing via Processing or Combatting Junk Mail , 1992, CRYPTO.

[18]  Graham Chapman,et al.  Monty Python's Flying Circus: Just the Words , 1989 .