Wirebrush4SPAM: a novel framework for improving efficiency on spam filtering services

This paper introduces Wirebrush4SPAM, a plug‐in‐based C framework specifically designed for the development of fast spam filters by assembling different antispam schemes and techniques. Wirebrush4SPAM can be used to (i) build, execute and deploy simple spam filters and (ii) develop new techniques that can be easily combined and tested to achieve more accurate antispam models. To construct custom filters, programmers should manage three key concepts: filtering functions, parsers and event listeners. The main features of Wirebrush4SPAM include (i) a plug‐in‐based design, (ii) cache support for developing new plug‐ins, (iii) a smart filter evaluation heuristic for improving filter execution, (iv) configurable rule scheduling and (v) support for domain specific rules. Moreover, Wirebrush4SPAM is 10 times faster than SpamAssassin, which stands for the most popular and highly extensible framework for spam filtering. Wirebrush4SPAM is an open‐source project licensed under the terms of GNU lesser general public license and both source code and documentation are publicly available at http://www.wb4spam.org/. Copyright © 2012 John Wiley & Sons, Ltd.

[1]  Mathieu Fourment,et al.  A comparison of common programming languages used in bioinformatics , 2008, BMC Bioinformatics.

[2]  Kurt Fanning,et al.  Are you safe from son of spam , 2005 .

[3]  Yang Song,et al.  Better Naive Bayes classification for high‐precision spam detection , 2009, Softw. Pract. Exp..

[4]  Adam J. O'Donnell A Reputation-Based Approach for Effi cient Filtration of Spam , 2005 .

[5]  John R. Levine DNS Blacklists and Whitelists , 2010, RFC.

[6]  Meng Weng Wong,et al.  Sender Policy Framework (SPF) for Authorizing Use of Domains in E-Mail, Version 1 , 2006, RFC.

[7]  Avinoam Kolodny,et al.  Thread allocation directed by performance-power tradeoff in NoC-based CMPs , 2010, 2010 IEEE 26-th Convention of Electrical and Electronics Engineers in Israel.

[8]  Dongping Xu Performance Study and Dynamic Optimization Design for Thread Pool Systems , 2004 .

[9]  Juan M. Corchado,et al.  Managing irrelevant knowledge in CBR models for unsolicited e-mail classification , 2009, Expert Syst. Appl..

[10]  Juan M. Corchado,et al.  SpamHunting: An instance-based reasoning system for spam labelling and filtering , 2007, Decis. Support Syst..

[11]  Yang Song,et al.  Better Naive Bayes classification for high-precision spam detection , 2009 .

[12]  Ralph E. Johnson,et al.  Evolution of the MTA architecture: the impact of security , 2008, Softw. Pract. Exp..

[13]  Ralph Johnson,et al.  Evolution of the MTA architecture: the impact of security , 2008 .

[14]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[15]  Roberto Battiti,et al.  "May I borrow your filter?" Exchanging filters to combat spam in a community , 2006, 20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06).