Evolutionary Multi-objective Scheduling for Anti-Spam Filtering Throughput Optimization

This paper presents an evolutionary multi-objective optimization problem formulation for the anti-spam filtering problem, addressing both the classification quality criteria (False Positive and False Negative error rates) and email messages classification time (minimization). This approach is compared to single objective problem formulations found in the literature, and its advantages for decision support and flexible/adaptive anti-spam filtering configuration is demonstrated. A study is performed using the Wirebrush4SPAM framework anti-spam filtering and the SpamAssassin email dataset. The NSGA-II evolutionary multi-objective optimization algorithm was applied for the purpose of validating and demonstrating the adoption of this novel approach to the anti-spam filtering optimization problem, formulated from the multi-objective optimization perspective. The results obtained from the experiments demonstrated that this optimization strategy allows the decision maker (anti-spam filtering system administrator) to select among a set of optimal and flexible filter configuration alternatives with respect to classification quality and classification efficiency.

[1]  Florentino Fernández Riverola,et al.  Combining Scheduling Heuristics to Improve e-mail Filtering Throughput , 2015, DCAI.

[2]  Florentino Fernández Riverola,et al.  Effective scheduling strategies for boosting performance on rule-based spam filtering frameworks , 2013, J. Syst. Softw..

[3]  Michael T. M. Emmerich,et al.  A spam filtering multi-objective optimization study covering parsimony maximization and three-way classification , 2016, Appl. Soft Comput..

[4]  Florentino Fernández Riverola,et al.  Using new scheduling heuristics based on resource consumption information for increasing throughput on rule‐based spam filtering systems , 2016, Softw. Pract. Exp..

[5]  Licheng Jiao,et al.  Multiobjective optimization of classifiers by means of 3D convex-hull-based evolutionary algorithms , 2014, Inf. Sci..

[6]  Antonio J. Nebro,et al.  jMetal: A Java framework for multi-objective optimization , 2011, Adv. Eng. Softw..

[7]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[8]  Juan M. Corchado,et al.  Distributed Computing and Artificial Intelligence: 10th International Conference , 2013 .

[9]  David Beasley,et al.  Possible applications of evolutionary computation , 2018, Evolutionary Computation 1.

[10]  Florentino Fernández Riverola,et al.  Wirebrush4SPAM: a novel framework for improving efficiency on spam filtering services , 2013, Softw. Pract. Exp..

[11]  Eduardo Díaz,et al.  Grindstone4Spam: An optimization toolkit for boosting e-mail classification , 2012, J. Syst. Softw..

[12]  Iryna Yevseyeva,et al.  Optimising anti-spam filters with evolutionary algorithms , 2013, Expert Syst. Appl..

[13]  D. Harman,et al.  Text Retrieval Conference and Message Understanding Conference , 2006 .

[14]  Florentino Fernández Riverola,et al.  RuleSIM: a toolkit for simulating the operation and improving throughput of rule‐based spam filters , 2016, Softw. Pract. Exp..