SOCIAL: Self-Organizing ClassIfier ensemble for Adversarial Learning

Pattern recognition techniques are often used in environments (called adversarial environments) where adversaries can consciously act to limit or prevent accurate recognition performance. This can be obtained, for example, by changing labels of training data in a malicious way. While Multiple Classifier Systems (MCS) are currently used in several security applications, like intrusion detection in computer networks and spam filtering, there are very few MCS proposals that explicitly address the problem of learning in adversarial environments. In this paper we propose a general algorithm based on a multiple classifier approach to find out and clean mislabeled training samples. We will report several experiments to verify the robustness of the proposed approach to the presence of possible mislabeled samples. In particular, we will show that the performance obtained with a simple classifier trained on the training set “cleaned” by our algorithm is comparable and even better than those obtained by some state-of-the-art MCS trained on the original datasets.

[1]  Fabio Roli,et al.  Spam Filtering Based On The Analysis Of Text Information Embedded Into Images , 2006, J. Mach. Learn. Res..

[2]  Cristina L. Abad,et al.  An Analysis on the Schemes for Detecting and Preventing ARP Cache Poisoning Attacks , 2007, 27th International Conference on Distributed Computing Systems Workshops (ICDCSW'07).

[3]  Giorgio Valentini,et al.  Applications of Supervised and Unsupervised Ensemble Methods , 2009, Applications of Supervised and Unsupervised Ensemble Methods.

[4]  Carlo Sansone,et al.  Network Protocol Verification by a Classifier Selection Ensemble , 2009, MCS.

[5]  Fabio Roli,et al.  Multiple Classifier Systems for Adversarial Classification Tasks , 2009, MCS.

[6]  Nikunj C. Oza,et al.  AveBoost2: Boosting for Noisy Data , 2004, Multiple Classifier Systems.

[7]  Fabio Roli,et al.  Information fusion for computer security: State of the art and open issues , 2009, Inf. Fusion.

[8]  P. Smets,et al.  The measure of the degree of truth and the grade of membership , 1988 .

[9]  Fabio Roli,et al.  Multiple Classifier Systems, 9th International Workshop, MCS 2010, Cairo, Egypt, April 7-9, 2010. Proceedings , 2010, MCS.

[10]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[11]  Antonio Picariello,et al.  A Personal Antispam System Based on a Behaviour-Knowledge Space Approach , 2009, Applications of Supervised and Unsupervised Ensemble Methods.

[12]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[13]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[14]  Raymond J. Mooney,et al.  Diverse ensembles for active learning , 2004, ICML.

[15]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[16]  Fabio Roli,et al.  Adversarial Pattern Classification Using Multiple Classifiers and Randomisation , 2008, SSPR/SPR.

[17]  Blaine Nelson,et al.  Can machine learning be secure? , 2006, ASIACCS '06.

[18]  Christian Thiel,et al.  Classification on Soft Labels Is Robust against Label Noise , 2008, KES.

[19]  Ludmila I. Kuncheva,et al.  Classifier Ensembles for Changing Environments , 2004, Multiple Classifier Systems.

[20]  Fabio Roli,et al.  Image Spam Filtering Using Visual Information , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[21]  Edwin R. Hancock,et al.  Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop, SSPR&SPR 2010, Cesme, Izmir, Turkey, August 18-20, 2010. Proceedings , 2010, SSPR/SPR.