Paper information - An Integrated Approach to Filtering Phishing E-mails

This paper presents a system that classifies e-mails into two categories, legitimate and fraudulent. The classifier applies three filters in series: a Bayesian filter that classifies the textual content of e-mails, a rule-based filter that classifies their non-grammatical content and, finally, a filter based on an emulator of fictitious accesses, which classifies the responses from websites referenced by links contained in the e-mails. The approach is hybrid, because it uses different classification methods, and integrated, because it takes into account all kinds of data and information contained in e-mails.
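The serial arrangement of the three filters can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the filters hand over to one another, so the sketch assumes each filter returns a verdict or `None` (undecided) and that the first decisive verdict wins. The names `Email`, `NaiveBayesTextFilter`, `rule_filter`, `make_response_filter`, `classify`, the raw-IP rule, the `"password"` check, the `margin` threshold and the `fetch` callable are all hypothetical stand-ins chosen for illustration.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable, List, Optional

LABELS = ("legitimate", "fraudulent")


@dataclass
class Email:
    text: str
    links: List[str] = field(default_factory=list)


def tokenize(text: str) -> List[str]:
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayesTextFilter:
    """Filter 1 (assumed form): multinomial naive Bayes over the e-mail text."""

    def __init__(self, margin: float = 1.0):
        self.counts = {label: Counter() for label in LABELS}
        self.docs = Counter()
        self.margin = margin  # hypothetical log-odds needed to commit to a verdict

    def train(self, text: str, label: str) -> None:
        self.counts[label].update(tokenize(text))
        self.docs[label] += 1

    def log_score(self, text: str, label: str) -> float:
        vocab = set(self.counts["legitimate"]) | set(self.counts["fraudulent"])
        total = sum(self.counts[label].values())
        score = math.log(self.docs[label] / sum(self.docs.values()))
        for tok in tokenize(text):
            # Laplace smoothing keeps unseen words from zeroing the product.
            score += math.log((self.counts[label][tok] + 1) / (total + len(vocab)))
        return score

    def __call__(self, email: Email) -> Optional[str]:
        diff = self.log_score(email.text, "fraudulent") - self.log_score(email.text, "legitimate")
        if diff > self.margin:
            return "fraudulent"
        if diff < -self.margin:
            return "legitimate"
        return None  # undecided: defer to the next filter


def rule_filter(email: Email) -> Optional[str]:
    """Filter 2 (assumed rule): flag links that point at a raw IP address."""
    for url in email.links:
        if re.match(r"https?://\d{1,3}(\.\d{1,3}){3}", url):
            return "fraudulent"
    return None


def make_response_filter(fetch: Callable[[str], str]) -> Callable[[Email], Optional[str]]:
    """Filter 3 (assumed form): inspect what a linked site answers to a fictitious access.

    `fetch` is a hypothetical callable that performs the access and returns the response text.
    """
    def response_filter(email: Email) -> Optional[str]:
        for url in email.links:
            if "password" in fetch(url).lower():
                return "fraudulent"
        return None
    return response_filter


def classify(email: Email, filters, default: str = "legitimate") -> str:
    """Apply the filters in series; the first decisive verdict wins (an assumption)."""
    for f in filters:
        verdict = f(email)
        if verdict is not None:
            return verdict
    return default
```

In use, the Bayesian filter would be trained on labelled e-mails, and the three filters passed to `classify` in order, for example `classify(email, [bayes, rule_filter, make_response_filter(fetch)])`. Injecting `fetch` keeps the sketch runnable without network access; a real emulator of fictitious accesses would be considerably more involved.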
