Spam filtering email classification (SFECM) using gain and graph mining algorithm

This paper proposes a hybrid solution of spam email classifier using context based email classification model as main algorithm complimented by information gain calculation to increase spam classification accuracy. Proposed solution consists of three stages email pre-processing, feature extraction and email classification. Research has found that LingerIG spam filter is highly effective at separating spam emails from cluster of homogenous work emails. Also experiment result proved the accuracy of spam filtering is 100% as recorded by the team of developers at University of Sydney. The study has shown that implementing the spam filter in the context-based email classification model is feasible. Experiment of the study has confirmed that spam filtering aspect of context-based classification model can be improved.

[1]  Dennis McLeod,et al.  Spam Email Classification using an Adaptive Ontology , 2007, J. Softw..

[2]  Taiwo Ayodele,et al.  Email classification using back propagation technique , 2010 .

[3]  D. Patil,et al.  A CLUSTERING TECHNIQUE FOR EMAIL CONTENT MINING , 2015 .

[4]  Sharma Chakravarthy,et al.  eMailSift: eMail classification based on structure and content , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[5]  Sharma Chakravarthy,et al.  A Graph-Based Approach for Multi-folder Email Classification , 2010, 2010 IEEE International Conference on Data Mining.

[6]  Izzat Alsmadi,et al.  Clustering and classification of email contents , 2015, J. King Saud Univ. Comput. Inf. Sci..

[7]  Zubair Ahmed Shaikh,et al.  Context‐based email classification model , 2016, Expert Syst. J. Knowl. Eng..

[8]  Jason D. M. Rennie ifile: An Application of Machine Learning to E-Mail Filtering , 2000 .

[9]  Kazem Taghva,et al.  Ontology-based classification of email , 2003, Proceedings ITCC 2003. International Conference on Information Technology: Coding and Computing.

[10]  Irena Koprinska,et al.  LINGER – A SMART PERSONAL ASSISTANT FOR E-MAIL CLASSIFICATION , 2003 .

[11]  Sabah Sayed,et al.  Three-Phase Tournament-Based Method for Better Email Classification , 2012 .

[12]  Debzani Deb,et al.  A Trainable Fuzzy Spam Detection System , 2004 .