Semantic Graph Neural Network: A Conversion from Spam Email Classification to Graph Classification

In this study, we propose a method named Semantic Graph Neural Network (SGNN) to address the challenging task of email classification. The method converts email classification into a graph classification problem: each email is projected into a graph, and the SGNN model classifies that graph. Because the email features are generated from the semantic graph, there is no need to embed the words into numerical vector representations. Experiments on several public datasets show that the method achieves high accuracy in email classification. On spam classification, it outperforms the state-of-the-art deep learning-based method.
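To make the idea of projecting an email into a graph concrete, the following is a minimal sketch under stated assumptions. The abstract does not specify how the semantic graph is built, so this example assumes a simple word co-occurrence graph: words become nodes, and two words are linked when they appear within a small sliding window of each other. The function name `email_to_graph` and the `window` parameter are hypothetical and used only for illustration; this is not the SGNN construction itself.

```python
import re
from collections import defaultdict


def email_to_graph(text, window=2):
    """Build an undirected word co-occurrence graph from raw email text.

    Assumption: nodes are lowercase word tokens and an edge links two
    different words that occur within `window` positions of each other.
    Edge weights count how often each pair co-occurs.
    """
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    edges = defaultdict(int)
    for i, word in enumerate(tokens):
        # Look ahead at the next `window` tokens only, so each
        # co-occurrence is counted once.
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                edges[tuple(sorted((word, other)))] += 1
    return set(tokens), dict(edges)


nodes, edges = email_to_graph("Win a free prize now, claim your free prize")
```

Here `nodes` is the set of distinct words and `edges` maps each alphabetically sorted word pair to its co-occurrence count. A graph classifier would then take such a structure as input in place of a sequence of word embeddings.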
