Detecting Phishing E-mails Using Text Mining and Features Analysis

Phishing e-mails are used by malicious actors with the aim of obtaining sensitive information from a victim, deceiving or blackmailing them. An inattentive or uninformed user may often fail to recognise if an e-mail is sent by an authentic sender or is a scam. We therefore sought to develop a method that can effectively and efficiently detect phishing e-mails and report them to the user. We analyse all the information available on receipt of the e-mail both statically and performing text mining on the content and subject of the e-mail. In addition to indicating weather e-mails are suspicious, the degree of accuracy with which the above statement is made is also reported, and the aspects of the e-mail that are characteristic of a phishing e-mail are highlighted. Excellent results were achieved with our methodology, reaching 99.2% accuracy.

[1]  Gerhard Paass,et al.  Improved Phishing Detection using Model-Based Features , 2008, CEAS.

[2]  B. B. Gupta,et al.  A Survey of Phishing Email Filtering Techniques , 2013, IEEE Communications Surveys & Tutorials.

[3]  Fadi A. Thabtah,et al.  Intelligent phishing detection system for e-banking using fuzzy data mining , 2010, Expert Syst. Appl..

[4]  Sadia Afroz,et al.  PhishZoo : An Automated Web Phishing Detection Approach Based on Profiling and Fuzzy Matching , 2009 .

[5]  Vimala Balakrishnan,et al.  Stemming and lemmatization: A comparison of retrieval performances , 2014 .

[6]  Vadlamani Ravi,et al.  Detecting phishing e-mails using text and data mining , 2012, 2012 IEEE International Conference on Computational Intelligence and Computing Research.

[7]  Mrutyunjaya Panda,et al.  Developing an Efficient Text Pre-Processing Method with Sparse Generative Naive Bayes for Text Mining , 2018, International Journal of Modern Education and Computer Science.

[8]  P. Alam ‘E’ , 2021, Composites Engineering: An A–Z Guide.

[9]  P. Alam ‘S’ , 2021, Composites Engineering: An A–Z Guide.

[10]  Masoumeh Zareapoor,et al.  Text Mining for Phishing E-mail Detection , 2015 .

[11]  Sureswaran Ramadass,et al.  Evolving Fuzzy Neural Network for Phishing Emails Detection , 2012 .

[12]  Youssef Iraqi,et al.  Phishing Detection: A Literature Survey , 2013, IEEE Communications Surveys & Tutorials.

[13]  Ian Harris,et al.  Detecting Phishing Attacks Using Natural Language Processing and Machine Learning , 2018, 2018 IEEE 12th International Conference on Semantic Computing (ICSC).

[14]  Lorenzo Cavallaro,et al.  On the Dissection of Evasive Malware , 2020, IEEE Transactions on Information Forensics and Security.

[15]  Andrew H. Sung,et al.  Detection of Phishing Attacks: A Machine Learning Approach , 2008, Soft Computing Applications in Industry.

[16]  Jason R. C. Nurse,et al.  Catching the Phish: Detecting Phishing Attacks using Recurrent Neural Networks (RNNs) , 2019, WISA.

[17]  Mayank Pandey,et al.  Text and Data Mining to Detect Phishing Websites and Spam Emails , 2013, SEMCCO.