Using Attribute-based Feature Selection Approaches and Machine Learning Algorithms for Detecting Fraudulent Website URLs

Phishing is a malicious form of online theft and needs to be prevented in order to increase the overall trust of the public on the Internet. In this study, for that purpose, the authors present their findings on the methods of detecting phishing websites. Data mining algorithms along with classifier algorithms are used in order to achieve a satisfactory result. In terms of classifiers, the Naïve Bayes, SMO, and J48 algorithms are used. As for the feature selection algorithm; Gain Ratio Attribute and ReliefF Attribute are selected. The results are provided in a comparative way. Accordingly; SMO and J48 algorithms provided satisfactory results in the detection of phishing websites, however, Naïve Bayes performed poor and is the least recommended method among all.

[1]  Carolyn Penstein Rosé,et al.  CANTINA+: A Feature-Rich Machine Learning Framework for Detecting Phishing Web Sites , 2011, TSEC.

[2]  Nauman Aslam,et al.  Intelligent phishing detection and protection scheme for online transactions , 2013, Expert Syst. Appl..

[3]  V. Prasanna Venkatesan,et al.  A Framework for Predicting Phishing Websites using Neural Networks , 2011, ArXiv.

[4]  Adel Khelifi,et al.  Phishing Detection Plug-In Toolbar Using Intelligent Fuzzy-Classification Mining Techniques , 2013 .

[5]  M. S. Vijaya,et al.  Efficient prediction of phishing websites using supervised learning algorithms , 2012 .

[6]  Nazife Baykal,et al.  Feature extraction and classification phishing websites based on URL , 2015, 2015 IEEE Conference on Communications and Network Security (CNS).

[7]  Fadi A. Thabtah,et al.  Intelligent phishing detection system for e-banking using fuzzy data mining , 2010, Expert Syst. Appl..

[8]  Nuno Pereira,et al.  A Framework for Web Application Integrity , 2018, ICISSP.

[9]  Max-Emanuel Maurer,et al.  Sophisticated Phishers Make More Spelling Mistakes: Using URL Similarity against Phishing , 2012, CSS.

[10]  T. Balamuralikrishna,et al.  Mitigating Online Fraud by Ant phishing Model with URL & Image based Webpage Matching , 2012 .

[11]  Ismail Butun Privacy and trust relations in Internet of Things from the user point of view , 2017, 2017 IEEE 7th Annual Computing and Communication Workshop and Conference (CCWC).

[12]  Mona Ghotaish Alkhozae,et al.  Phishing Websites Detection based on Phishing Characteristics in the Webpage Source Code , 2011 .

[13]  Andrew H. Sung,et al.  Rule-Based Phishing Attack Detection , 2011 .

[14]  Mohammad Zulkernine,et al.  Information Source-Based Classification of Automatic Phishing Website Detectors , 2011, 2011 IEEE/IPSJ International Symposium on Applications and the Internet.