Effective Phishing Website Detection Based on Improved BP Neural Network and Dual Feature Evaluation

Nowadays, phishing poses a big threat to people's daily network environment. By phishing, attackers obtain the network users private information by inducing them to open illegal websites. Due to the active learning ability and preferable classifying ability for many datasets, BP neural network is an important heuristic machine learning method in phishing websites detection and prevention. However, improper selection of initial parameters, such as the initial weight and threshold, will induce the BP neural network into local minimum and slow learning convergence. Aiming at these problems, this paper proposes DF.GWO-BPNN, an effective phishing website detection model based on the improved BP neural network and dual feature evaluation mechanism. Under this model, the grey wolf algorithm is firstly used to optimize the BP neural network to reasonably select initial parameters. Then, the dual feature mechanism is used to evaluate the results of the improved BP neural network. By the dual feature evaluation mechanism, the accuracy of phishing website recognition is improved. Meanwhile, the black and white list is used to improve the efficiency of the proposed model. The DF.GWO-BPNN model is compared with some existing phishing website detection models. The experimental results have demonstrated that our model is accurate and strong adaptability.

[1]  Crina Grosan,et al.  Feature Subset Selection Approach by Gray-Wolf Optimization , 2014, AECIA.

[2]  Doaa Hassan,et al.  On Determining the Most Effective Subset of Features for Detecting Phishing Websites , 2015 .

[3]  Javier Vargas,et al.  Knowing your enemies: leveraging data analysis to expose phishing patterns against a major US financial institution , 2016, 2016 APWG Symposium on Electronic Crime Research (eCrime).

[4]  Carolyn Penstein Rosé,et al.  CANTINA+: A Feature-Rich Machine Learning Framework for Detecting Phishing Web Sites , 2011, TSEC.

[5]  A. Sardana,et al.  A PageRank based detection technique for phishing web sites , 2012, 2012 IEEE Symposium on Computers & Informatics (ISCI).

[6]  G. M. Komaki,et al.  Grey Wolf Optimizer algorithm for the two-stage assembly flow shop scheduling problem with release time , 2015, J. Comput. Sci..

[7]  Huajun Huang,et al.  A SVM-based Technique to Detect Phishing URLs , 2012 .

[8]  Seyed Mohammad Mirjalili How effective is the Grey Wolf optimizer in training multi-layer perceptrons , 2014, Applied Intelligence.

[9]  Shafique Ahmad Chaudhry,et al.  Phishing Attacks and Defenses , 2016 .

[10]  Peter Cassidy Statement of Support Anti-Phishing Working Group (APWG) , 2006, J. Digit. Forensic Pract..

[11]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[12]  Ba Lam To,et al.  An efficient approach for phishing detection using single-layer neural network , 2014, 2014 International Conference on Advanced Technologies for Communications (ATC 2014).

[13]  Mona Ghotaish Alkhozae,et al.  Phishing Websites Detection based on Phishing Characteristics in the Webpage Source Code , 2011 .

[14]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation (3rd Edition) , 2007 .

[15]  Roman Z. Morawski,et al.  Application of naïve Bayes classifier in fall detection systems based on infrared depth sensors , 2015, 2015 IEEE 8th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS).

[16]  Lin Ling,et al.  Survey on the research of focused crawling technique , 2005 .

[17]  Wei Zhang,et al.  Phishing Detection Research Based on PSO-BP Neural Network , 2018, EIDWT.

[18]  Seyed Mohammad Mirjalili,et al.  Evolutionary population dynamics and grey wolf optimizer , 2015, Neural Computing and Applications.

[19]  Wen Jin,et al.  The improvements of BP neural network learning algorithm , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.