Feature selection using firefly algorithm in software defect prediction

Defects occurring in software product are a universal event. Prevention of these defects in the early stage needs more attention because early stage prevention and fixing requires less effort and lower cost. Software defect prediction (SDP) is necessary in the determination of software quality as well as reliability. Prediction of defects is relatively an original research area in software quality engineering. Coverage of key predictors and the kind of data to be collected along with defect prediction model role, the interdependence of defects and predictors can be recognized in software quality. Feature selection (FS) is one of the worthy preprocessing techniques for application that uses huge volumes of data. It is the process of selecting the probable minimal attribute which is expected to be represented in the set of actual attributes. This paper proposes, FS using firefly algorithm (FA) and classifiers like support vector machine (SVM), Naïve Bayes (NB) as well as K-nearest neighbor (KNN) are used for classifying the features selected. The FS that make use of the FA is that new technique of evolutionary computation that has been inspired by the process of flash lighting of the fireflies. This can search quickly the feature space for an optimal or a near optimal feature subset for minimizing a certain function of fitness. This proposed fitness function has made use of the incorporation of both the accuracy of classification and the reduction of the size. The results of the experiment have shown that the FS using the FA can achieve a better accuracy of classification than that of the other methods.

[1]  P. Selvaraj,et al.  Support Vector Machine for Software Defect Prediction , 2013 .

[2]  Jongmoon Baik,et al.  A Hybrid Instance Selection Using Nearest-Neighbor for Cross-Project Defect Prediction , 2015, Journal of Computer Science and Technology.

[3]  Xin Yao,et al.  A Learning-to-Rank Approach to Software Defect Prediction , 2015, IEEE Transactions on Reliability.

[4]  Ying Ma,et al.  On Software Defect Prediction Using Machine Learning , 2014, J. Appl. Math..

[5]  Sanjay Kumar Dubey,et al.  Software Defect Prediction Models for Quality Improvement : A Literature Study , 2012 .

[6]  Shrish Verma,et al.  A Comparative Study of Bug Classification Algorithms , 2014, Int. J. Softw. Eng. Knowl. Eng..

[7]  Haifeng Li,et al.  Software Defect Prediction Using Dynamic Support Vector Machine , 2013, 2013 Ninth International Conference on Computational Intelligence and Security.

[8]  Kuinam J. Kim,et al.  A feature selection approach to find optimal feature subsets for the network intrusion detection system , 2015, Cluster Computing.

[9]  A. Sudha,et al.  Software Defect Prediction System using Multilayer Perceptron Neural Network with Data Mining , 2014 .

[10]  Aboul Ella Hassanien,et al.  Firefly Optimization Algorithm for Feature Selection , 2015, BCI.

[11]  Shashank Dixit,et al.  Layered Approach to Classify Web Pages using Firefly Feature Selection by Support Vector Machine (SVM) , 2015 .

[12]  Ali Selamat,et al.  A survey on software fault detection based on different prediction approaches , 2014, Vietnam Journal of Computer Science.

[13]  Pradeep Kumar Singh,et al.  An Insight to Soft Computing based Defect Prediction Techniques in Software , 2015 .

[14]  Anuradha Chug,et al.  Software Defect Prediction Using Supervised Learning Algorithm and Unsupervised Learning Algorithm , 2013 .

[15]  Divya Tomar,et al.  A Feature Selection Based Model for Software Defect Prediction , 2014 .

[16]  Satvir Singh,et al.  The Firefly Optimization Algorithm: Convergence Analysis and Parameter Selection , 2013 .

[17]  Arti Arya,et al.  A Study on Software Metrics based Software Defect Prediction using Data Mining and Machine Learning Techniques , 2015 .

[18]  Tim Menzies,et al.  The \{PROMISE\} Repository of Software Engineering Databases. , 2005 .

[19]  John Yearwood,et al.  A parallel framework for software defect detection and metric selection on cloud computing , 2017, Cluster Computing.

[20]  Hema Banati,et al.  Fire Fly Based Feature Selection Approach , 2011 .

[21]  M. SURENDRA NAIDU,et al.  CLASSIFICATION OF DEFECTS IN SOFTWARE USING DECISION TREE ALGORITHM , 2013 .

[22]  Hossam Faris,et al.  Optimizing Software Effort Estimation Models Using Firefly Algorithm , 2015, ArXiv.

[23]  Li Juelong,et al.  A new model for software defect prediction using Particle Swarm Optimization and support vector machine , 2013, 2013 25th Chinese Control and Decision Conference (CCDC).