Features Weight Estimation Using a Genetic Algorithm for Customer Churn Prediction in the Telecom Sector

The high dimensional dataset results in more noise, will require more computations, has huge sparsity linked with high dimensional features and has thus introduced great challenges in data analysis. To efficiently manipulate and address the impact of the challenges faced by high dimensional dataset, researchers used several features reduction methods. Feature reduction is a formidable step, when dealing with improving the accuracy and reducing the processing time, within high-dimensional data; wherein the feature set is reduced before applying data mining or statistical methods. However with attribute reduction, there is a high chance of loss of important information. In order to avoid information loss, one way is to assign weights to the attributes through domain-expert which is a subjective exercise. It is not only costly but also requires human-expert of the field. Therefore, there is a need for a technique to automatically assign more appropriate weights without involving domain expert. This paper presents a novel features weighting technique. The technique employs a genetic algorithm (GA) to automatically assign weights to the attributes based on Naive Bayes (NB) classification. Experiments have been conducted on publically available dataset to compare the performance of the proposed approach and NB approach without the weighted features for predicting customer churn in telecommunication sector. The experimental results have demonstrated that the proposed technique outperformed through achieving an overall 89.1% accuracy, 95.65% precision which shows the effectiveness of the proposed technique.

[1]  Hu Yanrong,et al.  An evaluating method with combined assigning-weight based on maximizing variance , 2015 .

[2]  Bernhard Pfahringer,et al.  Locally Weighted Naive Bayes , 2002, UAI.

[3]  Harry Zhang,et al.  Learning weighted naive Bayes with accurate ranking , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[4]  Amir Hussain,et al.  Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study , 2016, IEEE Access.

[5]  Zhihua Cai,et al.  Attribute Weighting via Differential Evolution Algorithm for Attribute Weighted Naive Bayes (WNB) , 2011 .

[6]  Chi-Hyuck Jun,et al.  Improved churn prediction in telecommunication industry by analyzing a large network , 2014, Expert Syst. Appl..

[7]  Abdelkamel Tari,et al.  Dimensionality reduction in data mining: A Copula approach , 2016, Expert Syst. Appl..

[8]  I K Fodor,et al.  A Survey of Dimension Reduction Techniques , 2002 .

[9]  Sanghyuk Lee,et al.  Information Analysis of High-Dimensional Data and Applications , 2015 .

[10]  Liangxiao Jiang,et al.  Two feature weighting approaches for naive Bayes text classifiers , 2016, Knowl. Based Syst..

[11]  Mark Hall A Decision Tree-Based Attribute Weighting Filter for Naive Bayes , 2006, SGAI Conf..

[12]  Shasha Wang,et al.  Deep feature weighting for naive Bayes and its application to text classification , 2016, Eng. Appl. Artif. Intell..

[13]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[14]  Kriti Saroha,et al.  Study of dimension reduction methodologies in data mining , 2015, International Conference on Computing, Communication & Automation.

[15]  Ramiro Gonçalves,et al.  How smartphone advertising influences consumers' purchase intention , 2019, Journal of Business Research.

[16]  Guanghui Sun,et al.  New Developments in Sliding Mode Control and Its Applications , 2014 .

[17]  Thar Baker,et al.  Just-in-time Customer Churn Prediction: With and Without Data Transformation , 2018, 2018 IEEE Congress on Evolutionary Computation (CEC).

[18]  Geoffrey I. Webb,et al.  Alleviating naive Bayes attribute independence assumption by attribute weighting , 2013, J. Mach. Learn. Res..

[19]  E. Sivasankar,et al.  Computing efficient features using rough set theory combined with ensemble classification techniques to improve the customer churn prediction in telecommunication sector , 2018, Computing.

[20]  Xin Yao,et al.  Using Class Imbalance Learning for Software Defect Prediction , 2013, IEEE Transactions on Reliability.

[21]  Kaizhu Huang,et al.  Customer churn prediction in the telecommunication sector using a rough set approach , 2017, Neurocomputing.

[22]  Adnan Amin,et al.  A Prudent Based Approach for Customer Churn Prediction , 2015, BDAS.

[23]  Zhuming Bi,et al.  Feature weighted naïve Bayes algorithm for information retrieval of enterprise systems , 2014, Enterp. Inf. Syst..