Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods

Abstract Cross-Company Churn Prediction (CCCP) is a domain of research where one company (target) is lacking enough data and can use data from another company (source) to predict customer churn successfully. To support CCCP, the cross-company data is usually transformed to a set of similar normal distribution of target company data prior to building a CCCP model. However, it is still unclear which data transformation method is most effective in CCCP. Also, the impact of data transformation methods on CCCP model performance using different classifiers have not been comprehensively explored in the telecommunication sector. In this study, we devised a model for CCCP using data transformation methods (i.e., log, z-score, rank and box-cox) and presented not only an extensive comparison to validate the impact of these transformation methods in CCCP, but also evaluated the performance of underlying baseline classifiers (i.e., Naive Bayes (NB), K-Nearest Neighbour (KNN), Gradient Boosted Tree (GBT), Single Rule Induction (SRI) and Deep learner Neural net (DP)) for customer churn prediction in telecommunication sector using the above mentioned data transformation methods. We performed experiments on publicly available datasets related to the telecommunication sector. The results demonstrated that most of the data transformation methods (e.g., log, rank, and box-cox) improve the performance of CCCP significantly. However, the Z-Score data transformation method could not achieve better results as compared to the rest of the data transformation methods in this study. Moreover, it is also investigated that the CCCP model based on NB outperform on transformed data and DP, KNN and GBT performed on the average, while SRI classifier did not show significant results in term of the commonly used evaluation measures (i.e., probability of detection, probability of false alarm, area under the curve and g-mean).

[1]  Yong Li,et al.  Evaluating Data Filter on Cross-Project Defect Prediction: Comparison and Improvements , 2017, IEEE Access.

[2]  Xin Yao,et al.  Using Class Imbalance Learning for Software Defect Prediction , 2013, IEEE Transactions on Reliability.

[3]  Win Win Myo,et al.  Customer Churn Analysis in Banking Sector , 2020 .

[4]  Guandong Xu,et al.  Customer Churn Prediction in Superannuation: A Sequential Pattern Mining Approach , 2018, ADC.

[5]  Koen W. De Bock,et al.  Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models , 2012, Expert Syst. Appl..

[6]  Stefan Gheorghe Pentiuc,et al.  Data Dimensionality Reduction for Data Mining: A Combined Filter-Wrapper Framework , 2014, Int. J. Comput. Commun. Control.

[7]  ZhangHongyu,et al.  Comments on "Data Mining Static Code Attributes to Learn Defect Predictors" , 2007 .

[8]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[9]  Bojana Dalbelo Basic,et al.  Churn Prediction Model in Retail Banking Using Fuzzy C-Means Algorithm , 2009, Informatica.

[10]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[11]  Bojana Dalbelo Churn Prediction Model in Retail Banking Using Fuzzy C-Means Algorithm , 2009 .

[12]  Tim Menzies,et al.  Data Mining Static Code Attributes to Learn Defect Predictors , 2007 .

[13]  A. O. Oyeniyi,et al.  Customer Churn Analysis In Banking Sector Using Data Mining Techniques , 2015 .

[14]  Jonathan Loo,et al.  Customer churn prediction in telecommunication industry using data certainty , 2019, Journal of Business Research.

[15]  Zafar Iqbal,et al.  Classification of cyber attacks based on rough set theory , 2015, 2015 First International Conference on Anti-Cybercrime (ICACC).

[16]  Yong Shi,et al.  Prediction of Customer Attrition of Commercial Banks based on SVM Model , 2014, ITQM.

[17]  Chung-Tzer Liu,et al.  he effects of relationship quality and switching barriers on customer loyalty , 2010 .

[18]  V. Sheng,et al.  Immune Centroids OverSampling Method for Multi-Class Classification , 2016 .

[19]  Antonio Padilla-Meléndez,et al.  Analyzing the impact of knowledge management on CRM success: The mediating effects of organizational factors , 2011, Int. J. Inf. Manag..

[20]  Audris Mockus,et al.  Towards building a universal defect prediction model , 2014, MSR 2014.

[21]  Asifullah Khan,et al.  Churn comprehension analysis for telecommunication industry using ALBA , 2016, 2016 International Conference on Emerging Technologies (ICET).

[22]  Idan Szpektor,et al.  Churn prediction in new users of Yahoo! answers , 2012, WWW.

[23]  Yufeng Yao,et al.  Immune Centroids Over-Sampling Method for Multi-Class Classification , 2015, PAKDD.

[24]  Thar Baker,et al.  Just-in-time Customer Churn Prediction: With and Without Data Transformation , 2018, 2018 IEEE Congress on Evolutionary Computation (CEC).

[25]  Xiuzhen Zhang,et al.  Comments on "Data Mining Static Code Attributes to Learn Defect Predictors" , 2007, IEEE Trans. Software Eng..

[26]  K. Becker,et al.  Analysis of microarray data using Z score transformation. , 2003, The Journal of molecular diagnostics : JMD.

[27]  Li Hong,et al.  Predicting Customer Churn in Mobile Telephony Industry Using Probabilistic Classifiers in Data Mining , 2013 .

[28]  K. Chitra,et al.  Customer Retention in Banking Sector using Predictive Data Mining Technique , 2011 .

[29]  Hossam Faris,et al.  Negative Correlation Learning for Customer Churn Prediction: A Comparison Study , 2015, TheScientificWorldJournal.

[30]  Zia ur Rehman,et al.  Intelligent churn prediction for telecom using GP-AdaBoost learning and PSO undersampling , 2019, Cluster Computing.

[31]  Vadlamani Ravi,et al.  Churn prediction using comprehensible support vector machine: An analytical CRM application , 2014, Appl. Soft Comput..

[32]  Junyeong Lee,et al.  Business analytics use in CRM: A nomological net from IT competence to CRM performance , 2018, Int. J. Inf. Manag..

[33]  Guandong Xu,et al.  Deployment of churn prediction model in financial services industry , 2016, 2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC).

[34]  Jacky W. Keung,et al.  Cross-Project Defect Prediction Using a Credibility Theory Based Naive Bayes Classifier , 2017, 2017 IEEE International Conference on Software Quality, Reliability and Security (QRS).

[35]  Dirk Van den Poel,et al.  Handling class imbalance in customer churn prediction , 2009, Expert Syst. Appl..

[36]  Erhard Rahm,et al.  Data Cleaning: Problems and Current Approaches , 2000, IEEE Data Eng. Bull..

[37]  Harald C. Gall,et al.  Cross-project defect prediction: a large scale experiment on data vs. domain vs. process , 2009, ESEC/SIGSOFT FSE.

[38]  Arun Kumar Somani,et al.  Enhanced feature mining and classifier models to predict customer churn for an E-retailer , 2017, 2017 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence.

[39]  Kaizhu Huang,et al.  Customer churn prediction in the telecommunication sector using a rough set approach , 2017, Neurocomputing.

[40]  TurhanBurak,et al.  Empirical evaluation of the effects of mixed project data on learning defect predictors , 2013 .

[41]  J. Deville,et al.  Efficient balanced sampling: The cube method , 2004 .

[42]  Iris Reychav,et al.  Going beyond technology: Knowledge sharing as a tool for enhancing customer-oriented attitudes , 2009, Int. J. Inf. Manag..

[43]  Haibo He,et al.  ADASYN: Adaptive synthetic sampling approach for imbalanced learning , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[44]  Maja Matijasevic,et al.  MMORPG player behavior model based on player action categories , 2011, 2011 10th Annual Workshop on Network and Systems Support for Games.

[45]  Arno De Caigny,et al.  A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees , 2018, Eur. J. Oper. Res..

[46]  Adnan Amin,et al.  Churn Prediction in Telecommunication Industry Using Rough Set Approach , 2015, New Trends in Computational Collective Intelligence.

[47]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[48]  Andrej Kosir,et al.  A diffusion model for churn prediction based on sociometric theory , 2015, Adv. Data Anal. Classif..

[49]  Maria João Ferreira,et al.  Organizational Training within Digital Transformation: The ToOW Model , 2017, ICEIS.

[50]  Ekrem Duman,et al.  A profit-driven Artificial Neural Network (ANN) with applications to fraud detection and direct marketing , 2016, Neurocomputing.

[51]  Zaidah Ibrahim,et al.  Customer Churn Prediction using Recurrent Neural Network with Reinforcement Learning Algorithm in Mobile Phone Users , 2014 .

[52]  Xin Yao,et al.  A novel evolutionary data mining algorithm with applications to churn prediction , 2003, IEEE Trans. Evol. Comput..

[53]  Shini Renjith,et al.  B2C E-Commerce Customer Churn Management: Churn Detection using Support Vector Machine and Personalized Retention using Hybrid Recommendations , 2017 .

[54]  Zhaohua Deng,et al.  Understanding customer satisfaction and loyalty: An empirical study of mobile instant messages in China , 2010, Int. J. Inf. Manag..

[55]  Ying Zou,et al.  Data Transformation in Cross-project Defect Prediction , 2017, Empirical Software Engineering.

[56]  Àngela Nebot,et al.  Visualizing pay-per-view television customers churn using cartograms and flow maps , 2013, ESANN.

[57]  Girish Keshav Palshikar,et al.  Employee churn prediction , 2011, Expert Syst. Appl..

[58]  Chu-Chen Rosa Yeh,et al.  Simple database marketing tools in customer analysis and retention , 2003, Int. J. Inf. Manag..

[59]  Adnan Amin,et al.  Customer Churn Prediction in Telecommunication Industry: With and without Counter-Example , 2014, 2014 European Network Intelligence Conference.

[60]  Tim Menzies,et al.  Balancing Privacy and Utility in Cross-Company Defect Prediction , 2013, IEEE Transactions on Software Engineering.

[61]  Ramesh Hariharan,et al.  The analysis of microarray data. , 2003, Pharmacogenomics.

[62]  Jian-xiong Dong,et al.  Algorithms of fast SVM evaluation based on subspace projection , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[63]  Hongyue WANG,et al.  Log-transformation and its implications for data analysis , 2014, Shanghai archives of psychiatry.

[64]  Anthony J Bishara,et al.  Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality , 2015, Educational and psychological measurement.

[65]  Bart Baesens,et al.  Time series for early churn detection: Using similarity based classification for dynamic networks , 2018, Expert Syst. Appl..

[66]  Ayse Basar Bener,et al.  Empirical evaluation of the effects of mixed project data on learning defect predictors , 2013, Inf. Softw. Technol..

[67]  Ye Yang,et al.  An investigation on the feasibility of cross-project defect prediction , 2012, Automated Software Engineering.

[68]  Naoyasu Ubayashi,et al.  An empirical study of just-in-time defect prediction using cross-project models , 2014, MSR 2014.

[69]  Reza Allahyari Soeini,et al.  Applying Data Mining to Insurance Customer Churn Management , 2012 .

[70]  Maliha S. Nash,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 2001, Technometrics.

[71]  Bart Baesens,et al.  A comparative study of social network classifiers for predicting churn in the telecommunication industry , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[72]  Tim Menzies,et al.  Data Mining Static Code Attributes to Learn Defect Predictors , 2007, IEEE Transactions on Software Engineering.

[73]  Parag C. Pendharkar,et al.  Genetic algorithm based neural network approaches for predicting churn in cellular wireless network services , 2009, Expert Syst. Appl..

[74]  Tim Menzies,et al.  Data Mining Static Code Attributes to Learn Defect Predictors , 2007, IEEE Transactions on Software Engineering.

[75]  Monika,et al.  Churn Prediction in Telecommunication Industry using Decision Tree , 2017 .

[76]  Bart Baesens,et al.  Profit Driven Decision Trees for Churn Prediction , 2017, Eur. J. Oper. Res..

[77]  Cheng Zhang,et al.  Native API based Windows anomaly intrusion detection method using SVM , 2006, IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (SUTC'06).

[78]  Christopher Bull,et al.  Customer Relationship Management (CRM) systems, intermediation and disintermediation: The case of INSG , 2010, Int. J. Inf. Manag..

[79]  Adnan Amin,et al.  Just-in-time customer churn prediction in the telecommunication sector , 2017, The Journal of Supercomputing.

[80]  Adnan Amin,et al.  A Comparison of Two Oversampling Techniques (SMOTE vs MTDF) for Handling Class Imbalance Problem: A Case Study of Customer Churn Prediction , 2015, WorldCIST.

[81]  Imed Boughzala,et al.  The Shape of Digital Transformation: A Systematic Literature Review , 2015, MCIS.

[82]  Konstantinos I. Diamantaras,et al.  A comparison of machine learning techniques for customer churn prediction , 2015, Simul. Model. Pract. Theory.

[83]  Shyr-Shen Yu,et al.  Distance Metric Based Oversampling Method for Bioinformatics and Performance Evaluation , 2016, Journal of Medical Systems.

[84]  Bart Baesens,et al.  Social network analytics for churn prediction in telco: Model building, evaluation and network architecture , 2017, Expert Syst. Appl..

[85]  Audris Mockus,et al.  How Does Context Affect the Distribution of Software Maintainability Metrics? , 2013, 2013 IEEE International Conference on Software Maintenance.

[86]  Der-Chiang Li,et al.  Using mega-trend-diffusion and artificial samples in small data set learning for early flexible manufacturing system scheduling knowledge , 2007, Comput. Oper. Res..

[87]  Dirk Van den Poel,et al.  CRM at a pay-TV company: Using analytical models to reduce customer attrition by targeted marketing for subscription services , 2007, Expert Syst. Appl..

[88]  Jaideep Srivastava,et al.  Churn Prediction in MMORPGs: A Social Influence Based Approach , 2009, 2009 International Conference on Computational Science and Engineering.

[89]  Mustafa Mat Deris,et al.  CHURN CLASSIFICATION MODEL FOR LOCAL TELECOMMUNICATION COMPANY BASED ON ROUGH SET THEORY , 2018 .

[90]  Sheng-yi Jiang,et al.  Approximate Equal Frequency Discretization Method , 2009, 2009 WRI Global Congress on Intelligent Systems.

[91]  Guilherme Horta Travassos,et al.  Cross versus Within-Company Cost Estimation Studies: A Systematic Review , 2007, IEEE Transactions on Software Engineering.

[92]  Bart Baesens,et al.  Social network analysis for customer churn prediction , 2014, Appl. Soft Comput..

[93]  Xin Yao,et al.  MWMOTE--Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning , 2014 .

[94]  Axel Uhl,et al.  Digital Enterprise Transformation: A Business-Driven Approach to Leveraging Innovative IT , 2014 .

[95]  Panos Vassiliadis,et al.  Deciding the physical implementation of ETL workflows , 2007, DOLAP '07.

[96]  Chih-Fong Tsai,et al.  Customer churn prediction by hybrid neural networks , 2009, Expert Syst. Appl..