Just-in-time customer churn prediction in the telecommunication sector

Rapid technological growth and an increasing number of competitors have made customer churn a serious problem for telecom companies. Customer churn refers to customers' intention to switch away from a particular service or from the service provider altogether. Academics and practitioners have developed many customer churn prediction (CCP) techniques to support customer retention. However, CCP has not been widely studied for the scenario in which a company lacks sufficient historical data, whether because it is newly established, has recently launched a new technology, or has lost its historical data. The just-in-time (JIT) approach can be a more practical alternative than state-of-the-art CCP techniques in this scenario. Unfortunately, like traditional churn prediction models, JIT also requires sufficient historical data. To address this gap, this study uses cross-company data, i.e., data from another company, in the context of JIT to address the CCP problem in the telecom sector. We empirically evaluated the proposed model using publicly available datasets from two telecom companies. The evaluation shows that, in the JIT-CCP context, (i) it is possible to evaluate the performance of the predictive model when a cross-company dataset is used for training, and (ii) a heterogeneous ensemble-based JIT-CCP model is a more suitable approach than an individual classifier or a homogeneous ensemble-based technique.
