Variable selection by association rules for customer churn prediction of multimedia on demand

Multimedia on demand (MOD) is an interactive system that provides a number of value-added services in addition to traditional TV services, such as video on demand and interactive online learning. This opens a new marketing and managerial problem for the telecommunication industry to retain valuable MOD customers. Data mining techniques have been widely applied to develop customer churn prediction models, such as neural networks and decision trees in the domain of mobile telecommunication. However, much related work focuses on developing the prediction models per se. Few studies consider the pre-processing step during data mining whose aim is to filter out unrepresentative data or information. This paper presents the important processes of developing MOD customer churn prediction models by data mining techniques. They contain the pre-processing stage for selecting important variables by association rules, which have not been applied before, the model construction stage by neural networks (NN) and decision trees (DT), which are widely adapted in the literature, and four evaluation measures including prediction accuracy, precision, recall, and F-measure, all of which have not been considered to examine the model performance. The source data are based on one telecommunication company providing the MOD services in Taiwan, and the experimental results show that using association rules allows the DT and NN models to provide better prediction performances over a chosen validation dataset. In particular, the DT model performs better than the NN model. Moreover, some useful and important rules in the DT model, which show the factors affecting a high proportion of customer churn, are also discussed for the marketing and managerial purpose.

[1]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[2]  Cheng-Jung Lin,et al.  Goal-oriented sequential pattern for network banking churn analysis , 2003, Expert Syst. Appl..

[3]  So Young Sohn,et al.  Searching customer patterns of mobile service using clustering and quantitative association rule , 2008, Expert Syst. Appl..

[4]  Chih-Fong Tsai,et al.  Market segmentation based on hierarchical self-organizing map for markets of multimedia on demand , 2008, Expert Syst. Appl..

[5]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[6]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[7]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[8]  Moon-Koo Kim,et al.  The effects of customer satisfaction and switching barrier on customer loyalty in Korean mobile telecommunication services , 2004 .

[9]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[10]  Savvas Papagiannidis,et al.  Well beyond streaming video: IPv6 and the next generation television , 2006 .

[11]  Gordon S. Linoff,et al.  Data mining techniques for marketing, sales and customer relationship , 2004 .

[12]  Cheng-Seen Ho,et al.  Toward a hybrid data mining model for customer retention , 2007, Knowl. Based Syst..

[13]  Jatinder N. D. Gupta,et al.  Neural networks in business: techniques and applications for the operations researcher , 2000, Comput. Oper. Res..

[14]  V. Hawkes,et al.  THE HEART OF THE MATTER: THE CHALLENGE OF CUSTOMER LIFETIME VALUE , 2000 .

[15]  Chih-Ping Wei,et al.  Turning telecommunications call details to churn prediction: a data mining approach , 2002, Expert Syst. Appl..

[16]  張 毓騰,et al.  APPLYING DATA MINING TO TELECOM CHURN MANAGEMENT , 2009 .

[17]  Shuang Shi,et al.  TreeLogit Model for Customer Churn Prediction , 2006, 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06).

[18]  Gary M. Weiss Data Mining in Telecommunications , 2005, The Data Mining and Knowledge Discovery Handbook.

[19]  Jaekyung Yang,et al.  Optimization-based feature selection with adaptive instance sampling , 2006, Comput. Oper. Res..

[20]  Dirk Van den Poel,et al.  Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting , 2005, Eur. J. Oper. Res..

[21]  Alex Berson,et al.  Building Data Mining Applications for CRM , 1999 .

[22]  Claudio A. Perez,et al.  Subscription fraud prevention in telecommunications using fuzzy rules and neural networks , 2006, Expert Syst. Appl..

[23]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[24]  Jae-Hyeon Ahn,et al.  Customer churn analysis: Churn determinants and mediation effects of partial defection in the Korean mobile telecommunications service industry , 2006 .

[25]  Kristof Coussement,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparing Two Parameter-selection Techniques Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparin , 2022 .

[26]  D. Edwards Data Mining: Concepts, Models, Methods, and Algorithms , 2003 .

[27]  Bo K. Wong,et al.  Neural network applications in business: A review and analysis of the literature (1988-1995) , 1997, Decis. Support Syst..

[28]  Perambur S. Neelakanta,et al.  Development of a neuroinference engine for ADSL modem applications in telecommunications using an ANN with fast computational ability , 2002, Neurocomputing.

[29]  J. Nazuno Haykin, Simon. Neural networks: A comprehensive foundation, Prentice Hall, Inc. Segunda Edición, 1999 , 2000 .

[30]  Luo Bin,et al.  Evaluation of Three Discrete Methods on Customer Churn Model Based on Neural Network and Decision Tree in PHSS , 2007, The First International Symposium on Data, Privacy, and E-Commerce (ISDPE 2007).