Churn prediction in subscriber management for mobile and wireless communications services

Subscriber churn is a concern of customer care management for most of the mobile and wireless service providers and operators due to its associated costs. This paper explains our work on subscriber churn analysis and prediction for such services. We work on data mining techniques to accurately and efficiently predict subscribers who will change-and-turn (churn) to another provider for the same or similar service. The dataset we use is a public and real dataset compiled by Orange Telecom for the KDD 2009 Competition. Number of teams achieved high scores on this dataset requiring a significant amount of computing resources. We are aiming to find alternative methods that can match or improve the recorded high scores with more efficient and practical use of resources. In this study, we focus on ensemble of meta-classifiers which have been studied individually and chosen according to their performances.

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  S. Cessie,et al.  Ridge Estimators in Logistic Regression , 1992 .

[4]  Isabelle Guyon,et al.  Winning the KDD Cup Orange Challenge with Ensemble Selection , 2009 .

[5]  Isabelle Guyon,et al.  Design and analysis of the KDD cup 2009: fast scoring on a large orange customer database , 2009, SKDD.

[6]  Marc Boullé,et al.  The 2009 Knowledge Discovery and Data Mining Competition (KDD Cup 2009): Challenges in Machine Learning , 2011 .

[7]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[10]  Yossi Richter,et al.  Predicting Customer Churn in Mobile Networks through Analysis of Social Groups , 2010, SDM.

[11]  Minghe Sun,et al.  A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data , 2012, Eur. J. Oper. Res..

[12]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[13]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  I. Kononenko,et al.  INDUCTION OF DECISION TREES USING RELIEFF , 1995 .

[16]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[17]  Chih-Ping Wei,et al.  Turning telecommunications call details to churn prediction: a data mining approach , 2002, Expert Syst. Appl..

[18]  Yang Yang,et al.  Bagging binary and quantile predictors for time series , 2006 .

[19]  Hugh Miller,et al.  Predicting customer behaviour: The University of Melbourne's KDD Cup report , 2009, KDD Cup.

[20]  Jianjun Xie,et al.  A Combination of Boosting and Bagging for KDD Cup 2009 - Fast Scoring on a Large Database , 2009, KDD Cup.