This paper shows how to apply data-mining and modeling methods to learn predictive models of customer behaviors from survey and behavioral data. The models predict transition rates of individual customers among states, including product adds and drops and account attrition rates. A key insight is that classification tree algorithms from data mining can be used to test conditional independence (Cl) relations among variables in large multivariate data sets. This suggests constructive techniques for (a) Building causal graph models from data; and (b) Using data to define the states of a dynamic transition process. The resulting models can be used to help optimize product offers, forecast demand for products, and plan marketing campaigns. We use several real data sets to illustrate how to: (a) Develop predictive models from survey data and from billing data, (b) Validate model assumptions by using classification trees to identify and test conditional independence relations, (c) Evaluate model performance compared to other (e.g., logistic regression or discriminant analysis) models using cross-validation, and (d) Recommend the next logical product to offer to each customer and the best customers to target for each product in order to maximize sales.
[1]
Finn Verner Jensen,et al.
Introduction to Bayesian Networks
,
2008,
Innovations in Bayesian Networks.
[2]
David Biggs,et al.
A method of choosing multiway partitions for classification and decision trees
,
1991
.
[3]
J. Pearl.
Causality: Models, Reasoning and Inference
,
2000
.
[4]
Glenn Shafer,et al.
The art of causal conjecture
,
1996
.
[5]
S. T. Buckland,et al.
Computer Intensive Statistical Methods: Validation, Model Selection, and Bootstrap
,
1993
.
[6]
Maia Berkane.
Latent Variable Modeling and Applications to Causality
,
1997
.
[7]
Michael I. Jordan.
Learning in Graphical Models
,
1999,
NATO ASI Series.
[8]
Tony Lancaster,et al.
The econometric analysis of transition data
,
1990
.
[9]
Christopher M. Bishop,et al.
Classification and regression
,
1997
.
[10]
Leo Breiman,et al.
Classification and Regression Trees
,
1984
.
[11]
Louis Anthony Cox,et al.
Forecasting Demand for Telecommunications Products from Cross-Sectional Data
,
2001,
Telecommun. Syst..
[12]
Mtw,et al.
Computation, causation, and discovery
,
2000
.