Data augmentation by predicting spending pleasure using commercially available external data

Because customer relationship management (CRM) plays an increasingly important role in marketing strategy, a company’s database can be considered a valuable competitive asset. Consequently, companies constantly try to augment their databases, both through their own data collection and through the acquisition of commercially available external data. Until now, little research has been done on the usefulness of these commercially available external databases for CRM. This study presents a methodology for external data vendors, based on random forests predictive modeling, to create commercial variables that address the shortcomings of a classic transactional database. Using this methodology, we predicted spending pleasure variables, a composite measure of purchasing behavior and attitude, in 26 product categories for more than 3 million respondents. Enhancing a company’s transactional database with these variables can significantly improve the predictive performance of existing CRM models. We demonstrate this in a case study with a magazine publisher that needed to identify prospects for new customer acquisition.
