MODELING LARGE-SCALE CROSS EFFECT IN CO-PURCHASE INCIDENCE: COMPARING ARTIFICIAL NEURAL NETWORK TECHNIQUES AND MULTIVARIATE PROBIT MODELING

OF DISSERTATION MODELING LARGE-SCALE CROSS EFFECT IN CO-PURCHASE INCIDENCE: COMPARING ARTIFICIAL NEURAL NETWORK TECHNIQUES AND MULTIVARIATE PROBIT MODELING This dissertation examines cross-category effects in consumer purchases from the big data and analytics perspectives. It uses data from Nielsen Consumer Panel and Scanner databases for its investigations. With big data analytics it becomes possible to examine the cross effects of many product categories on each other. The number of categories whose cross effects are studied is called category scale or just scale in this dissertation. The larger the category scale the higher the number of categories whose cross effects are studied. This dissertation extends research on models of cross effects by (1) examining the performance of MVP model across category scale; (2) customizing artificial neural network (ANN) techniques for large-scale cross effect analysis; (3) examining the performance of ANN across scale; and (4) developing a conceptual model of spending habits as a source of cross effect heterogeneity. The results provide researchers and managers new knowledge about using the two techniques in large category scale settings The computational capabilities required by MVP models grow exponentially with scale and thus are more significantly limited by computational capabilities than are ANN models. In our experiments, for scales 4, 8, 16 and 32, using Nielsen data, MVP models could not be estimated using baskets with 16 and more categories. We attempted to and could calibrate ANN models, on the other hand, for both scales 16 and 32. Surprisingly, the predictive results of ANN models exhibit an inverted U relationship with scale. As an ancillary result we provide a method for determining the existence and extent of non-linear own and cross category effects on likelihood of purchase of a category using ANN models. Besides our empirical studies, we draw on the mental budgeting model and impulsive spending literature, to provide a conceptualization of consumer spending habits as a source of heterogeneity in cross effect context. Finally, after a discussion of conclusions and limitations, the dissertation concludes with a discussion of open questions for future research.

[1]  P. Green,et al.  Conjoint Analysis in Consumer Research: Issues and Outlook , 1978 .

[2]  David J. Curry,et al.  Prediction in Marketing Using the Support Vector Machine , 2005 .

[3]  H. Hruschka Comparing Small‐ and Large‐Scale Models of Multicategory Buying Behavior , 2013 .

[4]  John C. B. Cooper Artificial neural networks versus multivariate statistics: An application from economics , 1999 .

[5]  Harald Hruschka,et al.  Determining market response functions by neural network modeling: A comparison to econometric techniques , 1993 .

[6]  Imran S. Currim,et al.  Using Segmentation Approaches for Better Prediction and Understanding from Consumer Mode Choice Models , 1981 .

[7]  Girish N. Punj,et al.  Cluster Analysis in Marketing Research: Review and Suggestions for Application , 1983 .

[8]  George G. Lendaris,et al.  On Matching ANN Structure to Problem Domain Structure , 1993 .

[9]  Ronald J. Faber,et al.  Spent Resources: Self‐Regulatory Resource Availability Affects Impulse Buying , 2007 .

[10]  Dennis W. Rook,et al.  Normative Influences on Impulsive Buying Behavior , 1995 .

[11]  Clyde W. Holsapple,et al.  A unified foundation for business analytics , 2014, Decis. Support Syst..

[12]  James A. Roberts,et al.  Environmental segmentation alternatives: a look at green consumer behavior in the new , 1999 .

[13]  Gary J. Russell,et al.  Multiple-Category Decision-Making: Review and Synthesis , 1999 .

[14]  Douglas Kline,et al.  Revisiting squared-error and cross-entropy functions for training neural network classifiers , 2005, Neural Computing & Applications.

[15]  Sunil Gupta,et al.  The Shopping Basket: A Model for Multicategory Purchase Incidence Decisions , 1999 .

[16]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[17]  Yang Li,et al.  A Bayesian Semiparametric Approach for Endogeneity and Heterogeneity in Choice Models , 2014, Manag. Sci..

[18]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[19]  Gregory S. Carpenter,et al.  Meaningful Brands from Meaningless Differentiation: The Dependence on Irrelevant Attributes , 1994 .

[20]  Filippo Menczer,et al.  Customer Targeting: A Neural Network Approach Guided by Genetic Algorithms , 2005, Manag. Sci..

[21]  S. Chib,et al.  Analysis of multi-category purchase incidence decisions using IRI market basket data , 2002 .

[22]  G. Lewicki,et al.  Approximation by Superpositions of a Sigmoidal Function , 2003 .

[23]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[24]  Kwai-Sang Chin,et al.  Modeling daily patient arrivals at Emergency Department and quantifying the relative importance of contributing variables using artificial neural network , 2013, Decis. Support Syst..

[25]  Raúl Rojas,et al.  Neural Networks - A Systematic Introduction , 1996 .

[26]  Fred L. Collopy,et al.  Error Measures for Generalizing About Forecasting Methods: Empirical Comparisons , 1992 .

[27]  Bart Baesens,et al.  Bayesian neural network learning for repeat purchase modelling in direct marketing , 2002, Eur. J. Oper. Res..

[28]  R. Baumeister Yielding to Temptation: Self‐Control Failure, Impulsive Purchasing, and Consumer Behavior , 2002 .

[29]  Gary J. Russell,et al.  A Probabilistic Choice Model for Market Segmentation and Elasticity Structure , 1989 .

[30]  Greg M. Allenby,et al.  Modeling Interdependent Consumer Preferences , 2003 .

[31]  Marc'Aurelio Ranzato,et al.  Large Scale Distributed Deep Networks , 2012, NIPS.

[32]  Gary J. Russell,et al.  Predicting Product Purchase from Inferred Customer Similarity: An Autologistic Model Approach , 2008, Manag. Sci..

[33]  Yoram Wind,et al.  Issues and Advances in Segmentation Research , 1978 .

[34]  L. Hildebrandt,et al.  A Market Basket Analysis Based on the Multivariate MNL Model , 2003 .

[35]  Brad Warner,et al.  Understanding Neural Networks as Statistical Tools , 1996 .

[36]  H. Hruschka,et al.  A parsimonious multivariate poisson model for market basket analysis , 2013 .

[37]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[38]  Peter E. Rossi,et al.  Modeling Consumer Demand for Variety , 2002 .

[39]  M. Wedel,et al.  Analyzing Brand Competition across Subcategories , 2004 .

[40]  V. Rao,et al.  A General Choice Model for Bundles with Multiple-Category Products: Application to Market Segmentation and Optimal Pricing for Bundles , 2003 .

[41]  Pradeep Chintagunta,et al.  Measuring Cross-Category Price Effects with Aggregate Store Data , 2006, Manag. Sci..

[42]  Peter E. Rossi,et al.  Bayesian Statistics and Marketing , 2005 .

[43]  Peter E. Rossi,et al.  Hierarchical Bayes Models: A Practitioners Guide , 2005 .

[44]  Hon-Kwong Lui,et al.  Machine Learning for Direct Marketing Response Models: Bayesian Networks with Evolutionary Programming , 2006, Manag. Sci..

[45]  Joan L. Walker,et al.  Generalized random utility model , 2002, Math. Soc. Sci..

[46]  Dimitris Kanellopoulos,et al.  Association Rules Mining: A Recent Overview , 2006 .

[47]  Ronald T. Wilcox,et al.  Cross-Selling Sequentially Ordered Products: An Application to Consumer Banking Services , 2005 .

[48]  Wendell R. Smith Product Differentiation and Market Segmentation as Alternative Marketing Strategies , 1956 .

[49]  Peter E. Rossi,et al.  Marketing models of consumer heterogeneity , 1998 .

[50]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[51]  Vineet Padmanabhan,et al.  Research Note - A Cross-Category Model of Households' Incidence and Quantity Decisions , 2008, Mark. Sci..

[52]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[53]  Thomas Reutterer,et al.  A combined approach for segment-specific market basket analysis , 2008, Eur. J. Oper. Res..

[54]  Razvan Pascanu,et al.  Theano: Deep Learning on GPUs with Python , 2012 .

[55]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[56]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[57]  Gary J. Russell,et al.  Analysis of cross category dependence in market basket selection , 2000 .

[58]  Douglas B. Holt,et al.  Poststructuralist Lifestyle Analysis: Conceptualizing the Social Patterning of Consumption in Postmodernity , 1997 .

[59]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[60]  Nitin Mehta,et al.  A Multicategory Model of Consumers' Purchase Incidence, Quantity, and Brand Choice Decisions: Methodological Issues and Implications on Promotional Decisions , 2012 .

[61]  Martin A. Riedmiller,et al.  Advanced supervised learning in multi-layer perceptrons — From backpropagation to adaptive learning algorithms , 1994 .

[62]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[63]  Patrick L. Brockett,et al.  A Comparative Analysis of Neural Networks and Statistical Methods for Predicting Consumer Choice , 1997 .

[64]  W. DeSarbo,et al.  Finite-Mixture Structural Equation Models for Response-Based Segmentation and Unobserved Heterogeneity , 1997 .

[65]  Orit Shechtman,et al.  The use of the coefficient of variation in detecting sincerity of effort: a meta-analysis. , 2006, Work.

[66]  M. Wedel,et al.  Market Segmentation: Conceptual and Methodological Foundations , 1997 .

[67]  Sunil Gupta,et al.  Consumers' Price Sensitivities Across Complementary Categories , 2007, Manag. Sci..

[68]  Bo K. Wong,et al.  Neural network applications in business: A review and analysis of the literature (1988-1995) , 1997, Decis. Support Syst..

[69]  S. Chib,et al.  Models of Multi-Category Choice Behavior , 2005 .

[70]  Harald Hruschka,et al.  Analyzing market baskets by restricted Boltzmann machines , 2012, OR Spectrum.

[71]  Xiaobo Li,et al.  On Theoretical and Empirical Aspects of Marginal Distribution Choice Models , 2014, Manag. Sci..

[72]  D. Baczynski,et al.  Influence of artificial neural network structure on quality of short-term electric energy consumption forecast , 2004 .

[73]  Denzil G. Fiebig,et al.  The Generalized Multinomial Logit Model: Accounting for Scale and Coefficient Heterogeneity , 2010, Mark. Sci..

[74]  Usha A. Kumar,et al.  Comparison of neural networks and regression analysis: A new insight , 2005, Expert Syst. Appl..

[75]  D. McFadden The Choice Theory Approach to Market Research , 1986 .

[76]  H. Fulford,et al.  Familiarity, expertise and involvement: key consumer segmentation factors , 2008 .

[77]  Ronald J. Faber,et al.  Impulse Buying: Its Relation to Personality Traits and Cues , 2000 .

[78]  Herman Aguinis,et al.  Using Market Basket Analysis in Management Research , 2012, Journal of Management.

[79]  Mukta Paliwal,et al.  Neural networks and statistical techniques: A review of applications , 2009, Expert Syst. Appl..

[80]  Stefan Fritsch,et al.  neuralnet: Training of Neural Networks , 2010, R J..

[81]  Jack B. Soll,et al.  Mental Budgeting and Consumer Decisions , 1996 .

[82]  Nathan Intrator,et al.  Interpreting neural-network results: a simulation study , 2001 .