A Comparative Analysis of Neural Networks and Statistical Methods for Predicting Consumer Choice

This paper presents a definitive description of neural network methodology and provides an evaluation of its advantages and disadvantages relative to statistical procedures. The development of this rich class of models was inspired by the neural architecture of the human brain. These models mathematically emulate the neurophysical structure and decision making of the human brain, and, from a statistical perspective, are closely related to generalized linear models. Artificial neural networks are, however, nonlinear and use a different estimation procedure feed forward and back propagation than is used in traditional statistical models least squares or maximum likelihood. Additionally, neural network models do not require the same restrictive assumptions about the relationship between the independent variables and dependent variables. Consequently, these models have already been very successfully applied in many diverse disciplines, including biology, psychology, statistics, mathematics, business, insurance, and computer science. We propose that neural networks will prove to be a valuable tool for marketers concerned with predicting consumer choice. We will demonstrate that neural networks provide superior predictions regarding consumer decision processes. In the context of modeling consumer judgment and decision making, for example, neural network models can offer significant improvement over traditional statistical methods because of their ability to capture nonlinear relationships associated with the use of noncompensatory decision rules. Our analysis reveals that neural networks have great potential for improving model predictions in nonlinear decision contexts without sacrificing performance in linear decision contexts. This paper provides a detailed introduction to neural networks that is understandable to both the academic researcher and the practitioner. This exposition is intended to provide both the intuition and the rigorous mathematical models needed for successful applications. In particular, a step-by-step outline of how to use the models is provided along with a discussion of the strengths and weaknesses of the model. We also address the robustness of the neural network models and discuss how far wrong you might go using neural network models versus traditional statistical methods. Herein we report the results of two studies. The first is a numerical simulation comparing the ability of neural networks with discriminant analysis and logistic regression at predicting choices made by decision rules that vary in complexity. This includes simulations involving two noncompensatory decision rules and one compensatory decision rule that involves attribute thresholds. In particular, we test a variant of the satisficing rule used by Johnson et al. Johnson, Eric J., Robert J. Meyer, Sanjoy Ghose. 1989. When choice models fail: Compensatory models in negatively correlated environments. J. Marketing Res.26August 255--270. that sets a lower bound threshold on all attribute values and a “latitude of acceptance” model that sets both a lower threshold and an upper threshold on attribute values, mimicking an “ideal point” model Coombs and Avrunin [Coombs, Clyde H., George S. Avrunin. 1977. Single peaked functions and the theory of preference. Psych. Rev.84 216--230.]. We also test a compensatory rule that equally weights attributes and judges the acceptability of an alternative based on the sum of its attribute values. Thus, the simulations include both a linear environment, in which traditional statistical models might be deemed appropriate, as well as a nonlinear environment where statistical models might not be appropriate. The complexity of the decision rules was varied to test for any potential degradation in model performance. For these simulated data it is shown that, in general, the neural network model outperforms the commonly used statistical procedures in terms of explained variance and out-of-sample predictive accuracy. An empirical study bridging the behavioral and statistical lines of research was also conducted. Here we examine the predictive relationship between retail store image variables and consumer patronage behavior. A direct comparison between a neural network model and the more commonly encountered techniques of discriminant analysis and factor analysis followed by logistic regression is presented. Again the results reveal that the neural network model outperformed the statistical procedures in terms of explained variance and out-of-sample predictive accuracy. We conclude that neural network models offer superior predictive capabilities over traditional statistical methods in predicting consumer choice in nonlinear and linear settings.

[1]  Shouhong Wang,et al.  Application of the Back Propagation Neural Network Algorithm with Monotonicity Constraints for Two‐Group Classification Problems* , 1993 .

[2]  D. O. Hebb,et al.  The organization of behavior , 1988 .

[3]  Russell C. Eberhart,et al.  Neural network PC tools: a practical guide , 1990 .

[4]  Jun Wang,et al.  A feedforward neural network for multiple criteria decision making , 1992, Comput. Oper. Res..

[5]  James L. McClelland,et al.  A distributed model of human learning and memory , 1986 .

[6]  Herbert A. Simon,et al.  The Sciences of the Artificial , 1970 .

[7]  John Mingers,et al.  Neural Networks, Decision Tree Induction and Discriminant Analysis: an Empirical Comparison , 1994 .

[8]  Sanjoy Ghose,et al.  When Choice Models Fail: Compensatory Models in Negatively Correlated Environments , 1989 .

[9]  W. Cooper,et al.  A Neural Network Method for Obtaining an Early Warning of Insurer Insolvency , 1994 .

[10]  Jeffrey L. Elman,et al.  Interactive processes in speech perception: the TRACE model , 1986 .

[11]  Eric J. Johnson,et al.  The adaptive decision maker , 1993 .

[12]  Paul E. Green,et al.  Conjoint Analysis in Marketing: New Developments with Implications for Research and Practice , 1990 .

[13]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[14]  D Zipser,et al.  Learning the hidden structure of speech. , 1988, The Journal of the Acoustical Society of America.

[15]  Jun Wang,et al.  A neural network approach to modeling fuzzy preference relations for multiple criteria decision making , 1994, Comput. Oper. Res..

[16]  Youngohc Yoon,et al.  A Comparison of Discriminant Analysis versus Artificial Neural Networks , 1993 .

[17]  Y. Ganzach,et al.  Nonlinear models of clinical judgment: Meehl's data revisited. , 1995, Psychological bulletin.

[18]  Kishan G. Mehrotra,et al.  Characterization of a Class of Sigmoid Functions with Applications to Neural Networks , 1996, Neural Networks.

[19]  D. Treigueiros,et al.  The application of neural network based methods to the extraction of knowledge from accounting reports , 1991, Proceedings of the Twenty-Fourth Annual Hawaii International Conference on System Sciences.

[20]  Yves Chauvin,et al.  Backpropagation: the basic theory , 1995 .

[21]  A. Tversky Elimination by aspects: A theory of choice. , 1972 .

[22]  Murray Smith,et al.  Neural Networks for Statistical Modeling , 1993 .

[23]  Kunihiko Fukushima,et al.  Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position , 1982, Pattern Recognit..

[24]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[25]  W. C. Ruediger Memory and thought. , 1919 .

[26]  Alvin J. Surkan,et al.  Neural networks for bond rating improved by multiple hidden layers , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[27]  N. Anderson Functional measurement and psychophysical judgment. , 1970, Psychological review.

[28]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[29]  B. Malakooti,et al.  Feedforward artificial neural networks for solving discrete multiple criteria decision making problems , 1994 .

[30]  John G. Lynch Uniqueness Issues in the Decompositional Modeling of Multiattribute Overall Evaluations: An Information Integration Perspective , 1985 .

[31]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[32]  Michael S. Humphreys,et al.  An auto-associative neural network for sparse representations : analysis and application to models of recognition and cued recall , 1994 .

[33]  L. Shastri,et al.  From simple associations to systematic reasoning: A connectionist representation of rules, variables and dynamic bindings using temporal synchrony , 1993, Behavioral and Brain Sciences.

[34]  H. White Some Asymptotic Results for Learning in Single Hidden-Layer Feedforward Network Models , 1989 .

[35]  Soumitra Dutta,et al.  Bond rating: A non-conservative application of neural networks , 1988 .

[36]  Anna Hart,et al.  Using Neural Networks for Classification Tasks – Some Experiments on Datasets and Practical Advice , 1992 .

[37]  N. Anderson Integration theory and attitude change. , 1971 .

[38]  John R. Anderson Language, Memory, and Thought , 1976 .

[39]  William L. Wilkie,et al.  Issues in Marketing's use of Multi-Attribute Attitude Models , 1973 .

[40]  R. Dawes,et al.  Linear models in decision making. , 1974 .

[41]  H. J. Einhorn The use of nonlinear, noncompensatory models in decision making. , 1970, Psychological bulletin.

[42]  M. Fishbein A Behavior Theory Approach to the Relations between Beliefs about an Object and the Attitude Toward the Object , 1976 .

[43]  Eric J. Johnson,et al.  Compensatory Choice Models of Noncompensatory Processes: The Effect of Varying Context , 1984 .

[44]  Ken-ichi Funahashi,et al.  On the approximate realization of continuous mappings by neural networks , 1989, Neural Networks.

[45]  George S. Avrunin,et al.  Single-Peaked Functions and the Theory of Preference. , 1977 .

[46]  R. L. Keeney,et al.  Decisions with Multiple Objectives: Preferences and Value Trade-Offs , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[47]  E. Mine Cinar,et al.  Neural Networks: A New Tool for Predicting Thrift Failures , 1992 .

[48]  P. Green,et al.  Conjoint Analysis in Consumer Research: Issues and Outlook , 1978 .