Why less can be more : a Bayesian framework for heuristics

When making decisions under uncertainty, one common view is that people rely on simple heuristics that deliberately ignore information. One of the greatest puzzles in cognitive science concerns why heuristics can sometimes outperform full-information models, such as linear regression, which make full use of the available information. In this thesis, I will contribute the novel idea that heuristics can be thought of as embodying extreme Bayesian priors. Thereby, an explanation for less-is-more is that the heuristics’ relative simplicity and inflexibility amounts to a strong inductive bias, that is suitable for some learning and decision problems. I will formalize this idea by introducing Bayesian models within which heuristics are an extreme case along a continuum of model flexibility defined by the strength and nature of the prior. Crucially, the Bayesian models include heuristics at one of the Bayesian prior strength and classic full-information models at the other end of the Bayesian prior. This allows for a comparative test between the intermediate models along the continuum and the extremes of heuristics and full regression model. Indeed, I will show that intermediate models perform best across simulations, suggesting that down-weighting information is preferable to entirely ignoring it. These results refute an absolute version of less-is-more, demonstrating that heuristics will usually be outperformed by a model that takes into account the full information but weighs it appropriately. Thereby, the thesis provides a novel explanation for less-is-more: Heuristics work well because they embody a Bayesian prior that approximates the optimal prior. While the main contribution is formal, the final Chapter will explore whether less is more at the psychological level, and finds that people do not use heuristics, but rely on the full information instead. A consistent perspective will emerge throughout the whole thesis, which is that less is not more.

[1]  Julian N. Marewski,et al.  Using the ACT-R architecture to specify 39 quantitative process models of decision making , 2011, Judgment and Decision Making.

[2]  Jonathan D. Nelson,et al.  Information search with situation-specific reward functions , 2012, Judgment and Decision Making.

[3]  A. Pouget,et al.  Probabilistic brains: knowns and unknowns , 2013, Nature Neuroscience.

[4]  David R. Shanks,et al.  Predict choice: A comparison of 21 mathematical models , 2014, CogSci.

[5]  W. Stroebe,et al.  The Alleged Crisis and the Illusion of Exact Replication , 2014, Perspectives on psychological science : a journal of the Association for Psychological Science.

[6]  B. Newell,et al.  Empirical tests of a fast-and-frugal heuristic: Not everyone "takes-the-best" , 2003 .

[7]  J. Rieskamp,et al.  SSL: a theory of how people learn to select strategies. , 2006, Journal of experimental psychology. General.

[8]  J. Neumann,et al.  Theory of games and economic behavior , 1945, 100 Years of Math Milestones.

[9]  J. Tenenbaum,et al.  Generalization, similarity, and Bayesian inference. , 2001, The Behavioral and brain sciences.

[10]  I. J. Myung,et al.  When a good fit can be bad , 2002, Trends in Cognitive Sciences.

[11]  Robin M. Hogarth,et al.  Ignoring information in binary choice with continuous variables: When is less “more”? , 2005 .

[12]  Steven A. Sloman,et al.  The Problem of Induction , 2005 .

[13]  Jesfis Peral,et al.  Heuristics -- intelligent search strategies for computer problem solving , 1984 .

[14]  R. Hertwig,et al.  How forgetting aids heuristic inference. , 2005, Psychological review.

[15]  Thorsten Pachur,et al.  Type of learning task impacts performance and strategy selection in decision making , 2012, Cognitive Psychology.

[16]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[17]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[18]  Kathryn B. Laskey,et al.  Bayesian benchmarks for fast and frugal heuristics , 1999 .

[19]  Jörg Rieskamp,et al.  Testing adaptive toolbox models: a Bayesian hierarchical approach. , 2013, Psychological review.

[20]  Nick Chater,et al.  Economic irrationality is optimal during noisy decision making , 2016, Proceedings of the National Academy of Sciences.

[21]  Eric J. Johnson,et al.  The adaptive decision maker , 1993 .

[22]  D. Markant,et al.  Is it better to select or to receive? Learning via active and passive hypothesis testing. , 2014, Journal of experimental psychology. General.

[23]  U. Hoffrage,et al.  Fast, frugal, and fit: Simple heuristics for paired comparison , 2002 .

[24]  Jörg Rieskamp,et al.  RedundancyEnvironment Structure That Simple Heuristics Can Exploit , 2012 .

[25]  L. Jacoby,et al.  On the relationship between autobiographical memory and perceptual learning. , 1981, Journal of experimental psychology. General.

[26]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[27]  B. Newell,et al.  Take the best or look at the rest? Factors influencing "one-reason" decision making. , 2003, Journal of experimental psychology. Learning, memory, and cognition.

[28]  Nick Chater,et al.  A rational analysis of the selection task as optimal data selection. , 1994 .

[29]  H. Raiffa,et al.  Applied Statistical Decision Theory. , 1961 .

[30]  Ulrich Hoffrage,et al.  Why does one-reason decision making work? A case study in ecological rationality , 1999 .

[31]  Magnus Persson,et al.  PROBabilities from EXemplars (PROBEX): a "lazy" algorithm for probabilistic inference from generic knowledge , 2002 .

[32]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[33]  Todd M. Gureckis,et al.  One piece at a time: Learning complex rules through self-directed sampling , 2012, CogSci.

[34]  B. Love,et al.  The myth of computational level theory and the vacuity of rational analysis , 2011, Behavioral and Brain Sciences.

[35]  Robin M. Hogarth,et al.  Take-the-Best and Other Simple Strategies: Why and When They Work Well in Binary Choice , 2004 .

[36]  Donald W. Marquaridt Generalized Inverses, Ridge Regression, Biased Linear Estimation, and Nonlinear Estimation , 1970 .

[37]  David Marr,et al.  VISION A Computational Investigation into the Human Representation and Processing of Visual Information , 2009 .

[38]  N. Chater,et al.  Précis of Bayesian Rationality: The Probabilistic Approach to Human Reasoning , 2009, Behavioral and Brain Sciences.

[39]  H. Simon,et al.  Invariants of human behavior. , 1990, Annual review of psychology.

[40]  Joshua B. Tenenbaum,et al.  Rules and Similarity in Concept Learning , 1999, NIPS.

[41]  Gerd Gigerenzer,et al.  Good Judgments Do Not Require Complex Cognition , 2008 .

[42]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[43]  Fiona Cowie,et al.  THE LOGICAL PROBLEM OF LANGUAGE ACQUISITION , 1997, Synthese.

[44]  Edward H. Adelson,et al.  Motion illusions as optimal percepts , 2002, Nature Neuroscience.

[45]  P M Todd,et al.  Précis of Simple heuristics that make us smart , 2000, Behavioral and Brain Sciences.

[46]  A. Yuille,et al.  Object perception as Bayesian inference. , 2004, Annual review of psychology.

[47]  Konrad Paul Kording,et al.  Bayesian integration in sensorimotor learning , 2004, Nature.

[48]  Ben R. Newell,et al.  A Hierarchical Bayesian Modeling Approach to Searching and Stopping in Multi-Attribute Judgment , 2014, Cogn. Sci..

[49]  J. Scott Rational Choice Theory , 2007 .

[50]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[51]  R. Nosofsky Relations between exemplar-similarity and likelihood models of classification , 1990 .

[52]  P. McLeod,et al.  Do Fielders Know Where to Go to Catch the Ball or Only How to Get There , 1996 .

[53]  A. Yuille,et al.  Opinion TRENDS in Cognitive Sciences Vol.10 No.7 July 2006 Special Issue: Probabilistic models of cognition Vision as Bayesian inference: analysis by synthesis? , 2022 .

[54]  S. Maxwell,et al.  Is psychology suffering from a replication crisis? What does "failure to replicate" really mean? , 2015, The American psychologist.

[55]  Adam N Sanborn,et al.  Rational approximations to rational models: alternative algorithms for category learning. , 2010, Psychological review.

[56]  A. Tversky,et al.  Extensional versus intuitive reasoning: the conjunction fallacy in probability judgment , 1983 .

[57]  R. Hogarth,et al.  Heuristic and linear models of judgment: matching rules and environments. , 2007, Psychological review.