论文信息 - Computational rationality: A converging paradigm for intelligence in brains, minds, and machines

Computational rationality: A converging paradigm for intelligence in brains, minds, and machines

After growing up together, and mostly growing apart in the second half of the 20th century, the fields of artificial intelligence (AI), cognitive science, and neuroscience are reconverging on a shared view of the computational foundations of intelligence that promotes valuable cross-disciplinary exchanges on questions, methods, and results. We chart advances over the past several decades that address challenges of perception and action under uncertainty through the lens of computation. Advances include the development of representations and inferential procedures for large-scale probabilistic inference and machinery for enabling reflection and decisions about tradeoffs in effort, precision, and timeliness of computations. These tools are deployed toward the goal of computational rationality: identifying decisions with highest expected utility, while taking into consideration the costs of computation in complex real-world problems in which most relevant calculations can only be approximated. We highlight key concepts with examples that show the potential for interchange between computer science, cognitive science, and neuroscience.

[1] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[2] Michèle Sebag,et al. The grand challenge of computer Go , 2012, Commun. ACM.

[3] E. Rowland. Theory of Games and Economic Behavior , 1946, Nature.

[4] A. Turing. On computable numbers, with an application to the Entscheidungsproblem , 1937, Proc. London Math. Soc..

[5] Adam N Sanborn,et al. Rational approximations to rational models: alternative algorithms for category learning. , 2010, Psychological review.

[6] John R. Anderson. The Adaptive Character of Thought , 1990 .

[7] J. Tenenbaum,et al. Structure and strength in causal induction , 2005, Cognitive Psychology.

[8] Thomas L. Griffiths,et al. Algorithm selection by rational metareasoning as a model of human strategy selection , 2014, NIPS.

[9] Daphne Koller,et al. Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence , 2001 .

[10] Stuart J. Russell,et al. Principles of Metareasoning , 1989, Artif. Intell..

[11] Judea Pearl,et al. Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[12] Brad E. Pfeiffer,et al. Hippocampal place cell sequences depict future paths to remembered goals , 2013, Nature.

[13] Joshua B. Tenenbaum,et al. Multistability and Perceptual Inference , 2012, Neural Computation.

[14] Wolfgang Maass,et al. Neural Dynamics as Sampling: A Model for Stochastic Computation in Recurrent Networks of Spiking Neurons , 2011, PLoS Comput. Biol..

[15] Joseph T. McGuire,et al. Decision making and the avoidance of cognitive demand. , 2010, Journal of experimental psychology. General.

[16] Satinder Singh,et al. Computational Rationality: Linking Mechanism and Behavior Through Bounded Utility Maximization , 2014, Top. Cogn. Sci..

[17] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .

[18] Honglak Lee,et al. Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning , 2014, NIPS.

[19] 宁北芳,et al. 疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[20] Eric Horvitz,et al. Principles and applications of continual computation , 2001, Artif. Intell..

[21] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[22] S. Killcross,et al. Coordination of actions and habits in the medial prefrontal cortex of rats. , 2003, Cerebral cortex.

[23] A. Markman,et al. The Curse of Planning: Dissecting Multiple Reinforcement-Learning Systems by Taxing the Central Executive , 2013 .

[24] David Heckerman,et al. Proceedings of the Ninth international conference on Uncertainty in artificial intelligence , 1993, Conference on Uncertainty in Artificial Intelligence.

[25] Michael Isard,et al. CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[26] A. Dickinson. Actions and habits: the development of behavioural autonomy , 1985 .

[27] Amir Dezfouli,et al. Speed/Accuracy Trade-Off between the Habitual and the Goal-Directed Processes , 2011, PLoS Comput. Biol..

[28] D. Wilkin,et al. Neuron , 2001, Brain Research.

[29] Joseph Y. Halpern,et al. Proceedings of the 20th conference on Uncertainty in artificial intelligence , 2004, UAI 2004.

[30] J. Rieskamp,et al. SSL: a theory of how people learn to select strategies. , 2006, Journal of experimental psychology. General.

[31] George A. Alvarez,et al. Explaining human multiple object tracking as resource-constrained approximate inference in a dynamic probabilistic model , 2009, NIPS.

[32] John von Neumann,et al. The Computer and the Brain , 1960 .

[33] A. M. Turing,et al. Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[34] Eric J. Johnson,et al. Adaptive Strategy Selection in Decision Making. , 1988 .

[35] John R. Anderson,et al. The Adaptive Character of Thought , 1990 .

[36] A. Hasman,et al. Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[37] L. Beach,et al. Man as an Intuitive Statistician , 2022 .

[38] A. Tversky,et al. Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[39] P. Kline. Models of man , 1986, Nature.

[40] B. Balleine,et al. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning , 2004, The European journal of neuroscience.

[41] Thomas L. Griffiths,et al. One and Done? Optimal Decisions From Very Few Samples , 2014, Cogn. Sci..

[42] Charles Kemp,et al. How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[43] P. Dayan,et al. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[44] Thomas L. Griffiths,et al. Modeling the effects of memory on human online sentence processing with particle filters , 2008, NIPS.

[45] Eric Horvitz,et al. Metareasoning for Planning Under Uncertainty , 2015, IJCAI.

[46] Leslie Pack Kaelbling,et al. Planning under Time Constraints in Stochastic Domains , 1993, Artif. Intell..

[47] H. Simon,et al. Models of Man. , 1957 .

[48] Shinsuke Shimojo,et al. Neural Computations Underlying Arbitration between Model-Based and Model-free Learning , 2013, Neuron.

[49] R. Rosenfeld. Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[50] Joseph T. McGuire,et al. Prefrontal cortex, cognitive control, and the registration of decision costs , 2010, Proceedings of the National Academy of Sciences.

[51] S. Denison,et al. Rational variability in children’s causal inferences: The Sampling Hypothesis , 2013, Cognition.

[52] BlakeAndrew,et al. C ONDENSATION Conditional Density Propagation forVisual Tracking , 1998 .

[53] P. Dayan,et al. Model-based influences on humans’ choices and striatal prediction errors , 2011, Neuron.

[54] Thomas L. Griffiths,et al. Rational Use of Cognitive Resources: Levels of Analysis Between the Computational and the Algorithmic , 2015, Top. Cogn. Sci..

[55] M. Botvinick,et al. A labor/leisure tradeoff in cognitive control. , 2014, Journal of experimental psychology. General.

[56] Gregory F. Cooper,et al. The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[57] Adam Johnson,et al. Neural Ensembles in CA3 Transiently Encode Paths Forward of the Animal at a Decision Point , 2007, The Journal of Neuroscience.

[58] Rajesh P. N. Rao,et al. Bayesian brain : probabilistic approaches to neural coding , 2006 .

[59] Wheeler Ruml,et al. Heuristic Search When Time Matters , 2013, J. Artif. Intell. Res..

[60] G. Gigerenzer. Rationality for Mortals: How People Cope with Uncertainty , 2008 .