Simulation as an engine of physical scene understanding

In a glance, we can perceive whether a stack of dishes will topple, a branch will support a child’s weight, a grocery bag is poorly packed and liable to tear or crush its contents, or a tool is firmly attached to a table or free to be lifted. Such rapid physical inferences are central to how people interact with the world and with each other, yet their computational underpinnings are poorly understood. We propose a model based on an “intuitive physics engine,” a cognitive mechanism similar to computer engines that simulate rich physics in video games and graphics, but that uses approximate, probabilistic simulations to make robust and fast inferences in complex natural scenes where crucial information is unobserved. This single model fits data from five distinct psychophysical tasks, captures several illusions and biases, and explains core aspects of human mental models and common-sense reasoning that are instrumental to how humans understand their everyday world.

[1]  K. J. Craik,et al.  The nature of explanation , 1944 .

[2]  H. Hock,et al.  Contextual relations: The influence of familiarity, physical plausibility, and belongingness , 1974 .

[3]  Patrick Henry Winston,et al.  The psychology of computer vision , 1976, Pattern Recognit..

[4]  A. Caramazza,et al.  Curvilinear motion in the absence of external forces: naive beliefs about the motion of objects. , 1980, Science.

[5]  Dirk P. Kroese,et al.  Simulation and the Monte Carlo Method (Wiley Series in Probability and Statistics) , 1981 .

[6]  I. Biederman,et al.  Scene perception: Detecting and judging objects undergoing relational violations , 1982, Cognitive Psychology.

[7]  P. Johnson-Laird,et al.  Mental Models: Towards a Cognitive Science of Language, Inference, and Consciousness , 1985 .

[8]  D. Gentner Mental Models , 1983 .

[9]  N. Fisher,et al.  A correlation coefficient for circular data , 1983 .

[10]  中園 薫 A Qualitative Physics Based on Confluences , 1986 .

[11]  J. Freyd Dynamic mental representations. , 1987, Psychological review.

[12]  Leonard Talmy,et al.  Force Dynamics in Language and Cognition , 1987, Cogn. Sci..

[13]  D R Proffitt,et al.  Understanding collision dynamics. , 1989, Journal of experimental psychology. Human perception and performance.

[14]  D. Proffitt,et al.  Understanding wheel dynamics , 1990, Cognitive Psychology.

[15]  D. Baraff Physically Based Modeling Rigid Body Simulation , 1992 .

[16]  E. Spelke,et al.  Origins of knowledge. , 1992, Psychological review.

[17]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[18]  M. Tomasello The Cultural Origins of Human Cognition , 2000 .

[19]  J. Stevenson The cultural origins of human cognition , 2001 .

[20]  M. Hegarty Mechanical reasoning by mental simulation , 2004, Trends in Cognitive Sciences.

[21]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[22]  R. Baillargeon The Acquisition of Physical Knowledge in Infancy: A Summary in Eight Lessons , 2007 .

[23]  Heinrich H. Bülthoff,et al.  Perception and prediction of simple object interactions , 2007, APGV.

[24]  Dirk P. Kroese,et al.  Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.

[25]  F. Lacquaniti,et al.  Visuo-motor coordination and internal models for object interception , 2009, Experimental Brain Research.

[26]  Kenneth D. Forbus Handbook of Knowledge Representation Edited Qualitative Modeling , 2022 .

[27]  George A. Alvarez,et al.  Explaining human multiple object tracking as resource-constrained approximate inference in a dynamic probabilistic model , 2009, NIPS.

[28]  Kenneth D. Forbus Qualitative modeling. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[29]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[30]  Edward Vul,et al.  Pure Reasoning in 12-Month-Old Infants as Probabilistic Inference , 2011, Science.

[31]  Joshua B. Tenenbaum,et al.  Noisy Newtons: Unifying process and dependency accounts of causal attribution , 2012, CogSci.

[32]  Kevin A. Smith,et al.  Sources of uncertainty in intuitive physics , 2012, CogSci.

[33]  A. Holcombe,et al.  Splitting attention reduces temporal resolution from 7 Hz for tracking one object to <3 Hz when tracking three. , 2013, Journal of vision.

[34]  Kevin A. Smith,et al.  Consistent physics underlying ballistic motion prediction , 2013, CogSci.

[35]  Vikash K. Mansinghka,et al.  Reconciling intuitive physics and Newtonian mechanics for colliding objects. , 2013, Psychological review.

[36]  Manish Singh,et al.  Visual perception of the physical stability of asymmetric three-dimensional objects. , 2013, Journal of vision.

[37]  Thomas L. Griffiths,et al.  One and Done? Optimal Decisions From Very Few Samples , 2014, Cogn. Sci..