A Causal Model Approach to Dynamic Control

Acting effectively in the world requires learning and controlling dynamic systems, that is, systems involving feedback relations among continuous variables that vary in real time. We introduce a novel class of dynamic control environments using Ornstein-Uhlenbeck processes connected in causal Markov graphs that allow us to systematically test people’s ability to learn and control various dynamic systems. We find that performance varied across a range of test environments, roughly matching with complexity defined by a set of models trained on the task (an optimal model, a deep Reinforcement Learning agent, and a PID controller). The testbed of dynamic environments and class of models introduced in this paper lay the groundwork for the systematic study of people’s ability to control complex dynamic systems.

[1]  M. Osman Controlling uncertainty: a review of human behavior in complex dynamic environments. , 2010, Psychological bulletin.

[2]  George A. Alvarez,et al.  Explaining human multiple object tracking as resource-constrained approximate inference in a dynamic probabilistic model , 2009, NIPS.

[3]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[4]  H. Doughty Enlightenment Now: The Case for Reason, Science, Humanism, and Progress , 2019 .

[5]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[6]  Jessica B. Hamrick,et al.  psiTurk: An open-source framework for conducting replicable behavioral experiments online , 2016, Behavior research methods.

[7]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[8]  Vladimír Lacko,et al.  Planning of experiments for a nonautonomous ornstein-uhlenbeck process , 2012 .

[9]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[10]  D. Broadbent,et al.  On the Relationship between Task Performance and Associated Verbalizable Knowledge , 1984 .

[11]  G. Uhlenbeck,et al.  On the Theory of the Brownian Motion , 1930 .

[12]  D. Lagnado,et al.  Spontaneous Causal Learning While Controlling A Dynamic System , 2010 .

[13]  Bob Rehder,et al.  Causal Structure Learning with Continuous Variables in Continuous Time , 2018, CogSci.

[14]  Todd M. Gureckis,et al.  CUNY Academic , 2016 .

[15]  B. Brehmer,et al.  Micro-worlds and the circular relation between people and their environment , 2005 .

[16]  Bradley C. Love,et al.  Learning in Noise: Dynamic Decision-Making in a Variable Environment. , 2009, Journal of mathematical psychology.

[17]  Maarten Speekenbrink,et al.  Strategic exploration in human adaptive control , 2017, bioRxiv.

[18]  N. Shephard,et al.  Non‐Gaussian Ornstein–Uhlenbeck‐based models and some of their uses in financial economics , 2001 .

[19]  Michael J. Frank,et al.  A Control Theoretic Model of Adaptive Learning in Dynamic Environments , 2018, Journal of Cognitive Neuroscience.