An information-theoretic on-line update principle for perception-action coupling

Inspired by findings of sensorimotor coupling in humans and animals, research on robotic systems has recently shown growing interest in the interaction between action and perception [1]. Here we model perception and action as two serial information channels, each with limited information-processing capacity. Following [2], we formulate a constrained optimization problem that maximizes utility subject to the capacity limits of the two channels. The solution is an optimal perceptual channel and an optimal action channel, coupled such that perceptual information is optimized with respect to downstream processing in the action module. The main novelty of this study is an online optimization procedure for finding bounded-optimal perception and action channels in parameterized serial perception-action systems. In particular, we implement the perceptual channel as a multi-layer neural network and the action channel as a multinomial distribution. We illustrate our method in a NAO robot simulator with a simplified cup-lifting task.
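The abstract does not state the objective explicitly. As a minimal sketch, assume it takes the free-energy form common in the information-theoretic bounded-rationality literature: expected utility minus the mutual information of each channel, weighted by inverse-temperature parameters. The names `serial_objective`, `beta1`, and `beta2`, the discrete state spaces, and the toy utility are illustrative assumptions, not taken from the paper.

```python
import math


def mutual_information(p_in, channel):
    """Mutual information (in nats) between input and output of a discrete channel.

    p_in[i] is the input marginal; channel[i][o] is p(o | i).
    """
    n_out = len(channel[0])
    p_out = [sum(p_in[i] * channel[i][o] for i in range(len(p_in)))
             for o in range(n_out)]
    mi = 0.0
    for i, p_i in enumerate(p_in):
        for o in range(n_out):
            joint = p_i * channel[i][o]
            if joint > 0.0:  # zero-probability terms contribute nothing
                mi += joint * math.log(channel[i][o] / p_out[o])
    return mi


def serial_objective(p_w, p_x_given_w, p_a_given_x, utility, beta1, beta2):
    """Assumed objective: E[U(w, a)] - I(W;X) / beta1 - I(X;A) / beta2.

    World state w passes through a perceptual channel p(x | w), and the
    percept x passes through an action channel p(a | x).
    """
    n_w, n_x, n_a = len(p_w), len(p_a_given_x), len(p_a_given_x[0])
    # Marginal over percepts, needed as the input marginal of the action channel.
    p_x = [sum(p_w[w] * p_x_given_w[w][x] for w in range(n_w))
           for x in range(n_x)]
    expected_utility = sum(
        p_w[w] * p_x_given_w[w][x] * p_a_given_x[x][a] * utility[w][a]
        for w in range(n_w) for x in range(n_x) for a in range(n_a)
    )
    perception_cost = mutual_information(p_w, p_x_given_w) / beta1
    action_cost = mutual_information(p_x, p_a_given_x) / beta2
    return expected_utility - perception_cost - action_cost


# Toy problem (hypothetical): two world states, utility 1 if the action matches.
p_w = [0.5, 0.5]
utility = [[1.0, 0.0], [0.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]        # lossless channel
uninformative = [[0.5, 0.5], [0.5, 0.5]]   # channel that ignores its input

precise = serial_objective(p_w, identity, identity, utility, 2.0, 2.0)
cheap = serial_objective(p_w, uninformative, uninformative, utility, 2.0, 2.0)
print(precise, cheap)
```

Under this assumed objective, tight capacity limits (small `beta1`, `beta2`) make the uninformative channels score higher than the lossless ones, while loose limits reverse the ordering. This is the trade-off that a bounded-optimal perception-action pair has to balance.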

[1]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach, 1995.

[2]  Oliver Brock,et al.  Interactive Perception: Leveraging Action in Perception and Perception in Action , 2016, IEEE Transactions on Robotics.

[3]  Stuart J. Russell.  Rationality and Intelligence, 1995, IJCAI.

[4]  Oliver Kroemer,et al.  Learning Visual Representations for Interactive Systems , 2009, ISRR.

[5]  Naftali Tishby,et al.  The information bottleneck method , 2000, ArXiv.

[6]  Daniel A. Braun,et al.  Monte Carlo methods for exact & efficient solution of the generalized optimality equations , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[7]  P. Todd,et al.  Simple Heuristics That Make Us Smart, 1999.

[8]  Satinder Singh,et al.  Computational Rationality: Linking Mechanism and Behavior Through Bounded Utility Maximization , 2014, Top. Cogn. Sci.

[9]  Pierre Priouret,et al.  Adaptive Algorithms and Stochastic Approximations , 1990, Applications of Mathematics.

[10]  Daniel A. Braun,et al.  Thermodynamics as a theory of decision-making with information-processing costs , 2012, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[11]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res.

[12]  Dieter Fox,et al.  Interactive singulation of objects from a pile , 2012, 2012 IEEE International Conference on Robotics and Automation.

[13]  Oliver Brock,et al.  Learning state representations with robotic priors , 2015, Auton. Robots.

[14]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[15]  Ronald C. Arkin,et al.  Behavior-based Robotics, 1998.

[16]  Raymond W. Yeung,et al.  Information Theory and Network Coding, 2008.

[17]  Jitendra Malik,et al.  Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.

[18]  Daniel A. Braun,et al.  Bounded Rational Decision-Making in Feedforward Neural Networks , 2016, UAI.

[19]  H. Kappen.  Linear theory for control of nonlinear stochastic systems, 2004, Physical Review Letters.

[20]  Rolf Pfeifer,et al.  How the body shapes the way we think - a new view on intelligence, 2006.

[21]  Thomas M. Cover,et al.  Elements of Information Theory, 2005.

[22]  Daniel A. Braun,et al.  Free Energy and the Generalized Optimality Equations for Sequential Decision Making , 2012, EWRL 2012.

[23]  J. Schreiber.  Foundations of Statistics, 2016.

[24]  Richard E. Blahut,et al.  Computation of channel capacity and rate-distortion functions , 1972, IEEE Trans. Inf. Theory.

[25]  F. Ramsey.  Truth and Probability, 2016.

[26]  Jordi Grau-Moya,et al.  Bounded Rationality, Abstraction, and Hierarchical Decision-Making: An Information-Theoretic Optimality Principle , 2015, Front. Robot. AI.

[27]  Stefan Schaal,et al.  Path integral control and bounded rationality , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[28]  Suguru Arimoto,et al.  An algorithm for computing the capacity of arbitrary discrete memoryless channels , 1972, IEEE Trans. Inf. Theory.

[29]  B. Jones.  Bounded Rationality, 1999.

[30]  Aaron D. Wyner,et al.  Coding Theorems for a Discrete Source With a Fidelity Criterion, Institute of Radio Engineers, International Convention Record, vol. 7, 1959, 1993.

[31]  M. Levine Empagliflozin for Type 2 Diabetes Mellitus: An Overview of Phase 3 Clinical Trials , 2017, Current diabetes reviews.

[32]  Gaurav S. Sukhatme,et al.  Using manipulation primitives for brick sorting in clutter , 2012, 2012 IEEE International Conference on Robotics and Automation.

[33]  Daniel A. Braun,et al.  A conversion between utility and information , 2009, AGI 2010.

[34]  Emanuel Todorov,et al.  Linearly-solvable Markov decision problems , 2006, NIPS.

[35]  D. Kahneman.  Maps of Bounded Rationality: Psychology for Behavioral Economics, 2003.

[36]  E. Rowland.  Theory of Games and Economic Behavior, 1946, Nature.

[37]  Daniel A. Braun,et al.  Information-Theoretic Bounded Rationality and ε-Optimality , 2014, Entropy.

[38]  Giorgio Metta,et al.  Towards manipulation-driven vision , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[40]  Oliver Kroemer,et al.  Probabilistic Segmentation and Targeted Exploration of Objects in Cluttered Environments , 2014, IEEE Transactions on Robotics.

[41]  Oliver Kroemer,et al.  Learning visual representations for perception-action systems , 2011, Int. J. Robotics Res.

[42]  Yasemin Altun,et al.  Relative Entropy Policy Search, 2010.

[43]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res.

[44]  Richard L. Lewis,et al.  Rational adaptation under task and processing constraints: implications for testing theories of cognition and action, 2009, Psychological Review.