An information-theoretic on-line update principle for perception-action coupling
暂无分享,去创建一个
Daniel A. Braun | Zhen Peng | Felix Leibfried | Tim Genewein | D. Braun | Tim Genewein | Felix Leibfried | Zhen Peng
[1] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .
[2] Oliver Brock,et al. Interactive Perception: Leveraging Action in Perception and Perception in Action , 2016, IEEE Transactions on Robotics.
[3] Stuart J. Russell. Rationality and Intelligence , 1995, IJCAI.
[4] Oliver Kroemer,et al. Learning Visual Representations for Interactive Systems , 2009, ISRR.
[5] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.
[6] Daniel A. Braun,et al. Monte Carlo methods for exact & efficient solution of the generalized optimality equations , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).
[7] P. Todd,et al. Simple Heuristics That Make Us Smart , 1999 .
[8] Satinder Singh,et al. Computational Rationality: Linking Mechanism and Behavior Through Bounded Utility Maximization , 2014, Top. Cogn. Sci..
[9] Pierre Priouret,et al. Adaptive Algorithms and Stochastic Approximations , 1990, Applications of Mathematics.
[10] Daniel A. Braun,et al. Thermodynamics as a theory of decision-making with information-processing costs , 2012, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.
[11] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..
[12] Dieter Fox,et al. Interactive singulation of objects from a pile , 2012, 2012 IEEE International Conference on Robotics and Automation.
[13] Oliver Brock,et al. Learning state representations with robotic priors , 2015, Auton. Robots.
[14] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[15] Ronald C. Arkin,et al. An Behavior-based Robotics , 1998 .
[16] Raymond W. Yeung,et al. Information Theory and Network Coding , 2008 .
[17] Jitendra Malik,et al. Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.
[18] Daniel A. Braun,et al. Bounded Rational Decision-Making in Feedforward Neural Networks , 2016, UAI.
[19] H. Kappen. Linear theory for control of nonlinear stochastic systems. , 2004, Physical review letters.
[20] Rolf Pfeifer,et al. How the body shapes the way we think - a new view on intelligence , 2006 .
[21] Thomas M. Cover,et al. Elements of Information Theory , 2005 .
[22] Daniel A. Braun,et al. Free Energy and the Generalized Optimality Equations for Sequential Decision Making , 2012, EWRL 2012.
[23] J. Schreiber. Foundations Of Statistics , 2016 .
[24] Richard E. Blahut,et al. Computation of channel capacity and rate-distortion functions , 1972, IEEE Trans. Inf. Theory.
[25] F. Ramsey. Truth and Probability , 2016 .
[26] Jordi Grau-Moya,et al. Bounded Rationality, Abstraction, and Hierarchical Decision-Making: An Information-Theoretic Optimality Principle , 2015, Front. Robot. AI.
[27] Stefan Schaal,et al. Path integral control and bounded rationality , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[28] Suguru Arimoto,et al. An algorithm for computing the capacity of arbitrary discrete memoryless channels , 1972, IEEE Trans. Inf. Theory.
[29] B. Jones. BOUNDED RATIONALITY , 1999 .
[30] Aaron D. Wyner,et al. Coding Theorems for a Discrete Source With a Fidelity CriterionInstitute of Radio Engineers, International Convention Record, vol. 7, 1959. , 1993 .
[31] M. Levine. Empagliflozin for Type 2 Diabetes Mellitus: An Overview of Phase 3 Clinical Trials , 2017, Current diabetes reviews.
[32] Gaurav S. Sukhatme,et al. Using manipulation primitives for brick sorting in clutter , 2012, 2012 IEEE International Conference on Robotics and Automation.
[33] Daniel A. Braun,et al. A conversion between utility and information , 2009, AGI 2010.
[34] Emanuel Todorov,et al. Linearly-solvable Markov decision problems , 2006, NIPS.
[35] D. Kahneman. Maps of Bounded Rationality: Psychology for Behavioral Economics , 2003 .
[36] E. Rowland. Theory of Games and Economic Behavior , 1946, Nature.
[37] Daniel A. Braun,et al. Information-Theoretic Bounded Rationality and ε-Optimality , 2014, Entropy.
[38] Giorgio Metta,et al. Towards manipulation-driven vision , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.
[39] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[40] Oliver Kroemer,et al. Probabilistic Segmentation and Targeted Exploration of Objects in Cluttered Environments , 2014, IEEE Transactions on Robotics.
[41] Oliver Kroemer,et al. Learning visual representations for perception-action systems , 2011, Int. J. Robotics Res..
[42] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[43] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[44] Richard L. Lewis,et al. Rational adaptation under task and processing constraints: implications for testing theories of cognition and action. , 2009, Psychological review.