Dynamic Games of Asymmetric Information for Deceptive Autonomous Vehicles.

This paper studies rational and persistent deception among intelligent robots to enhance the security and operation efficiency of autonomous vehicles. We present an N-person K-stage nonzero-sum game with an asymmetric information structure where each robot's private information is modeled as a random variable or its type. The deception is persistent as each robot's private type remains unknown to other robots for all stages. The deception is rational as robots aim to achieve their deception goals at minimum cost. Each robot forms a belief on others' types based on state observations and updates it via the Bayesian rule. The level-t perfect Bayesian Nash equilibrium is a natural solution concept of the dynamic game. It demonstrates the sequential rationality of the agents, maintains the belief consistency with the observations and strategies, and provides a reliable prediction of the outcome of the deception game. In particular, in the linear-quadratic setting, we derive a set of extended Riccati equations, obtain the explicit form of the affine state-feedback control, and develop an online computational algorithm. We define the concepts of deceivability and the price of deception to evaluate the strategy design and assess the deception outcome. We investigate a case study of deceptive pursuit-evasion games and use numerical experiments to corroborate the results.

[1]  Ronald C. Arkin,et al.  A Taxonomy of Robot Deception and Its Benefits in HRI , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[2]  Tamer Basar,et al.  Common Information based Markov Perfect Equilibria for Linear-Gaussian Games with Asymmetric Information , 2014, SIAM J. Control. Optim..

[3]  Jose B. Cruz,et al.  Defending an Asset: A Linear Quadratic Game Approach , 2011, IEEE Transactions on Aerospace and Electronic Systems.

[4]  Quanyan Zhu,et al.  A hybrid stochastic game for secure control of cyber-physical systems , 2018, Autom..

[5]  Ronald C. Arkin,et al.  Other-oriented robot deception: A computational approach for deceptive action generation to benefit the mark , 2014, 2014 IEEE International Conference on Robotics and Biomimetics (ROBIO 2014).

[6]  A. Doucet,et al.  State estimation of jump Markov linear systems via stochastic sampling algorithms , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[7]  Vrizlynn L. L. Thing,et al.  Autonomous Vehicle Security: A Taxonomy of Attacks and Defences , 2016, 2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData).

[8]  Benjamin J. Raphael,et al.  Network propagation: a universal amplifier of genetic associations , 2017, Nature Reviews Genetics.

[9]  Quanyan Zhu,et al.  Dynamic Bayesian Games for Adversarial and Defensive Cyber Deception , 2018, Autonomous Cyber Deception.

[10]  Ruochi Zhang,et al.  Stealthy Control Signal Attacks in Linear Quadratic Gaussian Control Systems: Detectability Reward Tradeoff , 2017, IEEE Transactions on Information Forensics and Security.

[11]  Aldo Morales,et al.  An Adaptive Markov Process for Robot Deception , 2019, ArXiv.

[12]  Quanyan Zhu,et al.  A moving-horizon hybrid stochastic game for secure control of cyber-physical systems , 2014, 53rd IEEE Conference on Decision and Control.

[13]  M. Athans,et al.  Solution of some nonclassical LQG stochastic decision problems , 1974 .

[14]  Ashutosh Nayyar,et al.  The Common-Information Approach to Decentralized Stochastic Control , 2014 .

[15]  Quanyan Zhu,et al.  Dynamic games for secure and resilient control system design , 2019, National science review.

[16]  Insup Lee,et al.  Attack resilient state estimation for autonomous robotic systems , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Ufuk Topcu,et al.  Deception in Optimal Control , 2018, 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[18]  M. Ani Hsieh,et al.  Cooperative transport of a buoyant load: A differential geometric approach , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Vijay Kumar,et al.  Dynamics, Control and Planning for Cooperative Manipulation of Payloads Suspended by Cables from Multiple Quadrotor Robots , 2013, Robotics: Science and Systems.

[20]  Ronald C. Arkin,et al.  Acting Deceptively: Providing Robots with the Capacity for Deception , 2011, Int. J. Soc. Robotics.

[21]  Eric Maskin,et al.  Markov Perfect Equilibrium: I. Observable Actions , 2001, J. Econ. Theory.

[22]  Yi Ouyang,et al.  Dynamic Games With Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition , 2015, IEEE Transactions on Automatic Control.

[23]  Quanyan Zhu,et al.  Analysis and Computation of Adaptive Defense Strategies Against Advanced Persistent Threats for Cyber-Physical Systems , 2018, GameSec.

[24]  Quanyan Zhu,et al.  Deception by Design: Evidence-Based Signaling Games for Network Defense , 2015, WEIS.

[25]  Quanyan Zhu,et al.  A Dynamic Games Approach to Proactive Defense Strategies against Advanced Persistent Threats in Cyber-Physical Systems , 2019, Comput. Secur..

[26]  Jason L. Speyer,et al.  On the LQG Game With Nonclassical Information Pattern Using a Direct Solution Method , 2020, IEEE Transactions on Automatic Control.

[27]  R. P. Marques,et al.  Discrete-Time Markov Jump Linear Systems , 2004, IEEE Transactions on Automatic Control.

[28]  Arnaud Doucet,et al.  Particle filters for state estimation of jump Markov linear systems , 2001, IEEE Trans. Signal Process..

[29]  Peter G. Bennett,et al.  Hypergames: Developing a model of conflict☆ , 1980 .

[30]  David Fridovich-Keil,et al.  An Iterative Quadratic Method for General-Sum Differential Games with Feedback Linearizable Dynamics , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[31]  S Bhattacharya,et al.  Game-theoretic analysis of an aerial jamming attack on a UAV communication network , 2010, Proceedings of the 2010 American Control Conference.

[32]  Achilleas Anastasopoulos,et al.  Signaling equilibria for dynamic LQG games with asymmetric information , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[33]  John C. Harsanyi,et al.  Games with Incomplete Information Played by "Bayesian" Players, I-III: Part I. The Basic Model& , 2004, Manag. Sci..

[34]  Siddhartha S. Srinivasa,et al.  An Analysis of Deceptive Robot Motion , 2014, Robotics: Science and Systems.

[35]  Branislav Bosanský,et al.  Manipulating Adversary's Belief: A Dynamic Game Approach to Deception by Design for Proactive Network Security , 2017, GameSec.

[36]  Quanyan Zhu,et al.  Adaptive Strategic Cyber Defense for Advanced Persistent Threats in Critical Infrastructure Networks , 2018, PERV.

[37]  D. Smith,et al.  Why We Lie: The Evolutionary Roots of Deception and the Unconscious Mind , 2004 .

[38]  John Lygeros,et al.  A Noncooperative Game Approach to Autonomous Racing , 2017, IEEE Transactions on Control Systems Technology.

[39]  Quanyan Zhu,et al.  Finite-horizon semi-Markov game for time-sensitive attack response and probabilistic risk assessment in nuclear power plants , 2020, Reliab. Eng. Syst. Saf..

[40]  J. B. Cruz,et al.  Moving horizon Nash strategies for a military air operation , 2002 .

[41]  Roberta L. Klatzky,et al.  Visual-Feedback Distortion in a Robotic Rehabilitation Environment , 2006, Proceedings of the IEEE.