论文信息 - On a Formal Model of Safe and Scalable Self-driving Cars

On a Formal Model of Safe and Scalable Self-driving Cars

In recent years, car makers and tech companies have been racing towards self driving cars. It seems that the main parameter in this race is who will have the first car on the road. The goal of this paper is to add to the equation two additional crucial parameters. The first is standardization of safety assurance --- what are the minimal requirements that every self-driving car must satisfy, and how can we verify these requirements. The second parameter is scalability --- engineering solutions that lead to unleashed costs will not scale to millions of cars, which will push interest in this field into a niche academic corner, and drive the entire field into a "winter of autonomous driving". In the first part of the paper we propose a white-box, interpretable, mathematical model for safety assurance, which we call Responsibility-Sensitive Safety (RSS). In the second part we describe a design of a system that adheres to our safety assurance requirements and is scalable to millions of cars.

[1] Amnon Shashua,et al. Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving , 2016, ArXiv.

[2] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[3] Siuming Lo,et al. Discrete Element Crowd Model for Pedestrian Evacuation Through an Exit , 2015 .

[4] Richard Bellman,et al. Introduction to the mathematical theory of control processes , 1967 .

[5] L. C. Baird,et al. Reinforcement learning in continuous time: advantage updating , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[6] Leslie G. Valiant,et al. A theory of the learnable , 1984, STOC '84.

[7] R Bellman,et al. DYNAMIC PROGRAMMING AND LAGRANGE MULTIPLIERS. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[8] Pete L. Clark,et al. The Instructor’s Guide to Real Induction , 2012, Mathematics Magazine.

[9] John L. Casti. Introduction to the Mathematical Theory of Control Processes, Volume I: Linear Equations and Quadratic Criteria, Volume II: Nonlinear Processes , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[10] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..