Hybrid control and learning with coresets for autonomous vehicles

Modern autonomous systems such as driverless vehicles need to safely operate in a wide range of conditions. A potential solution is to employ a hybrid systems approach, where safety is guaranteed in each individual mode within the system. This offsets complexity and responsibility from the individual controllers onto the complexity of determining discrete mode transitions. In this work we propose an efficient framework based on recursive neural networks and coreset data summarization to learn the transitions between an arbitrary number of controller modes that can have arbitrary complexity. Our approach allows us to efficiently gather annotation data from the large-scale datasets that are required to train such hybrid nonlinear systems to be safe under all operating conditions, favoring underexplored parts of the data. We demonstrate the construction of the embedding, and efficient detection of switching points for autonomous and non-autonomous car data. We further show how our approach enables efficient sampling of training data, to further improve either our embedding or the controllers.

[1]  Alexandre M. Bayen,et al.  A time-dependent Hamilton-Jacobi formulation of reachable sets for continuous dynamic games , 2005, IEEE Transactions on Automatic Control.

[2]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Jonathan P. How,et al.  Duckietown: An open, inexpensive and flexible platform for autonomy education and research , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[5]  Kevin P. Murphy Hidden semi-Markov models ( HSMMs ) , 2002 .

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  Yang Gao,et al.  End-to-End Learning of Driving Models from Large-Scale Video Datasets , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Christopher G. Atkeson,et al.  CHAMP: Changepoint Detection Using Approximate Model Parameters , 2014 .

[9]  Hajime Asama,et al.  Inevitable collision states. A step towards safer robots? , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[10]  Ramesh Raskar,et al.  Coreset-Based Adaptive Tracking , 2015, ArXiv.

[11]  Sergey Levine,et al.  Generalizing Skills with Semi-Supervised Reinforcement Learning , 2016, ICLR.

[12]  John Lygeros,et al.  Multiobjective Hybrid Controller Synthesis , 1997, HART.

[13]  Claire J. Tomlin,et al.  Design of guaranteed safe maneuvers using reachable sets: Autonomous quadrotor aerobatics in theory and practice , 2010, 2010 IEEE International Conference on Robotics and Automation.

[14]  Steven E Shladover,et al.  OVERVIEW OF PLATOONING SYSTEMS , 2012 .

[15]  Xin Zhang,et al.  End to End Learning for Self-Driving Cars , 2016, ArXiv.

[16]  Domitilla Del Vecchio,et al.  Design of a lane departure driver-assist system under safety specifications , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[17]  Michael I. Jordan,et al.  Nonparametric Bayesian Learning of Switching Linear Dynamical Systems , 2008, NIPS.

[18]  Dean Pomerleau,et al.  ALVINN, an autonomous land vehicle in a neural network , 2015 .

[19]  Christopher J. C. Burges,et al.  Dimension Reduction: A Guided Tour , 2010, Found. Trends Mach. Learn..

[20]  Jeff M. Phillips,et al.  Coresets and Sketches , 2016, ArXiv.

[21]  Claire J. Tomlin,et al.  Mobile Sensor Network Control Using Mutual Information Methods and Particle Filters , 2010, IEEE Transactions on Automatic Control.

[22]  Matthew J. Johnson,et al.  Bayesian nonparametric hidden semi-Markov models , 2012, J. Mach. Learn. Res..

[23]  Guy Rosman,et al.  Machine learning and coresets for automated real-time video segmentation of laparoscopic and robot-assisted surgery , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Domitilla Del Vecchio,et al.  Design of Driver-Assist Systems Under Probabilistic Safety Specifications Near Stop Signs , 2016, IEEE Transactions on Automation Science and Engineering.

[25]  John W. Fisher,et al.  Coresets for k-Segmentation of Streaming Data , 2014, NIPS.

[26]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[27]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[28]  Michael Athans,et al.  The stochastic control of the F-8C aircraft using a multiple model adaptive control (MMAC) method--Part I: Equilibrium flight , 1977 .

[29]  John W. Fisher,et al.  Coresets for visual summarization with applications to loop closure , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[30]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[31]  John Lygeros,et al.  Hybrid Systems: Modeling, Analysis and Control , 2008 .

[32]  Paul A. Beardsley,et al.  Shared control of autonomous vehicles based on velocity space optimization , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[33]  Scott Kuindersma,et al.  An Architecture for Online Affordance‐based Perception and Whole‐body Planning , 2015, J. Field Robotics.

[34]  Christoph Goller,et al.  Learning task-dependent distributed representations by backpropagation through structure , 1996, Proceedings of International Conference on Neural Networks (ICNN'96).

[35]  Kasturi R. Varadarajan,et al.  Geometric Approximation via Coresets , 2007 .