Learning Self-Awareness for Autonomous Vehicles: Exploring Multisensory Incremental Models

The technology for autonomous vehicles is close to replacing human drivers by artificial systems endowed with high-level decision-making capabilities. In this regard, systems must learn about the usual vehicle's behavior to predict imminent difficulties before they happen. An autonomous agent should be capable of continuously interacting with multi-modal dynamic environments while learning unseen novel concepts. Such environments are not often available to train the agent on it, so the agent should have an understanding of its own capacities and limitations. This understanding is usually called self-awareness. This paper proposes a multi-modal self-awareness modeling of signals coming from different sources. This paper shows how different machine learning techniques can be used under a generic framework to learn single modality models by using Dynamic Bayesian Networks. In the presented case, a probabilistic switching model and a bank of generative adversarial networks are employed to model a vehicle's positional and visual information respectively. Our results include experiments performed on a real vehicle, highlighting the potentiality of the proposed approach at detecting abnormalities in real scenarios.

[1]  Shaogang Gong,et al.  Global Behaviour Inference using Probabilistic Latent Semantic Analysis , 2008, BMVC.

[2]  Nicu Sebe,et al.  Plug-and-Play CNN for Crowd Motion Analysis: An Application in Abnormal Event Detection , 2016, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[3]  Aggelos K. Katsaggelos,et al.  Variational Bayesian Methods For Multimedia Problems , 2014, IEEE Transactions on Multimedia.

[4]  Alessandro Perina,et al.  Crowd motion monitoring using tracklet-based commotion measure , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[5]  Mohan M. Trivedi,et al.  A Survey of Vision-Based Trajectory Learning and Analysis for Surveillance , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[7]  Medhat Moussa,et al.  Deep Learning for Intelligent Transportation Systems: A Survey of Emerging Trends , 2020, IEEE Transactions on Intelligent Transportation Systems.

[8]  Patrick Jähnichen,et al.  Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Matthias Rauterberg,et al.  Dynamic representations for autonomous driving , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[10]  Andrea E. Martin,et al.  Language Processing as Cue Integration: Grounding the Psychology of Language in Perception and Neurophysiology , 2016, Front. Psychol..

[11]  D. Aldous Exchangeability and related topics , 1985 .

[12]  Mahdyar Ravanbakhsh,et al.  a Multi-Perspective Approach to Anomaly Detection for Self -Aware Embodied Agents , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Jianxin Wu,et al.  A Tube-and-Droplet-Based Approach for Representing and Analyzing Motion Trajectories , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Alessio Del Bue,et al.  Temporal Poselets for Collective Activity Detection and Recognition , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[15]  Gabriella Vigliocco,et al.  The Hidden Markov Topic Model: A Probabilistic Model of Semantic Representation , 2010, Top. Cogn. Sci..

[16]  Erik Steinmetz,et al.  Coordination of Cooperative Autonomous Vehicles: Toward safer and more efficient road transportation , 2016, IEEE Signal Processing Magazine.

[17]  Arnaud Doucet,et al.  Particle filters for state estimation of jump Markov linear systems , 2001, IEEE Trans. Signal Process..

[18]  École d'été de probabilités de Saint-Flour,et al.  École d'été de probabilités de Saint-Flour XIII - 1983 , 1985 .

[19]  Andreas Geiger,et al.  Understanding High-Level Semantics by Modeling Traffic Patterns , 2013, 2013 IEEE International Conference on Computer Vision.

[20]  Xian-Sheng Hua,et al.  Bridging the Semantic Gap via Functional Brain Imaging , 2012, IEEE Transactions on Multimedia.

[21]  Nicu Sebe,et al.  Self Paced Deep Learning for Weakly Supervised Object Detection , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Ramón Moreno,et al.  A machine learning based intelligent vision system for autonomous object detection and recognition , 2013, Applied Intelligence.

[24]  Bodo Rosenhahn,et al.  Multisensor-fusion for 3D full-body human motion capture , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Hamid R. Rabiee,et al.  Novel dataset for fine-grained abnormal behavior understanding in crowd , 2016, 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[26]  Lei Xu,et al.  A probabilistic model for track random irregularities in vehicle/track coupled dynamics , 2017 .

[27]  Mahmood Fathy,et al.  Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes , 2016, Comput. Vis. Image Underst..

[28]  Carlo S. Regazzoni,et al.  A Cognitive Control-Inspired Approach to Object Tracking , 2016, IEEE Transactions on Image Processing.

[29]  Chung-Lin Huang,et al.  Semantic analysis of soccer video using dynamic Bayesian network , 2006, IEEE Transactions on Multimedia.

[30]  Alessandro Perina,et al.  Abnormality Detection with Improved Histogram of Oriented Tracklets , 2015, ICIAP.

[31]  Jirí Vomlel,et al.  Exploiting Functional Dependence in Bayesian Network Inference , 2002, UAI.

[32]  Nicu Sebe,et al.  Abnormal Event Recognition in Crowd Environments , 2018 .

[33]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[34]  Hamid R. Rabiee,et al.  Detection and localization of crowd behavior using a novel tracklet-based model , 2018, Int. J. Mach. Learn. Cybern..

[35]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[36]  Nir Friedman,et al.  Probabilistic Graphical Models: Principles and Techniques - Adaptive Computation and Machine Learning , 2009 .

[37]  Rolf Ernst,et al.  Self-awareness in autonomous automotive systems , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[38]  Tao Tang,et al.  Big Data Analytics in Intelligent Transportation Systems: A Survey , 2019, IEEE Transactions on Intelligent Transportation Systems.

[39]  Michael I. Jordan,et al.  Bayesian Nonparametric Inference of Switching Dynamic Linear Models , 2010, IEEE Transactions on Signal Processing.

[40]  Carlo S. Regazzoni,et al.  Learning Multi-Modal Self-Awareness Models for Autonomous Vehicles from Human Driving , 2018, 2018 21st International Conference on Information Fusion (FUSION).

[41]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[42]  Carlo S. Regazzoni,et al.  Learning Switching Models for Abnormality Detection for Autonomous Driving , 2018, 2018 21st International Conference on Information Fusion (FUSION).

[43]  Nicu Sebe,et al.  Abnormal event detection in videos using generative adversarial nets , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[44]  Lucio Marcenaro,et al.  Learning Probabilistic Awareness Models for Detecting Abnormalities in Vehicle Motions , 2020, IEEE Transactions on Intelligent Transportation Systems.

[45]  Basam Musleh,et al.  Stereo Vision-based Local Occupancy Grid Map for Autonomous Navigation in ROS , 2016, VISIGRAPP.

[46]  Konstantinos N. Plataniotis,et al.  Smart Driver Monitoring: When Signal Processing Meets Human Factors: In the driver's seat , 2016, IEEE Signal Processing Magazine.

[47]  Léon Bottou,et al.  Towards Principled Methods for Training Generative Adversarial Networks , 2017, ICLR.

[48]  Wei Zhan,et al.  Probabilistic Prediction of Vehicle Semantic Intention and Motion , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[49]  Carlo S. Regazzoni,et al.  Online Nonparametric Bayesian Activity Mining and Analysis From Surveillance Video , 2016, IEEE Transactions on Image Processing.

[50]  Kazuya Takeda,et al.  Driver-Behavior Modeling Using On-Road Driving Data: A new application for behavior signal processing , 2016, IEEE Signal Processing Magazine.

[51]  Bo Chen,et al.  A Review of the Applications of Agent Technology in Traffic and Transportation Systems , 2010, IEEE Transactions on Intelligent Transportation Systems.

[52]  Alejandro Betancourt,et al.  Static force field representation of environments based on agents’ nonlinear motions , 2017, EURASIP Journal on Advances in Signal Processing.

[53]  Hossein Mousavi,et al.  Crowd behavior representation: an attribute-based approach , 2016, SpringerPlus.

[54]  Petros Daras,et al.  Search and Retrieval of Rich Media Objects Supporting Multiple Multimodal Queries , 2012, IEEE Transactions on Multimedia.

[55]  Carlo S. Regazzoni,et al.  Fast but Not Deep: Efficient Crowd Abnormality Detection with Local Binary Tracklets , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[56]  Nicu Sebe,et al.  Training Adversarial Discriminators for Cross-Channel Abnormal Event Detection in Crowds , 2017, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).