An Improved LSTM Model for Behavior Recognition of Intelligent Vehicles

Long Short-Term Memory (LSTM) neural network has been widely used in many applications, but its application in classification of vehicle movement patterns is still limited. In this paper, LSTM is applied to the vehicle behavior recognition problem to identify the left turn, right turn and straight behavior of the vehicle at the intersection. On the basis of the traditional LSTM classification model, this paper transversely merges the input features and then inputs into a LSTM cell to get an improved model. The improved model can make full use of the input information and reduce unnecessary calculations, and the output of a single LSTM cell model can filter out interference information and retain important information, so it has better classification effect and faster training speed. The experimental results show that the proposed improved LSTM network classification model in this paper has a significant improvement in recognition accuracy and training speed compared with the improved model, the accuracy is increased by 1.6%, and the training time is reduced by 3.96 s. In addition, this paper also applies the improved model to regression problems, emotion classification and handwritten digit recognition and all of them have a good improvement effect, which improves the applicability and stability of LSTM in classification problems and provides a new way to deal with classification problems.

[1]  Mohamed Cheriet,et al.  Feature Set Evaluation for Offline Handwriting Recognition Systems: Application to the Recurrent Neural Network Model , 2016, IEEE Transactions on Cybernetics.

[2]  Ashraf Y. Maghari,et al.  A Comparative Study on Handwriting Digit Recognition Using Neural Networks , 2017, 2017 International Conference on Promising Electronic Technologies (ICPET).

[3]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[4]  Pan Zhao,et al.  A Scenario-Adaptive Driving Behavior Prediction Approach to Urban Autonomous Driving , 2017 .

[5]  Praveen Edara,et al.  Modeling Mandatory Lane Changing Using Bayes Classifier and Decision Trees , 2014, IEEE Transactions on Intelligent Transportation Systems.

[6]  T. Munich,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[7]  Ioannis Iossifidis,et al.  Autonomous driving: A comparison of machine learning techniques by means of the prediction of lane change behavior , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[8]  Ghulam Hussain,et al.  Estimating Driver Mandatory Lane Change Behavior on a Multi lane Freeway , 2009 .

[9]  Hao Wang,et al.  Traffic Speed Prediction Under Non-Recurrent Congestion: Based on LSTM Method and BeiDou Navigation Satellite System Data , 2019, IEEE Intelligent Transportation Systems Magazine.

[10]  Thenkurussi Kesavadas,et al.  iART: Learning From Demonstration for Assisted Robotic Therapy Using LSTM , 2020, IEEE Robotics and Automation Letters.

[11]  Chafic Mokbel,et al.  Dynamic and Contextual Information in HMM Modeling for Handwritten Word Recognition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Phil Blunsom,et al.  Recurrent Continuous Translation Models , 2013, EMNLP.

[13]  Rahul Chauhan,et al.  Recognition of Handwritten Digits Using DNN, CNN, and RNN , 2018 .

[14]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[15]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[16]  Moshe Ben-Akiva,et al.  Lane Selection Model for Urban Intersections , 2008 .

[17]  Holger R. Maier,et al.  The effect of internal parameters and geometry on the performance of back-propagation neural networks: an empirical study , 1998 .

[18]  Pan He,et al.  Reading Scene Text in Deep Convolutional Sequences , 2015, AAAI.

[19]  Zhizheng Wu,et al.  Merlin: An Open Source Neural Network Speech Synthesis System , 2016, SSW.

[20]  Dongsuk Kum,et al.  The multilayer perceptron approach to lateral motion prediction of surrounding vehicles for autonomous vehicles , 2016, 2016 IEEE Intelligent Vehicles Symposium (IV).

[21]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[22]  Yue Zhang,et al.  Gated Neural Networks for Targeted Sentiment Analysis , 2016, AAAI.

[23]  Jürgen Schmidhuber,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[24]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[25]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[26]  Thierry Paquet,et al.  A Unified Multilingual Handwriting Recognition System using multigrams sub-lexical units , 2018, Pattern Recognit. Lett..

[27]  P. G. Gipps,et al.  A MODEL FOR THE STRUCTURE OF LANE-CHANGING DECISIONS , 1986 .

[28]  Ruimin Hu,et al.  Action recognition with temporal scale-invariant deep learning framework , 2017, China Communications.

[29]  Heiga Zen,et al.  Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[30]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[31]  Luca Maria Gambardella,et al.  Deep, Big, Simple Neural Nets for Handwritten Digit Recognition , 2010, Neural Computation.

[32]  U. Ravi Babu,et al.  Handwritten Digit Recognition Using K-Nearest Neighbour Classifier , 2014, 2014 World Congress on Computing and Communication Technologies.

[33]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[34]  Masayoshi Tomizuka,et al.  Interaction-aware Multi-agent Tracking and Probabilistic Behavior Prediction via Adversarial Learning , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[35]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[36]  Hermann Ney,et al.  LSTM Neural Networks for Language Modeling , 2012, INTERSPEECH.

[37]  Sepp Hochreiter,et al.  The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..