Gait-Based Human Identification by Combining Shallow Convolutional Neural Network-Stacked Long Short-Term Memory and Deep Convolutional Neural Network

Human identification using camera-based surveillance systems is a challenging research topic, especially in cases where the human face is not visible to cameras and/or when humans captured on cameras have no clear visual identity owing to environments with low-illumination. With the development of deep learning algorithms, studies that are based on the human gait using convolutional neural networks (CNNs) and long short-term memory (LSTM) have achieved promising performance for human identification. However, CNN and LSTM-based methods have the limitation of having higher loss of temporal and spatial information, respectively. In our approach, we use shallow CNN stacked with LSTM and deep CNN followed by score fusion to capture more spatial and temporal features. In addition, there have been a few studies regarding gait-based human identification based on the front and back view images of humans captured in low-illumination environments. This makes it difficult to extract conventional features, such as skeleton joints, cycle, cadence, and the lengths of walking strides. To overcome these problems, we designed our method considering the front and back view images captured in both high- and low-illumination environments. The experimental results obtained using a self-collected database and the open database of the institute of automation Chinese academy of sciences gait dataset C show that the proposed method outperforms previous methods.

[1]  Trevor Darrell,et al.  Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Michael W. Whittle,et al.  Gait Analysis: An Introduction , 1986 .

[3]  Rafael C. González,et al.  Digital image processing, 3rd Edition , 2008 .

[4]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Björn Ommer,et al.  LSTM Self-Supervision for Detailed Behavior Analysis , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Qiang Wu,et al.  Speed-invariant gait recognition based on Procrustes Shape Analysis using higher-order shape configuration , 2011, 2011 18th IEEE International Conference on Image Processing.

[8]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[9]  Euntai Kim,et al.  A gait phase classifier using a recurrent neural network , 2015 .

[10]  Xiaohui Xie,et al.  Co-Occurrence Feature Learning for Skeleton Based Action Recognition Using Regularized Deep LSTM Networks , 2016, AAAI.

[11]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Kang Ryoung Park,et al.  Body-movement-based human identification using convolutional neural network , 2018, Expert Syst. Appl..

[16]  Amit K. Roy-Chowdhury,et al.  GAIT-BASED HUMAN IDENTIFICATION FROM A MONOCULAR VIDEO SEQUENCE , 2003 .

[17]  Xiaolin Hu,et al.  Joint Training of Cascaded CNN for Face Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[19]  Yu Liu,et al.  Quality Aware Network for Set to Set Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Xiaogang Wang,et al.  A Comprehensive Study on Cross-View Gait Based Human Identification with Deep CNNs , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Hagen Soltau,et al.  Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition , 2016, INTERSPEECH.

[23]  Lin Wu,et al.  Convolutional LSTM Networks for Video-based Person Re-identification , 2016 .

[24]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Yang Feng,et al.  Learning effective Gait features using LSTM , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[26]  Tieniu Tan,et al.  Efficient Night Gait Recognition Based on Template Matching , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[27]  Thomas Wolf,et al.  Multi-view gait recognition using 3D convolutional neural networks , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[28]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[29]  Junyu Dong,et al.  Dual channel LSTM based multi-feature extraction in gait for diagnosis of Neurodegenerative diseases , 2018, Knowl. Based Syst..

[30]  A. Sokolova,et al.  GAIT RECOGNITION BASED ON CONVOLUTIONAL NEURAL NETWORKS , 2017 .

[31]  Kang Ryoung Park,et al.  Robust Pedestrian Detection by Combining Visible and Thermal Infrared Cameras , 2015, Sensors.

[32]  Liang Lin,et al.  Deep feature learning with relative distance comparison for person re-identification , 2015, Pattern Recognit..

[33]  Jamal Ahmad Dargham,et al.  Gait Recognition using Gait Energy Image , 2011 .

[34]  Mao Ye,et al.  Memory-based Gait Recognition , 2016, BMVC.

[35]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[36]  Manuel J. Marín-Jiménez,et al.  Automatic Learning of Gait Signatures for People Identification , 2016, IWANN.

[37]  Yasushi Makihara,et al.  GEINet: View-invariant gait recognition using a convolutional neural network , 2016, 2016 International Conference on Biometrics (ICB).

[38]  C. Prathap Gait Based Human Identification in Bad Illumination , 2017 .

[39]  Samy Bengio,et al.  Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Jorge Cadima,et al.  Principal component analysis: a review and recent developments , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[41]  Omid Dehzangi,et al.  IMU-Based Gait Recognition Using Convolutional Neural Networks and Multi-Sensor Fusion , 2017, Sensors.

[42]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[43]  Shuang Wang,et al.  Skeleton-based action recognition using LSTM and CNN , 2017, 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[44]  Daksh Thapar,et al.  VGR-net: A view invariant gait recognition network , 2017, 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA).

[45]  Santanu Chaudhury,et al.  Text recognition using deep BLSTM networks , 2015, 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR).

[46]  Chen Wang,et al.  Chrono-Gait Image: A Novel Temporal Template for Gait Recognition , 2010, ECCV.

[47]  Suiping Zhou,et al.  Wearable Device-Based Gait Recognition Using Angle Embedded Gait Dynamic Images and a Convolutional Neural Network , 2017, Sensors.

[48]  Haruya Matsumoto,et al.  Phoneme recognition using recurrent neural networks , 1991, EUROSPEECH.

[49]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[51]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[52]  Wang Liang Gait-Based Human Identification , 2003 .

[53]  Chao Li,et al.  DeepGait: A Learning Deep Convolutional Representation for View-Invariant Gait Recognition Using Joint Bayesian , 2017 .

[54]  Wei Xu,et al.  Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) , 2014, ICLR.