Hierarchical attributes learning for pedestrian re-identification via parallel stochastic gradient descent combined with momentum correction and adaptive learning rate

Convolutional neural networks (CNNs) have obtained high accuracy results for pedestrian re-identification in the past few years. There is always a trade-off between high accuracy and computational time in CNNs. Training CNN is always very difficult as it may take a long time to produce high accuracy results. To overcome this limitation, a novel method parallel stochastic gradient descent (PSGD) is proposed to train a five-hierarchical parallel CNNs that is designed according to pedestrian attributes. Moreover, the momentum correction and adaptive adjustment of learning rate are applied during training process and the time interval for updating parameters is inspected during optimization of parameters selection. The results of this paper prove the effectiveness of proposed PSGD that successfully decreases the training process by five times and surpasses the state-of-the-art methods of pedestrian re-identification in terms of both accuracy and time. The minimum reported running time of the proposed method is 8.7 s which is minimum among all other state-of-the-art methods. These promising results show the efficiency and performance of the proposed model.

[1]  Kenli Li,et al.  Scheduling Precedence Constrained Stochastic Tasks on Heterogeneous Cluster Systems , 2015, IEEE Transactions on Computers.

[2]  Horst Bischof,et al.  Mahalanobis Distance Learning for Person Re-identification , 2014, Person Re-Identification.

[3]  Jin-Hee Lee,et al.  ResNet-Based Vehicle Classification and Localization in Traffic Surveillance Systems , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[4]  Ehtesham Hassan,et al.  A density based method for automatic hairstyle discovery and recognition , 2013, 2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG).

[5]  Kaiqi Huang,et al.  A Richly Annotated Dataset for Pedestrian Attribute Recognition , 2016, ArXiv.

[6]  Xu Zhou,et al.  Efficient top-(k,l) range query processing for uncertain data based on multicore architectures , 2015, Distributed and Parallel Databases.

[7]  David Barber,et al.  Nesterov's accelerated gradient and momentum as approximations to regularised update descent , 2016, 2017 International Joint Conference on Neural Networks (IJCNN).

[8]  Kaiqi Huang,et al.  Pose Guided Deep Model for Pedestrian Attribute Recognition in Surveillance Scenarios , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[9]  Xiaogang Wang,et al.  Unsupervised Salience Learning for Person Re-identification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Frank Hutter,et al.  SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[11]  Abdul Hamid Adom,et al.  Facial emotion recognition under partial occlusion using Empirical Mode Decomposition , 2016, 2016 2nd IEEE International Symposium on Robotics and Manufacturing Automation (ROMA).

[12]  Léon Bottou,et al.  Stochastic Gradient Descent Tricks , 2012, Neural Networks: Tricks of the Trade.

[13]  Yongzhao Zhan,et al.  Sparse representations based distributed attribute learning for person re-identification , 2017, Multimedia Tools and Applications.

[14]  Serge J. Belongie,et al.  Removing pedestrians from Google street view images , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[15]  Robinson Piramuthu,et al.  HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[16]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Frank Hutter,et al.  SGDR: Stochastic Gradient Descent with Restarts , 2016, ArXiv.

[18]  George E. Tsekouras,et al.  Modeling beach realignment using a neuro-fuzzy network optimized by a novel backtracking search algorithm , 2018, Neural Computing and Applications.

[19]  José Francisco Martínez Trinidad,et al.  Easy Categorization of Attributes in Decision Tables Based on Basic Binary Discernibility Matrix , 2013, CIARP.

[20]  George D. Magoulas,et al.  Customised ensemble methodologies for deep learning: Boosted Residual Networks and related approaches , 2018, Neural Computing and Applications.

[21]  Xiao Wang,et al.  Pedestrian Attribute Recognition: A Survey , 2019, Pattern Recognit..

[22]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[23]  Shiliang Zhang,et al.  Deep Attributes Driven Multi-Camera Person Re-identification , 2016, ECCV.

[24]  Meiyu Shen,et al.  Equivalence Tests for Interchangeability Based on Two One-Sided Probabilities , 2014, Journal of biopharmaceutical statistics.

[25]  Philip S. Yu,et al.  A Bi-layered Parallel Training Architecture for Large-Scale Convolutional Neural Networks , 2018, IEEE Transactions on Parallel and Distributed Systems.

[26]  Haizhou Ai,et al.  A feature fusion strategy for person re-identification , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[27]  Donghoon Lee,et al.  Face attribute classification using attribute-aware correlation map and gated convolutional neural networks , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[28]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[29]  Shaogang Gong,et al.  Person Re-identification by Attributes , 2012, BMVC.

[30]  Philip S. Yu,et al.  Distributed Deep Learning Model for Intelligent Video Surveillance Systems with Edge Computing , 2019, IEEE Transactions on Industrial Informatics.

[31]  Yiqiang Chen,et al.  Pedestrian Attribute Recognition with Part-based CNN and Combined Feature Representations , 2018, VISIGRAPP.

[32]  Andrea Vedaldi,et al.  MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[33]  Maozhen Li,et al.  Data‐driven pedestrian re‐identification based on hierarchical semantic representation , 2018, Concurr. Comput. Pract. Exp..

[34]  Nadine Hajj,et al.  A piecewise weight update rule for a supervised training of cortical algorithms , 2017, Neural Computing and Applications.

[35]  Xiaoyang Tan,et al.  Sparse representations based attribute learning for flower classification , 2014, Neurocomputing.

[36]  K. Tanaka,et al.  A hierarchical quorum‐sensing cascade in Pseudomonas aeruginosa links the transcriptional activators LasR and RhIR (VsmR) to expression of the stationary‐phase sigma factor RpoS , 1996, Molecular microbiology.

[37]  Nazli Ikizler-Cinbis,et al.  Low-level features for visual attribute recognition: An evaluation , 2016, Pattern Recognit. Lett..

[38]  J. Zurada,et al.  Convergence of batch gradient learning with smoothing regularization and adaptive momentum for neural networks , 2016, SpringerPlus.

[39]  Shaogang Gong,et al.  Attributes-Based Re-identification , 2014, Person Re-Identification.

[40]  Kai-Kuang Ma,et al.  HOG-assisted deep feature learning for pedestrian gender recognition , 2017, J. Frankl. Inst..

[41]  Kenli Li,et al.  Performance Analysis and Optimization for SpMV on GPU Using Probabilistic Modeling , 2015, IEEE Transactions on Parallel and Distributed Systems.

[42]  Francesco Solera,et al.  Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[43]  Yujie Liu,et al.  PARNet: A Joint Loss Function and Dynamic Weights Network for Pedestrian Semantic Attributes Recognition of Smart Surveillance Image , 2019 .

[44]  Xiaowei Li,et al.  C-Brain: A deep learning accelerator that tames the diversity of CNNs through adaptive data-level parallelization , 2016, 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC).

[45]  Tülay Adali,et al.  Data-driven fusion of multi-camera video sequences: Application to abandoned object detection , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[46]  Dieter Fox,et al.  Object recognition with hierarchical kernel descriptors , 2011, CVPR 2011.

[47]  ZhanYongzhao,et al.  Sparse representations based distributed attribute learning for person re-identification , 2017 .

[48]  Yang Liu,et al.  MapReduce Based Parallel Neural Networks in Enabling Large Scale Machine Learning , 2015, Comput. Intell. Neurosci..

[49]  Go Irie,et al.  Attribute Discovery for Person Re-Identification , 2016, MMM.

[50]  Xiaogang Wang,et al.  Locally Aligned Feature Transforms across Views , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Hai Tao,et al.  Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[52]  Kang-Hyun Jo,et al.  Hybrid cascade boosting machine using variant scale blocks based HOG features for pedestrian detection , 2014, Neurocomputing.

[53]  Xiaogang Wang,et al.  Human Reidentification with Transferred Metric Learning , 2012, ACCV.

[54]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[55]  Anazida Zainal,et al.  An optimized skin texture model using gray-level co-occurrence matrix , 2017, Neural Computing and Applications.

[56]  Alessandro Perina,et al.  Person re-identification by symmetry-driven accumulation of local features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[57]  Jamario White,et al.  Distracted walking: Examining the extent to pedestrian safety problems , 2015 .

[58]  Thanh Phuong Nguyen,et al.  Impact of Topology-Related Attributes from Local Binary Patterns on Texture Classification , 2014, ECCV Workshops.

[59]  María Vanrell,et al.  Color Representation in CNNs: Parallelisms with Biological Vision , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[60]  Liqing Zhang,et al.  Multi-shot Pedestrian Re-identification via Sequential Decision Making , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[61]  Jian Guo,et al.  Depth Dropout: Efficient Training of Residual Convolutional Neural Networks , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[62]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[63]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[64]  Chang-Tsun Li,et al.  Developing a pattern discovery method in time series data and its GPU acceleration , 2018, Big Data Min. Anal..

[65]  Asmelash Teka,et al.  Large-scale learning with AdaGrad on Spark , 2015, 2015 IEEE International Conference on Big Data (Big Data).