Vehicle type classification and attribute prediction using multi-task RCNN

Vehicle classification is an important subject of study due to its significance in a number of areas including law enforcement, traffic surveillance, autonomous navigation, and transportation management. While numerous approaches have been proposed, few studies have been published with regard to the multi-view classification of vehicles captured in real surveillance. In this paper, we consider the multi-view classification of vehicles as an attribute prediction problem with views (rear, front, and side) as attributes. The corresponding multi-task learning is implemented in the Region-based Convolutional Neural Network (RCNN) framework, which classifies vehicle categories (car, truck, bus, and van) and predicts the attributes simultaneously. Experiments on a field-captured vehicle dataset provide satisfactory results, with approximate 83% accuracy for vehicle type classification and over 90% accuracy for attribute prediction.

[1]  Gang Wang,et al.  Multi-Task CNN Model for Attribute Prediction , 2015, IEEE Transactions on Multimedia.

[2]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[4]  Lawrence Carin,et al.  Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[5]  Jitendra Malik,et al.  R-CNNs for Pose Estimation and Action Detection , 2014, ArXiv.

[6]  Yan Wang,et al.  Vehicle Type Recognition in Sensor Networks Using Improved Time Encoded Signal Processing Algorithm , 2014 .

[7]  Pinghua Gong,et al.  Multi-Stage MultiTask Feature Learning ∗ , 2012 .

[8]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  Bernt Schiele,et al.  How good are detection proposals, really? , 2014, BMVC.

[10]  Yunde Jia,et al.  Vehicle Type Classification Using a Semisupervised Convolutional Neural Network , 2015, IEEE Transactions on Intelligent Transportation Systems.

[11]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[12]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[13]  C. Lawrence Zitnick,et al.  Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[14]  Sharath Pankanti,et al.  Attribute-based vehicle search in crowded surveillance videos , 2011, ICMR.

[15]  Bir Bhanu,et al.  Dynamic Bayesian Networks for Vehicle Classification in Video , 2012, IEEE Transactions on Industrial Informatics.

[16]  Zezhi Chen,et al.  Multi-shape Descriptor Vehicle Classification for Urban Traffic , 2011, 2011 International Conference on Digital Image Computing: Techniques and Applications.

[17]  Frans Coenen,et al.  Face occlusion detection based on multi-task convolution neural network , 2015, 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[18]  Neil A. Dodgson,et al.  Proceedings Ninth IEEE International Conference on Computer Vision , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[19]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20]  Jieping Ye,et al.  Multi-stage multi-task feature learning , 2012, J. Mach. Learn. Res..

[21]  Bernt Schiele,et al.  What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.