Fine-Grained Vehicle Recognition via Detection-Classification-Tracking in Surveillance Video

Object recognition is a wide applied task in computer vision. Many fine-grained object recognition approaches are proposed in recent years to detect the same species objects effectively at subordinate-level. In this paper, we present a novel fine-grained vehicle recognition by utilizing collaborative feedback scheme of detection-classification-tracking in surveillance video. We collect a labeled data set of 200 car images which contains three types of car images in addition to negative samples, and extract Haar-like features from the training samples to build global appearance model for fine-grained feature representation. Then a multi-class SVMs classifier is trained offline on the training data set to distinguish the intra-class variability of cars. The collaborative feedback scheme incorporate the tracking and feedback constraint for reduce the frequency of detection and recognition. That is, the detector localizes the motion objects in every frame except that have been observed and tracked, and the best matching object with the given initial car is identified by the classifier and be tracked in the subsequent frames until it is not present. The collaborative scheme of detecting-and-tracking can decreases the computational cost in terms of the frequency of detection and recognition at each frame. The experiment result shows that our approach can effectively locate the best matching car at the frames when the target car appears.

[1]  Xiaonan Luo,et al.  Real-time background subtraction-based video surveillance of people by integrating local texture patterns , 2014, Signal Image Video Process..

[2]  David W. Jacobs,et al.  Dog Breed Classification Using Part Localization , 2012, ECCV.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Ling Shao,et al.  Submodular Object Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Hefeng Wu,et al.  Hierarchical Ensemble of Background Models for PTZ-Based Video Surveillance , 2015, IEEE Transactions on Cybernetics.

[6]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  Andrew Zisserman,et al.  Automated Flower Classification over a Large Number of Classes , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.

[8]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Fei-Fei Li,et al.  Novel Dataset for Fine-Grained Image Categorization : Stanford Dogs , 2012 .

[10]  Jiri Matas,et al.  Forward-Backward Error: Automatic Detection of Tracking Failures , 2010, 2010 20th International Conference on Pattern Recognition.

[11]  Shree K. Nayar,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Describable Visual Attributes for Face Verification and Image Search , 2022 .

[12]  Jian Sun,et al.  An associate-predict model for face recognition , 2011, CVPR 2011.

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Peter N. Belhumeur,et al.  POOF: Part-Based One-vs.-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Larry S. Davis,et al.  Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance , 2011, 2011 International Conference on Computer Vision.

[16]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Shenghuo Zhu,et al.  Efficient Object Detection and Segmentation for Fine-Grained Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.