A visually salient approach to recognize vehicles based on hierarchical architecture

In order to recognize multi-class vehicles, traditional methods are typically based on license plates and frontal images of vehicles. These methods rely heavily on specific datasets and thus are not applicable in real-world tasks. In this paper, we propose a novel method based on a hierarchical model, HMAX, which simulates visual architecture of primates for object recognition. It can extract features of shift-invariance and scale-invariance by Gabor filtering, template matching, and max pooling. In particular, we adopt a model of saliency-based visual attention to detect salient patches for template matching, also we drop inefficient features via an all-pairs linear SVM. During experiments, high accuracy and great efficiency are achieved on a dataset which has 31 types and over 1400 vehicle images with varying scales, orientations, and colors. With comparisons with Original-HMAX, Salient-HMAX, and Sifted-HMAX model, our method achieves classifying accuracy at 92% and time for each image at around 1.5s, while reduces 73% of the time consumed by original HMAX model.

[1]  Yifan Zhou,et al.  Vehicle Classification with Confidence by Classified Vector Quantization , 2013, IEEE Intelligent Transportation Systems Magazine.

[2]  Thomas Serre,et al.  Object recognition with features inspired by visual cortex , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[4]  Hsi-Jian Lee,et al.  Extraction and recognition of license plates of motorcycles and vehicles on highways , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[5]  Nick Pears,et al.  Automatic make and model recognition from frontal images of cars , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[6]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[7]  Qian Huang,et al.  Automatic license extraction from moving vehicles , 1997, Proceedings of International Conference on Image Processing.

[8]  Ioannis Anagnostopoulos,et al.  A License Plate-Recognition Algorithm for Intelligent Transportation System Applications , 2006, IEEE Transactions on Intelligent Transportation Systems.

[9]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[10]  Xiao Hui-min Computational Model of Visual Selective Attention , 2009 .

[11]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[12]  Maurice Milgram,et al.  Multi-class Vehicle Type Recognition System , 2008, ANNPR.

[13]  David G. Lowe,et al.  University of British Columbia. , 1945, Canadian Medical Association journal.