Semi-automatic annotation samples for vehicle type classification in urban environments

Data collection, and especially data annotation, are surprisingly time consuming and costly tasks for vehicle classification. Annotation is used to label examples of vehicles, manually outlining their shapes and assigning their correct classification, for use in classifier training and performance evaluation. This study presents a semi-automatic approach for the annotation of the vehicle samples recorded from roadside CCTV video cameras. Vehicles are detected by using automatic image analysis and classified into four main categories: car, van, bus and motorcycle/bicycle by using a vehicle observation vector constructed from the size, the shape and the appearance features. Unsupervised K-means clustering is used to automatically compute an initial class label for each detected vehicle. Then, in an iterative process, the output scores of a linear support vector machines classifier are used to identify the low confidence samples, for which the annotations are considered for manual correction. Experimental results are presented for both synthetic and real datasets to demonstrate the effectiveness and the efficiency of the authors approach, which significantly reduces the time required to generate an annotated dataset. The method is general enough that it can be used in other classification problems and domains that use a manually-created ground-truth.

[1]  Ashish Kapoor,et al.  Active learning for large multi-class problems , 2009, CVPR.

[2]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[3]  Sergio A. Velastin,et al.  A Review of Computer Vision Techniques for the Analysis of Urban Traffic , 2011, IEEE Transactions on Intelligent Transportation Systems.

[4]  Zezhi Chen,et al.  Background Subtraction in Video Using Recursive Mixture Models, Spatio-Temporal Filtering and Shadow Removal , 2009, ISVC.

[5]  Mohan M. Trivedi,et al.  A General Active-Learning Framework for On-Road Vehicle Recognition and Tracking , 2010, IEEE Transactions on Intelligent Transportation Systems.

[6]  Nikolaos Papanikolopoulos,et al.  Scalable Active Learning for Multiclass Image Classification , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Zezhi Chen,et al.  Vehicle type categorization: A comparison of classification schemes , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[8]  Horst Bischof,et al.  Active sampling via tracking , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[9]  Nikos A. Vlassis,et al.  The global k-means clustering algorithm , 2003, Pattern Recognit..

[10]  Ishwar K. Sethi,et al.  Confidence-based active learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Yen-Wei Chen,et al.  Image recognition by learned linear subspace of combined bag-of-features and low-level features , 2010, 2010 IEEE International Conference on Image Processing.

[12]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[13]  Zezhi Chen,et al.  Confidence based active learning for vehicle classification in urban traffic , 2012 .

[14]  C. Laurgeau,et al.  Vehicle detection combining gradient analysis and AdaBoost classification , 2005, Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005..

[15]  Nikolaos Papanikolopoulos,et al.  Coverage optimized active learning for k - NN classifiers , 2012, 2012 IEEE International Conference on Robotics and Automation.

[16]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[17]  Zezhi Chen,et al.  Self-adaptive Gaussian mixture model for urban traffic monitoring system , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[18]  Zezhi Chen,et al.  Vehicle detection, tracking and classification in urban traffic , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[19]  Pat Morin,et al.  Output-Sensitive Algorithms for Computing Nearest-Neighbour Decision Boundaries , 2005, Discret. Comput. Geom..

[20]  Lianwen Jin,et al.  A novel feature extraction method using Pyramid Histogram of Orientation Gradients for smile recognition , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[21]  David D. Lewis,et al.  Heterogeneous Uncertainty Sampling for Supervised Learning , 1994, ICML.