A traffic surveillance system using dynamic saliency map and SVM boosting

This paper proposes a traffic surveillance system that can efficiently detect an interesting object and identify vehicles and pedestrians in real traffic situations. The proposed system consists of a moving object detection model and an object identification model. A dynamic saliency map is used for analyzing dynamics of the successive static saliency maps, and can localize an attention area in dynamic scenes to focus on a specific moving object for traffic surveillance purposes. The candidate local areas of a moving object are followed by a blob detection processing including binarization, morphological closing and labeling methods. For identifying a moving object class, the proposed system uses a hybrid of global and local information in each local area. Although the global feature analysis is a compact way to identify an object and provide a good accuracy for non-occluded objects, it is sensitive to image translation and occlusion. Therefore, a local feature analysis is also considered and combined with the global feature analysis. In order to construct an efficient classifier using the global and local features, this study proposes a novel classifier based on boosting of support vector machines. The proposed object identification model can identify a class of moving object and discard unexpected candidate area which does not include an interesting object. As a result, the proposed road surveillance system is able to detect a moving object and identify the class of the moving object. Experimental results show that the proposed traffic surveillance system can successfully detect specific moving objects.

[1]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[2]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[3]  Ioannis Pitas,et al.  Digital Image Processing Algorithms , 1993 .

[4]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[5]  Kunihiko Fukushima,et al.  Use of non-uniform spatial blur for image comparison: symmetry axis extraction , 2005, Neural Networks.

[6]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[7]  Minho Lee,et al.  Dynamic visual selective attention model , 2008, Neurocomputing.

[8]  J. Gray,et al.  Septal driving of hippocampal theta rhythm as a function of frequency in the male rat: Effects of drugs , 1977, Neuroscience.

[9]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[10]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Minho Lee,et al.  Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment , 2008, Neural Networks.

[12]  Yiğithan Dedeoğlu,et al.  Moving object detection, tracking and classification for smart video surveillance , 2004 .

[13]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[14]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.

[15]  Jong Bae Kim,et al.  Efficient region-based motion segmentation for a video monitoring system , 2003, Pattern Recognit. Lett..

[16]  Matthew B. Blaschko,et al.  Combining Local and Global Image Features for Object Class Recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[17]  Minho Lee,et al.  Biologically motivated vergence control system using human-like selective attention model , 2006, Neurocomputing.

[18]  Anup Basu,et al.  Motion Tracking with an Active Camera , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[20]  Chandrika Kamath,et al.  Robust techniques for background subtraction in urban traffic video , 2004, IS&T/SPIE Electronic Imaging.

[21]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[22]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[23]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[24]  Jake K. Aggarwal,et al.  Tracking Human Motion in Structured Environments Using a Distributed-Camera System , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[26]  Minho Lee,et al.  Saliency map model with adaptive masking based on independent component analysis , 2002, Neurocomputing.

[27]  Minho Lee,et al.  Obstacle Categorization Based on Hybridizing Global and Local Features , 2009, ICONIP.

[28]  Shiming Xiang,et al.  Real-time Object Classification in Video Surveillance Based on Appearance Learning , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Abel G. Oliva,et al.  Gist of a scene , 2005 .