A texton-based kernel density estimation approach for background modeling under extreme conditions

Abstract Background modeling is a well-know approach to detect moving objects in video sequences. In recent years, background modeling methods that adopt spatial and texture information have been developed for dealing with complex scenarios. However, none of the investigated approaches have been tested under extreme conditions, such as the underwater domain, on which effects compromising the video quality affect negatively the performance of the background modeling process. In order to overcome such difficulties, more significant features and more robust methods must be found. In this paper, we present a kernel density estimation method which models background and foreground by exploiting textons to describe textures within small and low contrasted regions. Comparison with other texture descriptors, namely, local binary pattern (LBP) and scale invariant local ternary pattern (SILTP) shown improved performance. Besides, quantitative and qualitative performance evaluation carried out on three standard datasets showing very complex conditions revealed that our method outperformed state-of-the-art methods that use different features and modeling techniques and, most importantly, it is able to generalize over different scenarios and targets.

[1]  Fatih Murat Porikli,et al.  Achieving real-time object detection and tracking under extreme conditions , 2006, Journal of Real-Time Image Processing.

[2]  Silvano Di Zenzo,et al.  A note on the gradient of a multi-image , 1986, Comput. Vis. Graph. Image Process..

[3]  Raimondo Schettini,et al.  Underwater Image Processing: State of the Art of Restoration and Image Enhancement Methods , 2010, EURASIP J. Adv. Signal Process..

[4]  Jean-Marc Odobez,et al.  Multi-Layer Background Subtraction Based on Color and Texture , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[6]  Ferdinand van der Heijden,et al.  Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[7]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Robert B. Fisher,et al.  Understanding fish behavior during typhoon events in real-life underwater environments , 2012, Multimedia Tools and Applications.

[9]  Nikos Paragios,et al.  Motion-based background subtraction using adaptive kernel density estimation , 2004, CVPR 2004.

[10]  Jingyu Yang,et al.  Image retrieval based on the texton co-occurrence matrix , 2008, Pattern Recognit..

[11]  Marko Heikkilä,et al.  Description of interest regions with local binary patterns , 2009, Pattern Recognit..

[12]  Qi Tian,et al.  Foreground object detection from videos containing complex background , 2003, MULTIMEDIA '03.

[13]  M. Rosenblatt Remarks on Some Nonparametric Estimates of a Density Function , 1956 .

[14]  Allen R. Hanson,et al.  Background modeling using adaptive pixelwise kernel variances in a hybrid feature space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Simone Palazzo,et al.  Quantitative performance analysis of object detection algorithms on underwater video footage , 2012, MAED '12.

[16]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Matti Pietikäinen,et al.  Modeling pixel process with scale invariant local patterns for background subtraction in complex scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  Simone Palazzo,et al.  An innovative web-based collaborative platform for video annotation , 2014, Multimedia Tools and Applications.

[19]  Kazuhiko Sumi,et al.  Background subtraction based on cooccurrence of image variations , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[20]  Baochang Zhang,et al.  Kernel Similarity Modeling of Texture Pattern Flow for Motion Detection in Complex Background , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Nikos Paragios,et al.  Background modeling and subtraction of dynamic scenes , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[22]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Thierry Chateau,et al.  A Benchmark Dataset for Foreground/Background Extraction , 2012, ACCV 2012.

[24]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[25]  Du-Ming Tsai,et al.  Independent Component Analysis-Based Background Subtraction for Indoor Surveillance , 2009, IEEE Transactions on Image Processing.

[26]  Montse Pardàs,et al.  Bayesian foreground segmentation and tracking using pixel-wise background model and region based foreground model , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[27]  Bohyung Han,et al.  Density-Based Multifeature Background Subtraction with Support Vector Machine , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[29]  Laure Tougne,et al.  A testing framework for background subtraction algorithms comparison in intrusion detection context , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[30]  B. Julesz Textons, the elements of texture perception, and their interactions , 1981, Nature.

[31]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[33]  Matt P. Wand,et al.  On the Accuracy of Binned Kernel Density Estimators , 1994 .