Foreground extraction for freely moving RGBD cameras

In this study, the authors propose a novel method to perform foreground extraction for freely moving RGBD cameras. Although the field of foreground extraction or background subtraction has been explored by the computer vision researcher community since a long time, the depth-based subtraction is relatively new and has not been extensively addressed as of yet. Most of the current methods make heavy use of geometric reconstruction, making the solutions quite restrictive. In this study, the authors make a novel use of RGB and depth data: from the RGB frame, they first extract corner features and then represent these features with the histogram of oriented gradients (HoG) descriptor. They then train a non-linear SVM on these HoG descriptors. During the test phase, they make use of the fact that the foreground object has a distinct depth ordering with respect to the rest of the scene. Hence, they use the positively classified features from accelerated segment test (FAST) features on the test frame to initiate a region growing algorithm to obtain an accurate foreground segmentation from the depth data alone. The authors demonstrate the proposed method on six datasets, and demonstrate encouraging quantitative and qualitative results.

[1]  Wei Chen,et al.  A refinement framework for background subtraction based on color and depth data , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[2]  James W. Davis,et al.  Exploiting Temporal Geometry for Moving Camera Background Subtraction , 2014, 2014 22nd International Conference on Pattern Recognition.

[3]  Thierry Fraichard,et al.  Motion estimation from range images in dynamic outdoor scenes , 2010, 2010 IEEE International Conference on Robotics and Automation.

[4]  Bohyung Han,et al.  Generalized background subtraction based on hybrid inference by belief propagation and Bayesian filtering , 2011, 2011 International Conference on Computer Vision.

[5]  Bohyung Han,et al.  Modeling and segmentation of floating foreground and background in videos , 2012, Pattern Recognit..

[6]  Takeo Kanade,et al.  Background Subtraction for Freely Moving Cameras , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[7]  Rakesh Kumar,et al.  Fast human segmentation using color and depth , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Sergio Escalera,et al.  Multi-modal RGB–Depth–Thermal Human Body Segmentation , 2016, International Journal of Computer Vision.

[9]  Reinhard Koch,et al.  Improved Video Segmentation by Adaptive Combination of Depth Keying and Mixture-of-Gaussians , 2011, SCIA.

[10]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[11]  Junzhou Huang,et al.  Background Subtraction Using Low Rank and Group Sparsity Constraints , 2012, ECCV.

[12]  Truong Q. Nguyen,et al.  Moving Object Detection With a Freely Moving Camera via Background Motion Subtraction , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[14]  Allen R. Tannenbaum,et al.  Localizing Region-Based Active Contours , 2008, IEEE Transactions on Image Processing.

[15]  Dan Xu,et al.  Background Subtraction Using Local SVD Binary Pattern , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16]  Esteban Walter Gonzalez Clua,et al.  A Comparison between Background Subtraction Algorithms using a Consumer Depth Camera , 2012, VISAPP.

[17]  Mubarak Shah,et al.  Tracking and Object Classification for Automated Surveillance , 2002, ECCV.

[18]  François Brémond,et al.  Background subtraction in people detection framework for RGB-D cameras , 2014, 2014 11th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[19]  Didier Stricker,et al.  Comparison of Kinect V1 and V2 Depth Images in Terms of Accuracy and Precision , 2016, ACCV Workshops.

[20]  Krishnan Kutty,et al.  Background Modelling from a Moving Camera , 2015 .

[21]  Hujun Bao,et al.  Robust Bilayer Segmentation and Motion/Depth Estimation with a Handheld Camera , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[24]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[25]  Ahmed M. Elgammal,et al.  Online Moving Camera Background Subtraction , 2012, ECCV.

[26]  Ammar Bouallègue,et al.  Efficient and fast multi-modal foreground-background segmentation using RGBD data , 2017, Pattern Recognit. Lett..

[27]  Deborah Estrin,et al.  Background Subtraction on Distributions , 2008, ECCV.

[28]  Luis Salgado,et al.  Background foreground segmentation with RGB-D Kinect data: An efficient combination of classifiers , 2014, J. Vis. Commun. Image Represent..

[29]  Lin Yang,et al.  Evaluating and Improving the Depth Accuracy of Kinect for Windows v2 , 2015, IEEE Sensors Journal.

[30]  Eduardo Ros,et al.  Background subtraction model based on color and depth cues , 2013, Machine Vision and Applications.

[31]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Antonis A. Argyros,et al.  Foreground Detection with a Moving RGBD Camera , 2013, ISVC.

[33]  Eduardo Ros,et al.  Background Subtraction Based on Color and Depth Using Active Sensors , 2013, Sensors.

[34]  Tom Drummond,et al.  Fusing points and lines for high performance tracking , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[35]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[36]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.