Computationally efficient background subtraction in the light field domain

In this paperwe present a novel approach for depth estimation and background subtraction in light field images. our approach exploits the regularity and the internal structure of the light field signal in order to extract an initial depth map of the captured scene and uses the extracted depth map as the input to a final segmentation algorithm which finely isolates the background in the image. Background subtraction is natural application of the light field information since it is highly involved with depth information and segmentation. However many of the approaches proposed so far are not optimized specifically for background subtraction and are highly computationally expensive. Here we propose an approach based on a modified version of the well-known Radon Transform and not involving massive matrix calculations. It is therefore computationally very efficient and appropriate for real-time use. Our approach exploits the structured nature of the light field signal and the information inherent in the plenoptic space in order to extract an initial depth map and background model of the captured scene. We apply a modified the Radon transform and the gradient operator to horizontal slices of the light field signal to infer the initial depth map. The initial depth estimated are further refined to a precise background using a series of depth thresholding and segmentation in ambiguous areas. We test on method on various types real and synthetic of light field images. Scenes with different levels of clutter and also various foreground object depth have been considered in the experiments. The results of our experiments show much better computational complexity while retaining comparable performance to similar more complex methods.

[1]  Pier Luigi Dragotti,et al.  Segmentation of Epipolar-Plane Image Volumes with Occlusion and Disocclusion Competition , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[2]  Edward H. Adelson,et al.  Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  M. Brookes,et al.  Object recognition using multi-view imaging , 2008, 2008 9th International Conference on Signal Processing.

[4]  Yael Pritch,et al.  Scene reconstruction from high spatio-angular resolution light fields , 2013, ACM Trans. Graph..

[5]  Gordon Wetzstein,et al.  Compressive light field photography using overcomplete dictionaries and optimized projections , 2013, ACM Trans. Graph..

[6]  Zhan Yu,et al.  Lytro camera technology: theory, algorithms, performance analysis , 2013, Electronic Imaging.

[7]  Robert C. Bolles,et al.  Epipolar-plane image analysis: An approach to determining structure from motion , 1987, International Journal of Computer Vision.

[8]  Rafael Grompone von Gioi,et al.  LSD: A Fast Line Segment Detector with a False Detection Control , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Minh N. Do,et al.  Ieee Transactions on Image Processing on the Bandwidth of the Plenoptic Function , 2022 .

[10]  J. Berent,et al.  Plenoptic Manifolds , 2007, IEEE Signal Processing Magazine.

[11]  Thierry Bouwmans,et al.  Background Modeling using Mixture of Gaussians for Foreground Detection - A Survey , 2008 .

[12]  Ivana Tosic,et al.  Dictionary Learning for Incoherent Sampling with application to plenoptic imaging , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Sven Wanner,et al.  Variational Light Field Analysis for Disparity Estimation and Super-Resolution , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Sven Wanner,et al.  Globally Consistent Multi-label Assignment on the Ray Space of 4D Light Fields , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Bertrand Vachon,et al.  Statistical Background Modeling for Foreground Detection: A Survey , 2010 .

[16]  Ramesh Raskar,et al.  Near-invariant blur for depth and 2D motion via time-varying light field analysis , 2013, TOGS.

[17]  Shree K. Nayar,et al.  PiCam , 2013, ACM Trans. Graph..