Videopsy: dissecting visual data in space-time

Network camera, made possible by recent advances in the integration of sensing, compression, and communication hardware, is a new video source that can be easily deployed and remotely managed. Unobtrusively located along highways, at airports, or in office buildings, such cameras can form a visual sensor network, or camera web, an extremely rich source of visual information. In its infancy today, camera web deployment will likely accelerate in the future and one can expect visual sensing devices to eventually become as ubiquitous as electric bulbs. While the capturing hardware has evolved tremendously, hardware and algorithms necessary for effective analysis and efficient communication of multi-camera data clearly lag. In this article, I overview one particular aspect of visual data analysis, namely, space-time video segmentation that is often a pre-requisite for motion estimation, video compression, event detection, scene understanding, etc. I introduce the concept of object tunnel, a 3-D surface in space-time through which a video object travels, and the associated concept of occlusion volume. I present examples of object tunnels and occlusion volumes on surveillance data that, upon further processing, may lead to automatic event detection or scene understanding. Finally, I describe challenges in extending video analysis algorithms to visual sensor networks, and I outline some possible approaches

[1]  Tony F. Chan,et al.  A Multiphase Level Set Framework for Image Segmentation Using the Mumford and Shah Model , 2002, International Journal of Computer Vision.

[2]  Michel Barlaud,et al.  DREAM2S: Deformable Regions Driven by an Eulerian Accurate Minimization Method for Image and Video Segmentation , 2002, International Journal of Computer Vision.

[3]  Alice Caplier,et al.  Spatiotemporal MRF approach to video segmentation: Application to motion detection and lip segmentation , 1999, Signal Processing.

[4]  Janusz Konrad,et al.  Multiple motion segmentation with level sets , 2003, IEEE Trans. Image Process..

[5]  Janusz Konrad,et al.  Space-time image sequence analysis: object tunnels and occlusion volumes , 2006, IEEE Transactions on Image Processing.

[6]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[7]  Yao Wang,et al.  An unsupervised multi-resolution object extraction algorithm using video-cube , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[8]  Rachid Deriche,et al.  Geodesic Active Contours and Level Sets for the Detection and Tracking of Moving Objects , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Julian Magarey,et al.  Three-dimensional video segmentation using a variational method , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[10]  W. Clem Karl,et al.  A fast level set method without solving PDEs [image segmentation applications] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[11]  King Ngi Ngan,et al.  Special Issue on segmentation, description, and retrieval of video content , 1998 .

[12]  Guillermo Sapiro,et al.  Geodesic Active Contours , 1995, International Journal of Computer Vision.

[13]  Janusz Konrad,et al.  CHAPTER 3 – Motion Detection and Estimation , 2009 .

[14]  Amar Mitiche,et al.  Spatiotemporal motion boundary detection and motion boundary velocity estimation for tracking moving objects with a moving camera: a level sets PDEs approach with concurrent camera motion compensation , 2004, IEEE Transactions on Image Processing.

[15]  Stanley Osher,et al.  Level Set Methods , 2003 .