Multi-view video segmentation and tracking for video surveillance

Tracking moving objects is a critical step for smart video surveillance systems. Despite the complexity increase, multiple camera systems exhibit the undoubted advantages of covering wide areas and handling the occurrence of occlusions by exploiting the different viewpoints. The technical problems in multiple camera systems are several: installation, calibration, objects matching, switching, data fusion, and occlusion handling. In this paper, we address the issue of tracking moving objects in an environment covered by multiple un-calibrated cameras with overlapping fields of view, typical of most surveillance setups. Our main objective is to create a framework that can be used to integrate objecttracking information from multiple video sources. Basically, the proposed technique consists of the following steps. We first perform a single-view tracking algorithm on each camera view, and then apply a consistent object labeling algorithm on all views. In the next step, we verify objects in each view separately for inconsistencies. Correspondent objects are extracted through a Homography transform from one view to the other and vice versa. Having found the correspondent objects of different views, we partition each object into homogeneous regions. In the last step, we apply the Homography transform to find the region map of first view in the second view and vice versa. For each region (in the main frame and mapped frame) a set of descriptors are extracted to find the best match between two views based on region descriptors similarity. This method is able to deal with multiple objects. Track management issues such as occlusion, appearance and disappearance of objects are resolved using information from all views. This method is capable of tracking rigid and deformable objects and this versatility lets it to be suitable for different application scenarios.

[1]  Jiang Li,et al.  Color based multiple people tracking , 2002, 7th International Conference on Control, Automation, Robotics and Vision, 2002. ICARCV 2002..

[2]  Hung Hai Bui,et al.  Mutliple camera coordination in a surveillance system , 2003 .

[3]  Larry S. Davis,et al.  Unified multi-camera detection and tracking using region-matching , 2001, Proceedings 2001 IEEE Workshop on Multi-Object Tracking.

[4]  Jake K. Aggarwal,et al.  Automatic tracking of human motion in indoor scenes across multiple synchronized video streams , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[5]  Rita Cucchiara,et al.  Posture classification in a multi-camera indoor environment , 2005, IEEE International Conference on Image Processing 2005.

[6]  Tim J. Ellis,et al.  Multi camera image tracking , 2006, Image Vis. Comput..

[7]  Joonki Paik,et al.  Mutiple-view object tracking using metadata , 2007, 2007 International Conference on Wavelet Analysis and Pattern Recognition.

[8]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[9]  Hideo Saito,et al.  Tracking soccer players based on homography among multiple views , 2003, Visual Communications and Image Processing.

[10]  Larry S. Davis,et al.  W4S : A real-time system for detecting and tracking people in 2 D , 1998, eccv 1998.

[11]  Larry S. Davis,et al.  W/sup 4/: A Real Time System for Detecting and Tracking People , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[12]  Rama Chellappa,et al.  Robust two-camera tracking using homography , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Touradj Ebrahimi,et al.  Tracking video objects in cluttered background , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Larry S. Davis,et al.  W/sup 4/: Who? When? Where? What? A real time system for detecting and tracking people , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[15]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Ramin Zabih,et al.  Counting people from multiple cameras , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[17]  James Black,et al.  Multi view image surveillance and tracking , 2002, Workshop on Motion and Video Computing, 2002. Proceedings..

[18]  Simone Calderara,et al.  Entry edge of field of view for multi-camera tracking in distributed video surveillance , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[19]  Mubarak Shah,et al.  Consistent Labeling of Tracked Objects in Multiple Cameras with Overlapping Fields of View , 2003, IEEE Trans. Pattern Anal. Mach. Intell..