A combined motion and appearance model for human tracking in multiple cameras environment

The aim of this paper is to present an algorithm for multiple object tracking and video summarization in a scene filmed by one or several cameras. We propose a computationally efficient real time human tracking algorithm, which can 1) track objects inside the field of view (FOV) of a camera even in case of occlusions; 2) recognize objects that quit and then return on a camera's FOV; 3) recognize objects passing through different cameras FOV. We propose a simple 1-D appearance model, called vertical feature (VF), view and size invariant, which is stored in a database in order to help object recognition. We combine it with other motion features like position and velocity for real-time tracking. We find the k closest matches of current object and select the one whose predicted position is closest to the current object position. Our algorithm shows good capabilities for objects tracking even with the change of object view angle and also with the partial change of shape. We compare our algorithm with appearance based and motion based algorithms and show the advantage of a combined approach.

[1]  Pierre Vandergheynst,et al.  Cascade of descriptors to detect and track objects across any network of cameras , 2010, Comput. Vis. Image Underst..

[2]  Carlos Vázquez,et al.  Occlusion and split detection and correction for object tracking in surveillance applications , 2007, Electronic Imaging.

[3]  Luc Van Gool,et al.  An adaptive color-based particle filter , 2003, Image Vis. Comput..

[4]  Mubarak Shah,et al.  Tracking Multiple Occluding People by Localizing on Multiple Scene Planes , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Nicolas Thome,et al.  Human Body Part Labeling and Tracking Using Graph Matching Theory , 2006, 2006 IEEE International Conference on Video and Signal Based Surveillance.

[6]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[7]  Serge Miguet,et al.  Real Time Foreground-Background Segmentation Using a Modified Codebook Model , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[8]  Greg Welch,et al.  Welch & Bishop , An Introduction to the Kalman Filter 2 1 The Discrete Kalman Filter In 1960 , 1994 .

[9]  Larry S. Davis,et al.  Efficient Kernel Density Estimation Using the Fast Gauss Transform with Applications to Color Modeling and Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[11]  Heungkyu Lee,et al.  Detection of occluded multiple objects using occlusion activity detection and object association , 2004, Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004..

[12]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Stanley T. Birchfield,et al.  Spatiograms versus histograms for region-based tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Jake K. Aggarwal,et al.  Temporal spatio-velocity transform and its application to tracking and interaction , 2004, Comput. Vis. Image Underst..

[16]  Pietro Perona,et al.  Pedestrian detection: A benchmark , 2009, CVPR.

[17]  Fabien Moutarde,et al.  Person re-identification in multi-camera system by signature based on interest point descriptors collected on short video sequences , 2008, 2008 Second ACM/IEEE International Conference on Distributed Smart Cameras.

[18]  Branko Ristic,et al.  A particle filter for joint detection and tracking of color objects , 2007, Image Vis. Comput..

[19]  L. Davis,et al.  M2Tracker: A Multi-View Approach to Segmenting and Tracking People in a Cluttered Scene , 2003, International Journal of Computer Vision.