Single/cross-camera multiple-person tracking by graph matching

Single and cross-camera multiple person tracking in unconstrained condition is an extremely challenging task in computer vision. Facing the main difficulties caused by the existence of occlusion in single-camera scenario and the occurrence of transition in cross-camera scenario, we propose a unified framework formulated in graph matching with affinity constraints for both single and cross-camera tracking tasks. To our knowledge, our work is the first to unify two kinds of tracking problems with the same framework by graph matching. The proposed method consists of two steps, tracklet generation and tracklet association. First, we implement the modified part-based human detector and the Tracking-Modeling-Detection (TMD) method for tracklet generation. Then we propose to associate tracklets by graph matching which is mathematically formulated into the Rayleigh Quotients Maximization. The comparison experiments show that the proposed method can produce the competing results with the state-of-the-art methods.

[1]  Yue Gao,et al.  Symbiotic Tracker Ensemble Toward A Unified Tracking Framework , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Luc Van Gool,et al.  Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Bodo Rosenhahn,et al.  Everybody needs somebody: Modeling social and grouping behavior on a linear programming multiple people tracker , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[5]  Serdar Korukoglu,et al.  Moving object detection and tracking by using annealed background subtraction method in videos: Performance optimization , 2012, Expert Syst. Appl..

[6]  Wen Gao,et al.  Learning to Distribute Vocabulary Indexing for Scalable Visual Search , 2013, IEEE Transactions on Multimedia.

[7]  Carlo S. Regazzoni,et al.  Performance Evaluation of Multi-camera Visual Tracking , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[8]  Steven Gold,et al.  A Graduated Assignment Algorithm for Graph Matching , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Zhong Zhang,et al.  Video Surveillance Using a Multi-Camera Tracking and Fusion System , 2009, Multi-Camera Networks.

[10]  ZhangJing,et al.  Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video , 2009 .

[11]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Luis E. Ortiz,et al.  Who are you with and where are you going? , 2011, CVPR 2011.

[13]  Yue Gao,et al.  Camera Constraint-Free View-Based 3-D Object Retrieval , 2012, IEEE Transactions on Image Processing.

[14]  Xindong Wu,et al.  3-D Object Retrieval With Hausdorff Distance Learning , 2014, IEEE Transactions on Industrial Electronics.

[15]  Ramakant Nevatia,et al.  Multi-target tracking by online learning of non-linear motion patterns and robust appearance models , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Shihong Lao,et al.  Multi-object tracking through occlusions by local tracklets filtering and global tracklets association with detection responses , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Margrit Betke,et al.  Coupling detection and data association for multiple object tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[20]  James J. Little,et al.  A Linear Programming Approach for Multiple Object Tracking , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Rongrong Ji,et al.  Robust tracking via patch-based appearance model and local background estimation , 2014, Neurocomputing.

[22]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[23]  Jing Zhang,et al.  Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Pascal Fua,et al.  Robust People Tracking with Global Trajectory Optimization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[26]  Xuelong Li,et al.  Spectral-Spatial Constraint Hyperspectral Image Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Luc Van Gool,et al.  Improving Data Association by Joint Modeling of Pedestrian Trajectories and Groupings , 2010, ECCV.

[28]  Stephan Schraml,et al.  Spatiotemporal multiple persons tracking using Dynamic Vision Sensor , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[29]  Yongdong Zhang,et al.  Efficient Parallel Framework for H.264/AVC Deblocking Filter on Many-Core Platform , 2012, IEEE Transactions on Multimedia.

[30]  Wen Gao,et al.  Location Discriminative Vocabulary Coding for Mobile Landmark Search , 2011, International Journal of Computer Vision.

[31]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[32]  Wen Gao,et al.  Mining Compact Bag-of-Patterns for Low Bit Rate Mobile Visual Search , 2014, IEEE Transactions on Image Processing.

[33]  Sajib Saha,et al.  Multiple Collaborative Cameras for Multi-Target Tracking Using Color-Based Particle Filter and Contour Information , 2011, DICTAP.

[34]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[35]  John W. McDonough,et al.  Tracking Multiple Speakers with Probabilistic Data Association Filters , 2006, CLEAR.

[36]  Fernando De la Torre,et al.  Factorized Graph Matching , 2016, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Anan Liu,et al.  Multiple Person Tracking by Spatiotemporal Tracklet Association , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[38]  Ramakant Nevatia,et al.  Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors , 2007, International Journal of Computer Vision.

[39]  Christoph Schnörr,et al.  Probabilistic Subgraph Matching Based on Convex Relaxation , 2005, EMMCVPR.

[40]  Vittorio Murino,et al.  Decentralized particle filter for joint individual-group tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Shamik Sural,et al.  Object Tracking Using Background Subtraction and Motion Estimation in MPEG Videos , 2006, ACCV.

[42]  Bernt Schiele,et al.  Pictorial structures revisited: People detection and articulated pose estimation , 2009, CVPR.

[43]  Csaba Beleznai,et al.  Multiple object tracking by hierarchical association of spatio-temporal data , 2010, 2010 IEEE International Conference on Image Processing.

[44]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[45]  Qi Tian,et al.  Task-Dependent Visual-Codebook Compression , 2012, IEEE Transactions on Image Processing.

[46]  Afshin Dehghan,et al.  Part-based multiple-person tracking with partial occlusion handling , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Qi Tian,et al.  Less is More: Efficient 3-D Object Retrieval With Query View Selection , 2011, IEEE Transactions on Multimedia.

[48]  Jenq-Neng Hwang,et al.  Tracking across multiple cameras with overlapping views based on brightness and tangent transfer functions , 2011, 2011 Fifth ACM/IEEE International Conference on Distributed Smart Cameras.

[49]  ZuWhan Kim Real time object tracking based on dynamic feature grouping with background subtraction , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Martial Hebert,et al.  A spectral technique for correspondence problems using pairwise constraints , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[51]  Ian D. Reid,et al.  Stable multi-target tracking in real-time surveillance video , 2011, CVPR 2011.

[52]  Thomas S. Huang,et al.  Online updating appearance generative mixture model for meanshift tracking , 2007, Machine Vision and Applications.

[53]  Rubén Heras Evangelio,et al.  Boosting Multi-hypothesis Tracking by Means of Instance-Specific Models , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[54]  Rafael Muñoz-Salinas,et al.  A Bayesian plan-view map based approach for multiple-person detection and tracking , 2008, Pattern Recognit..

[55]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[56]  Rongrong Ji,et al.  Visual tracking via weakly supervised learning from multiple imperfect oracles , 2014, Pattern Recognit..

[57]  Ramakant Nevatia,et al.  Global data association for multi-object tracking using network flows , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Wei Huang,et al.  Detection and tracking of multiple moving objects in video , 2007, VISAPP.

[59]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[60]  Jianbo Shi,et al.  Balanced Graph Matching , 2006, NIPS.

[61]  Jiri Matas,et al.  Online learning of robust object detectors during unstable tracking , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[62]  Qixiang Ye,et al.  Online feature evaluation for object tracking using Kalman Filter , 2008, 2008 19th International Conference on Pattern Recognition.

[63]  Vincent Lepetit,et al.  Randomized trees for real-time keypoint recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).