Intrackability: Characterizing Video Statistics and Pursuing Video Representations

Videos of natural environments contain a wide variety of motion patterns of varying complexities which are represented by many different models in the vision literature. In many situations, a tracking algorithm is formulated as maximizing a posterior probability. In this paper, we propose to measure the video complexity by the entropy of the posterior probability, called the intrackability, to characterize the video statistics and pursue optimal video representations. Based on the definition of intrackability, our study is aimed at three objectives. Firstly, we characterize video clips of natural scenes by intrackability. We calculate the intrackabilities of image points to measure the local inferential uncertainty, and collect the histogram of the intrackabilities over the video in space and time as the global video statistics. We find that a PCA scatter-plot based on the first two principle components of intrackability histograms can reflect the major variations, i.e., image scaling and object density, in natural video clips. Secondly, we show that different video representations, including deformable contours, tracking kernels with various appearance features, dense motion fields, and dynamic texture models, are connected by the change of intrackability and thus develop a simple criterion for model transition and for pursuing the optimal video representation. Thirdly, we derive the connections between the intrackability measure and other criteria in the literature such as the Shi-Tomasi texturedness measure, conditional number, and Harris-Stephens R score, and compare with the Shi-Tomasi measure in tracking experiments.

[1]  Kevin Nickels,et al.  Estimating uncertainty in SSD-based feature tracking , 2002, Image Vis. Comput..

[2]  Tony Lindeberg Detecting salient blob-like image structures and their scales , 1994 .

[3]  J. Koenderink The structure of images , 2004, Biological Cybernetics.

[4]  Martin Szummer,et al.  Temporal texture modeling , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[5]  Gang Hua,et al.  Efficient Optimal Kernel Placement for Reliable Visual Tracking , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[7]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[8]  Sinisa Segvic,et al.  Enhancing the Point Feature Tracker by Adaptive Modelling of the Feature Support , 2006, ECCV.

[9]  Stochastic Rigidity: Image Registration for Nowhere-Static Scenes , 2001, ICCV.

[10]  Song-Chun Zhu,et al.  From Information Scaling of Natural Images to Regimes of Statistical Models , 2007 .

[11]  David J. Fleet,et al.  Probabilistic Detection and Tracking of Motion Boundaries , 2000, International Journal of Computer Vision.

[12]  Jake K. Aggarwal,et al.  Temporal spatio-velocity transform and its application to tracking and interaction , 2004, Comput. Vis. Image Underst..

[13]  Song-Chun Zhu,et al.  Perceptual Scale-Space and Its Applications , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  D Marr,et al.  Bandpass channels, zero-crossings, and early visual information processing. , 1979, Journal of the Optical Society of America.

[16]  Z. Pylyshyn,et al.  Dynamics of target selection in multiple object tracking (MOT). , 2006, Spatial vision.

[17]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Ying Zhu,et al.  On optimizing template matching via performance characterization , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[19]  Cor J. Veenman,et al.  Resolving Motion Correspondence for Densely Moving Points , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Yanxi Liu,et al.  Online Selection of Discriminative Tracking Features , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Z. Pylyshyn Some puzzling findings in multiple object tracking: I. Tracking without keeping track of object identities , 2004 .

[22]  Kyoung Mu Lee,et al.  Visual tracking via geometric particle filtering on the affine group with optimal importance functions , 2009, CVPR.

[23]  Yandong Tang,et al.  Flow mosaicking: Real-time pedestrian counting without scene-specific learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  F. Porikli,et al.  Recurrent Tracking using Multifold Consistency , 2009 .

[25]  A. Fitzgibbon Stochastic rigidity: image registration for nowhere-static scenes , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[26]  Emanuele Trucco,et al.  Making good features track better , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[27]  L. Gool,et al.  Probabilistic object tracking using multiple features , 2004, ICPR 2004.

[28]  Hans-Hellmut Nagel,et al.  Volumetric model and 3D trajectory of a moving car derived from monocular TV frame sequences of a street scene , 1981, Comput. Graph. Image Process..

[29]  Nong Sang,et al.  Intrackability theory and application , 2007, International Symposium on Multispectral Image Processing and Pattern Recognition.

[30]  Andrew P. Witkin,et al.  Scale-Space Filtering , 1983, IJCAI.

[31]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[32]  Andrew Blake,et al.  A Probabilistic Exclusion Principle for Tracking Multiple Objects , 2000, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[33]  Mubarak Shah,et al.  A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Dorin Comaniciu,et al.  An information fusion framework for robust shape tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[36]  Yandong Tang,et al.  Flow mosaicking: Real-time pedestrian counting without scene-specific learning , 2009, CVPR.

[37]  Song-Chun Zhu,et al.  Modeling textured motion : particle, wave and sketch , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[38]  Z. Pylyshyn Some puzzling findings in multiple object tracking (MOT): II. Inhibition of moving nontargets , 2006 .

[39]  Zheng Li,et al.  Dynamic Feature Cascade for Multiple Object Tracking with Trackability Analysis , 2007, EMMCVPR.

[40]  Eero P. Simoncelli,et al.  On Advances in Statistical Modeling of Natural Images , 2004, Journal of Mathematical Imaging and Vision.

[41]  Mubarak Shah,et al.  Floor Fields for Tracking in High Density Crowd Scenes , 2008, ECCV.

[42]  Patrick Pérez,et al.  On Uncertainties, Random Features and Object Tracking , 2007, 2007 IEEE International Conference on Image Processing.

[43]  Stefano Soatto,et al.  Dynamic Textures , 2003, International Journal of Computer Vision.

[44]  Robert T. Collins,et al.  Mean-shift blob tracking through scale space , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..