Fusion by optimal dynamic mixtures of proposal distributions

We propose a fusion framework to integrate multiple cues for tracking by finding a set of optimal dynamic weights for different tracking modalities. In the setup of Bayesian sequential estimation, we give an optimal criterion to find the dynamic weight for each modality: Using a linear combination of the proposal distributions from multiple cues to approach the posterior distribution p(xt|yt). The fusion problem is then formulated as an optimization problem with a non-convex objective function. We further convert the optimization problem to a constrained convex programming problem. The equations for finding the global optimal solution are given and an approximate analytical solution is derived. The derived approximate analytical solution is justified by comparing to the fusion weights/mixture weights in. The fusion framework can find out reliable cues and rely more on them dynamically. We test the proposed fusion framework for human tracking on a very challenging surveillance video taken at crowded subway station. We also test the fusion framework for articulated tracking. The claim that the proposed fusion framework can integrate weak modalities to improve tracking performance is supported by the promising results.

[1]  David J. Fleet,et al.  Robust Online Appearance Models for Visual Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Patrick Pérez,et al.  Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.

[3]  Qiang Ji,et al.  Online Spatial-temporal Data Fusion for Robust Adaptive Tracking , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Stanley T. Birchfield,et al.  Elliptical head tracking using intensity gradients and color histograms , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[5]  Gang Hua,et al.  Tracking appearances with occlusions , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[6]  Saharon Shelah,et al.  Black Boxes , 2008, 0812.0656.

[7]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[8]  Shai Avidan,et al.  Ensemble Tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Larry S. Davis,et al.  Efficient mean-shift tracking via a new similarity measure , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Tony X. Han,et al.  Online appearance learning by template prediction , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[11]  Ming Yang,et al.  Multiple Collaborative Kernel Tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Ying Wu,et al.  Robust Visual Tracking by Integrating Multiple Cues Based on Co-Inference Learning , 2004, International Journal of Computer Vision.

[13]  Ehud Rivlin,et al.  A General Framework for Combining Visual Trackers – The "Black Boxes" Approach , 2006, International Journal of Computer Vision.

[14]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[15]  Michael J. Black,et al.  EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation , 1996, International Journal of Computer Vision.

[16]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[17]  Patrick Pérez,et al.  Towards Improved Observation Models for Visual Tracking: Selective Adaptation , 2002, ECCV.

[18]  Songhwai Oh,et al.  Markov chain Monte Carlo data association for general multiple-target tracking problems , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[19]  F. Dellaert,et al.  A Rao-Blackwellized particle filter for EigenTracking , 2004, CVPR 2004.

[20]  Yong Rui,et al.  Real-time speaker tracking using particle filter sensor fusion , 2004, Proceedings of the IEEE.

[21]  Bo Wu,et al.  Boosted Markov Chain Monte Carlo Data Association for Multiple Target Detection and Tracking , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[22]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[23]  Ying Wu,et al.  Differential Tracking based on Spatial-Appearance Model (SAM) , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[24]  Thomas S. Huang,et al.  JPDAF based HMM for real-time contour tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[25]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Ramakant Nevatia,et al.  Tracking multiple humans in complex situations , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  D. Kriegman,et al.  Visual tracking using learned linear subspaces , 2004, CVPR 2004.

[29]  Patrick Pérez,et al.  Maintaining multimodality through mixture tracking , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[30]  D. Luenberger Optimization by Vector Space Methods , 1968 .

[31]  Shai Avidan,et al.  Support Vector Tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[32]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[33]  Gregory Hager,et al.  Multiple kernel tracking with SSD , 2004, CVPR 2004.

[34]  R. Collins,et al.  On-line selection of discriminative tracking features , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[35]  Yanxi Liu,et al.  Online Selection of Discriminative Tracking Features , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[37]  Robert T. Collins,et al.  Mean-shift blob tracking through scale space , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[38]  James J. Little,et al.  A Boosted Particle Filter: Multitarget Detection and Tracking , 2004, ECCV.

[39]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[40]  Ramakant Nevatia,et al.  Tracking of Multiple, Partially Occluded Humans based on Static Body Part Detection , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[41]  Simon Baker,et al.  Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.