Tracking failure detection by imitating human visual perception

In this paper, we present a tracking failure detection method by imitating human visual system. By adopting log-polar transformation, we could simulate properties of retina image, such as rotation and scaling invariance and foveal predominance. The rotation and scaling invariance helps to reduce false alarms caused by pose changes and intensify translational changes. Foveal predominant property helps to detect the tracking failing moment by amplifying the resolution around focus (tracking box center) and blurring the peripheries. Each ganglion cell corresponds to a pixel of log-polar image, and its adaptation is modeled as Gaussian mixture model. Its validity is shown through various experiments.

[1]  H. Holländer,et al.  Size and distribution of ganglion cells in the human retina , 2004, Anatomy and Embryology.

[2]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Patrick Bouthemy,et al.  A Region Tracking Method with Failure Detection for an Interactive Video Indexing Environment , 1999, VISUAL.

[4]  Qian Chen,et al.  K-means Tracker: A General Algorithm for Tracking People , 2006, J. Multim..

[5]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Bruce A. Draper,et al.  Visual object tracking using adaptive correlation filters , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Alexandre Bernardino,et al.  A review of log-polar imaging for visual perception in robotics , 2010, Robotics and Autonomous Systems.

[8]  Ming-Hsuan Yang,et al.  Incremental Learning for Visual Tracking , 2004, NIPS.

[9]  Alin Achim,et al.  18th IEEE International Conference on Image Processing, ICIP 2011, Brussels, Belgium, September 11-14, 2011 , 2011, ICIP.

[10]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[11]  Mubarak Shah,et al.  Probabilistic Modeling of Scene Dynamics for Applications in Visual Surveillance , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.