Visual object tracking using adaptive correlation filters

Although not commonly used, correlation filters can track complex objects through rotations, occlusions and other distractions at over 20 times the rate of current state-of-the-art techniques. The oldest and simplest correlation filters use simple templates and generally fail when applied to tracking. More modern approaches such as ASEF and UMACE perform better, but their training needs are poorly suited to tracking. Visual tracking requires robust filters to be trained from a single frame and dynamically adapted as the appearance of the target object changes. This paper presents a new type of correlation filter, a Minimum Output Sum of Squared Error (MOSSE) filter, which produces stable correlation filters when initialized using a single frame. A tracker based upon MOSSE filters is robust to variations in lighting, scale, pose, and nonrigid deformations while operating at 669 frames per second. Occlusion is detected based upon the peak-to-sidelobe ratio, which enables the tracker to pause and resume where it left off when the object reappears.

[1]  Bruce A. Draper,et al.  Simple real-time human detection using a single correlation filter , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.

[2]  P. Khosla,et al.  Face Verification using Correlation Filters , 2002 .

[3]  Ehud Rivlin,et al.  Robust Fragments-based Tracking using the Integral Histogram , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Nikunj C. Oza,et al.  Online Ensemble Learning , 2000, AAAI/IAAI.

[5]  B. V. Vijaya Kumar,et al.  Unconstrained correlation filters. , 1994, Applied optics.

[6]  B. V. Kumar,et al.  Minimum squared error synthetic discriminant functions , 1992 .

[7]  B. V. Vijaya Kumar,et al.  Minimum-variance synthetic discriminant functions , 1986 .

[8]  Xiaoqin Zhang,et al.  Graph Based Discriminative Learning for Robust and Efficient Object Tracking , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[9]  P Refregier Optimal trade-off filters for noise robustness, sharpness of the correlation peak, and Horner efficiency. , 1991, Optics letters.

[10]  Bruce A. Draper,et al.  Average of Synthetic Exact Filters , 2009, CVPR.

[11]  William H. Press,et al.  Numerical recipes in C , 2002 .

[12]  Ming-Hsuan Yang,et al.  Visual tracking with online Multiple Instance Learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  David J. Fleet,et al.  Robust Online Appearance Models for Visual Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Marios Savvides,et al.  Reduced complexity face recognition using advanced correlation filters and fourier subspace methods for biometric applications , 2004 .

[15]  D Casasent,et al.  Unified synthetic discriminant function computational formulation. , 1984, Applied optics.

[16]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[17]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[18]  D. Casasent,et al.  Minimum average correlation energy filters. , 1987, Applied optics.

[19]  D Casasent,et al.  Multivariant technique for multiclass pattern recognition. , 1980, Applied optics.