An online learning target tracking method based on extreme learning machine

Target tracking is one of the important tasks in computer vision. It aims to detect and track one or more particular objects in videos. The target and background may change in the process of tracking. In order to solve this problem, this paper proposes an online learning target tracking method based on extreme learning machine (ELM). First of all, we capture the target and background regions in the first few frames of video and extract the histograms of oriented gradients (HOG) features of regions into ELM. Secondly, using the method of sliding window to detect the candidate region after loading a new image. Finally, according to the tracking result, the classifier can be updated for online learning. In order to promote the detection speed, this method predicts a region in the current frame according to the target position of the previous frame. The predicted region is called the candidate region. Experiment results have shown that this proposed method not only achieves high accuracy but also can adapt to the changes of target and background.

[1]  Gregory D. Hager,et al.  Efficient Region Tracking With Parametric Models of Geometry and Illumination , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Guang-Bin Huang,et al.  An Insight into Extreme Learning Machines: Random Neurons, Random Features and Kernels , 2014, Cognitive Computation.

[3]  D HagerGregory,et al.  Efficient Region Tracking With Parametric Models of Geometry and Illumination , 1998 .

[4]  Clark F. Olson,et al.  Maximum-likelihood template matching , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[5]  Horst Bischof,et al.  On-line Boosting and Vision , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[8]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[9]  Rudolph van der Merwe,et al.  The unscented Kalman filter for nonlinear estimation , 2000, Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373).

[10]  Luc Van Gool,et al.  Hough Forests for Object Detection, Tracking, and Action Recognition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, CVPR.

[12]  Yi Liu,et al.  Video object segmentation and tracking using /spl psi/-learning classification , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[14]  Leslie G. Valiant,et al.  Cryptographic Limitations on Learning Boolean Formulae and Finite Automata , 1993, Machine Learning: From Theory to Applications.

[15]  Michael J. Black,et al.  EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation , 1996, International Journal of Computer Vision.

[16]  Jiri Matas,et al.  Training sequential on-line boosting classifier for visual tracking , 2008, 2008 19th International Conference on Pattern Recognition.

[17]  Xin Li,et al.  Contour-based object tracking with occlusion handling in video acquired using mobile cameras , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Shin Ishii,et al.  Switching particle filters for efficient visual tracking , 2006, Robotics Auton. Syst..

[19]  Ignacio Parra,et al.  Combination of Feature Extraction Methods for SVM Pedestrian Detection , 2007, IEEE Transactions on Intelligent Transportation Systems.

[20]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[21]  George K. I. Mann,et al.  A Single-Object Tracking Method for Robots using Object-Based Visual Attention , 2012, Int. J. Humanoid Robotics.

[22]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[23]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[24]  Chee Kheong Siew,et al.  Universal Approximation using Incremental Constructive Feedforward Networks with Random Hidden Nodes , 2006, IEEE Transactions on Neural Networks.

[25]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26]  Kyunghee Lee,et al.  Eye and face detection using SVM , 2004, Proceedings of the 2004 Intelligent Sensors, Sensor Networks and Information Processing Conference, 2004..