Adaptive Edge-Enhanced Correlation Based Robust And Real-Time Visual Tracking Framework And Its Deployment In Machine Vision Systems

An adaptive edge-enhanced correlation based robust and real-time visual tracking framework, and two machine vision systems based on the framework are proposed. The visual tracking algorithm can track any object of interest in a video acquired from a stationary or moving camera. It can handle the real-world problems, such as noise, clutter, occlusion, uneven illumination, varying appearance, orientation, scale, and velocity of the maneuvering object, and object fading and obscuration in low contrast video at various zoom levels. The proposed machine vision systems are an active camera tracking system and a vision based system for a UGV (unmanned ground vehicle) to handle a road intersection. The core of the proposed visual tracking framework is an Edge Enhanced Back-propagation neural-network Controlled Fast Normalized Correlation (EEBCFNC), which makes the object localization stage efficient and robust to noise, object fading, obscuration, and uneven illumination. The incorrect template initialization and template-drift problems of the traditional correlation tracker are handled by a best-match rectangle adjustment algorithm. The varying appearance of the object and the short-term neighboring clutter are addressed by a robust template updating scheme. The background clutter and varying velocity of the object are handled by looking for the object only in a dynamically resizable search window, in which the likelihood of the presence of the object is high. The search window is created using the prediction and the prediction error of a Kalman filter. The effect of the long-term neighboring clutter is reduced by weighting the template pixels using a 2D Gaussian weighting window with adaptive standard deviation parameters. The occlusion is addressed by a data association technique. The varying scale of the object is handled by correlating the search window with three scales of the template, and accepting the best-match region that produces the highest peak in the three correlation surfaces. The proposed visual tracking algorithm is compared with the traditional correlation tracker and, in some cases, with the mean-shift and the condensation trackers on real-world imagery. The proposed algorithm outperforms them in robustness and executes at the speed of 25 to 75 frames/second depending on the current sizes of the adaptive template and the dynamic search window. The proposed active camera tracking system can be used to get the target always in focus (i.e. in the center of the video frame) regardless of the motion of the target in the scene. It feeds the target coordinates estimated by the visual tracking framework into a predictive open-loop car-following control (POL-CFC) algorithm which in turn generates the precise control signals for the pan-tilt motion of the camera. The performance analysis of the system shows that its percent overshoot, rise time, and maximum steady state error are 0%, 1.7 second, and ±1 pixel, respectively. The hardware of the proposed vision based system, that enables a UGV to handle a road intersection, consists of three on-board computers and three cameras (mounted on top of the UGV) looking towards the other three roads merging at the intersection. The software in each computer consists of a vehicle detector, the proposed tracker, and a finite state machine model (FSM) of the traffic. The information from the three FSMs is combined to make an autonomous decision whether it is safe for the UGV to cross the intersection or not. The results of the actual UGV experiments are provided to validate the robustness of the proposed system. Index terms visual tracking, adaptive edge-enhanced correlation, active camera, unmanned ground vehicle.

[1]  T. Ross Fuzzy Logic with Engineering Applications , 1994 .

[2]  Roberto Manduchi,et al.  Accuracy analysis for correlation-based image registration algorithms , 1993, 1993 IEEE International Symposium on Circuits and Systems.

[3]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[4]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[5]  Martin Fodslette Meiller A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning , 1993 .

[6]  Christopher Rasmussen,et al.  Combining laser range, color, and texture cues for autonomous road following , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[7]  John Fitts Precision correlation tracking via optimal weighting functions , 1979, 1979 18th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[8]  Edward Brown,et al.  Machine Vision for Obstacle Detection and Ordnance Recognition , 1996 .

[9]  Rómer Rosales,et al.  3D trajectory recovery for tracking multiple objects and trajectory guided recognition of actions , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[10]  Guanrong Chen,et al.  Introduction to Fuzzy Sets, Fuzzy Logic, and Fuzzy Control Systems , 2000 .

[11]  Arif Mahmood,et al.  Early Termination Algorithms for Correlation Coefficient Based Block Matching , 2007, 2007 IEEE International Conference on Image Processing.

[12]  Shai Avidan,et al.  Support Vector Tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13]  Fatih Murat Porikli,et al.  Multi-Kernel Object Tracking , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[14]  Franklin C. Crow,et al.  Summed-area tables for texture mapping , 1984, SIGGRAPH.

[15]  A. Shashua,et al.  Pedestrian detection for driving assistance systems: single-frame classification and system level performance , 2004, IEEE Intelligent Vehicles Symposium, 2004.

[16]  Bo Tao,et al.  Analysis of image registration noise due to rotationally dependent aliasing , 2003, J. Vis. Commun. Image Represent..

[17]  Gary Bradski,et al.  Learning-Based Computer Vision with Intels Open Source Computer Vision Library , 2005 .

[18]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  J. P. Lewis Fast Normalized Cross-Correlation , 2010 .

[20]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[21]  Bohyung Han,et al.  Bayesian Filtering and Integral Image for Visual Tracking , 2005 .

[22]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[23]  M. Noman Jafri,et al.  Improved Phase Correlation Matching , 2008, ICISP.

[24]  Yu Qilian,et al.  Phase correlation-based matching method with sub-pixel accuracy for translated and rotated images , 2002, 6th International Conference on Signal Processing, 2002..

[25]  Hassan Foroosh,et al.  Extension of phase correlation to subpixel registration , 2002, IEEE Trans. Image Process..

[26]  R. E. Kalman,et al.  New Results in Linear Filtering and Prediction Theory , 1961 .

[27]  Mark S. Nixon,et al.  Feature Extraction and Image Processing , 2002 .

[28]  James F. Riker,et al.  Precision trackpoint control via correlation track referenced to simulated imagery , 1992, Defense, Security, and Sensing.

[29]  Larry H. Matthies,et al.  Stereo vision for planetary rovers: Stochastic modeling to near real-time implementation , 1991, Optics & Photonics.

[30]  Paul W. Fieguth,et al.  Color-based tracking of heads and other mobile objects at video frame rates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Hung T. Nguyen,et al.  A First Course in Fuzzy Logic , 1996 .

[33]  Laurene V. Fausett,et al.  Fundamentals Of Neural Networks , 1994 .

[34]  J. Ahmed,et al.  Best-match rectangle adjustment algorithm for persistent and precise correlation tracking , 2007, 2007 International Conference on Machine Vision.

[35]  Benjamin C. Kuo,et al.  AUTOMATIC CONTROL SYSTEMS , 1962, Universum:Technical sciences.

[36]  Christopher Fagiani,et al.  An Evaluation of Tracking Methods for Human-Computer Interaction , 2002 .

[37]  Xin Li,et al.  Contour-based object tracking with occlusion handling in video acquired using mobile cameras , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Fatih Murat Porikli,et al.  Integral histogram: a fast way to extract histograms in Cartesian spaces , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[39]  E. Brookner Tracking and Kalman Filtering Made Easy , 1998 .

[40]  Jean-Marc Odobez,et al.  AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking , 2004, MLMI.

[41]  Michel Defrise,et al.  Symmetric Phase-Only Matched Filtering of Fourier-Mellin Transforms for Image Registration and Recognition , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  Yuntao Cui,et al.  Indoor monitoring via the collaboration between a peripheral sensor and a foveal sensor , 1998, Proceedings 1998 IEEE Workshop on Visual Surveillance.

[43]  N. Mir-Nasiri Camera-based 3D Tracking System , 2005, TENCON 2005 - 2005 IEEE Region 10 Conference.

[44]  Alan L. Yuille,et al.  Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Mubarak Shah,et al.  Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[47]  Alexandros Eleftheriadis,et al.  Automatic face location detection and tracking for model-assisted coding of video teleconferencing sequences at low bit-rates , 1995, Signal Process. Image Commun..

[48]  Mohinder S. Grewal,et al.  Kalman Filtering: Theory and Practice Using MATLAB , 2001 .

[49]  M. N. Jafri,et al.  Target Tracking in an Image Sequence Using Wavelet Features and a Neural Network , 2005, TENCON 2005 - 2005 IEEE Region 10 Conference.

[50]  Fatih Murat Porikli,et al.  Covariance Tracking using Model Update Based on Lie Algebra , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[51]  David Suter,et al.  Effective Appearance Model and Similarity Measure for Particle Filtering and Visual Tracking , 2006, ECCV.

[52]  Jamil Ahmad,et al.  Design and Implementation of a Neural Network for Real-Time Object Tracking , 2007 .

[53]  Scott E. Umbaugh,et al.  Computer Imaging: Digital Image Analysis and Processing , 2005 .

[54]  C. D. Kuglin,et al.  The phase correlation image alignment method , 1975 .

[55]  Sebastien Wong,et al.  Advanced correlation tracking of objects in cluttered imagery , 2005 .

[56]  Mubarak Shah,et al.  Real-time edge-enhanced dynamic correlation and predictive open-loop car-following control for robust tracking , 2008, Machine Vision and Applications.

[57]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[58]  Michael J. Black,et al.  EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation , 1996, International Journal of Computer Vision.

[59]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[60]  M. Noman Jafri,et al.  Decaying extension based phase correlation for robust object localization in full search space , 2008, 2008 16th European Signal Processing Conference.

[61]  Stefanos D. Kollias,et al.  An efficient fully unsupervised video object segmentation scheme using an adaptive neural-network classifier architecture , 2003, IEEE Trans. Neural Networks.

[62]  Roberto Manduchi,et al.  Fast and reliable obstacle detection and segmentation for cross-country navigation , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[63]  Jake K. Aggarwal,et al.  A hierarchical Bayesian network for event recognition of human actions and interactions , 2004, Multimedia Systems.

[64]  Ehud Rivlin,et al.  Robust Fragments-based Tracking using the Integral Histogram , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[65]  J.P. Hespanha,et al.  Multiple-agent probabilistic pursuit-evasion games , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[66]  Rachid Deriche,et al.  Geodesic Active Regions and Level Set Methods for Supervised Texture Segmentation , 2002, International Journal of Computer Vision.

[67]  Greg Welch,et al.  Welch & Bishop , An Introduction to the Kalman Filter 2 1 The Discrete Kalman Filter In 1960 , 1994 .

[68]  Peter S. Maybeck,et al.  Target tracking using infrared measurements and laser illumination , 1994 .

[69]  Raúl Rojas,et al.  Intelligent tracking Technical Report B-0315 , 2003 .

[70]  Amir Averbuch,et al.  Robust phase correlation , 2004, ICPR 2004.

[71]  Tien-Hsin Chao,et al.  MACH filter synthesizing for detecting targets in cluttered environment for grayscale optical correlator , 1999, Defense, Security, and Sensing.

[72]  Luc Van Gool,et al.  An adaptive color-based particle filter , 2003, Image Vis. Comput..

[73]  James W. Davis,et al.  Real-time closed-world tracking , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[74]  Allen R. Hanson,et al.  Qualitative obstacle detection , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[75]  Cor J. Veenman,et al.  Resolving Motion Correspondence for Densely Moving Points , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[76]  Jake K. Aggarwal,et al.  Segmentation and recognition of continuous human activity , 2001, Proceedings IEEE Workshop on Detection and Recognition of Events in Video.

[77]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[78]  P Refregier Optimal trade-off filters for noise robustness, sharpness of the correlation peak, and Horner efficiency. , 1991, Optics letters.

[79]  Samuel S. Blackman,et al.  Design and Analysis of Modern Tracking Systems , 1999 .

[80]  Alonzo Kelly,et al.  Obstacle detection for unmanned ground vehicles: a progress report , 1995 .

[81]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[82]  S. Munkeby,et al.  UGV Demo II reuse technology for AHS , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[83]  Mubarak Shah,et al.  A Vision-Based System for a UGV to Handle a Road Intersection , 2007, AAAI.

[84]  Olga Veksler,et al.  Fast variable window for stereo correspondence using integral images , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[85]  Joseph N. Wilson,et al.  Handbook of computer vision algorithms in image algebra , 1996 .

[86]  B. V. Vijaya Kumar,et al.  Unconstrained correlation filters. , 1994, Applied optics.