Real-time robust target tracking in videos via graph-cuts

Video tracking is a fundamental problem in computer vision with many applications. The goal of video tracking is to isolate a target object from its background across a sequence of frames. Tracking is inherently a three dimensional problem in that it incorporates the time dimension. As such, the computational efficiency of video segmentation is a major challenge. In this paper we present a generic and robust graph-theory-based tracking scheme in videos. Unlike previous graph-based tracking methods, the suggested approach treats motion as a pixel's property (like color or position) rather than as consistency constraints (i.e., the location of the object in the current frame is constrained to appear around its location in the previous frame shifted by the estimated motion) and solves the tracking problem optimally (i.e., neither heuristics nor approximations are applied). The suggested scheme is so robust that it allows for incorporating the computationally cheaper MPEG-4 motion estimation schemes. Although block matching techniques generate noisy and coarse motion fields, their use allows faster computation times as broad variety of off-the-shelf software and hardware components that specialize in performing this task are available. The evaluation of the method on standard and non-standard benchmark videos shows that the suggested tracking algorithm can support a fast and accurate video tracking, thus making it amenable to real-time applications.

[1]  Munchurl Kim,et al.  Graph-based object detection and tracking in H.264/AVC bitstreams for surveillance video , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[2]  Dorit S. Hochbaum,et al.  A Computational Study of the Pseudoflow and Push-Relabel Algorithms for the Maximum Flow Problem , 2009, Oper. Res..

[3]  Dorit S. Hochbaum Polynomial Time Algorithms for Ratio Regions and a Variant of Normalized Cut , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Daniel Freedman,et al.  Illumination-invariant tracking via graph cuts , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[6]  Aurélie Bugeau,et al.  Tracking with Occlusions via Graph Cuts , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Ian D. Reid,et al.  Stable multi-target tracking in real-time surveillance video , 2011, CVPR 2011.

[8]  Luc Van Gool,et al.  Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Kai-Kuang Ma,et al.  Adaptive rood pattern search for fast block-matching motion estimation , 2002, IEEE Trans. Image Process..

[10]  Ning Xu,et al.  Object segmentation using graph cuts based active contours , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[11]  Jie Yu,et al.  Multi-target Tracking in Crowded Scenes , 2011, DAGM-Symposium.

[12]  Patrick Pérez,et al.  Track and Cut: Simultaneous Tracking and Segmentation of Multiple Objects with Graph Cuts , 2008, EURASIP J. Image Video Process..

[13]  Yogesh Rathi,et al.  Multi-Object Tracking Through Clutter Using Graph Cuts , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[14]  Rachid Deriche,et al.  Universität Des Saarlandes Fachrichtung 6.1 – Mathematik Colour, Texture, and Motion in Level Set Based Segmentation and Tracking Colour, Texture, and Motion in Level Set Based Segmentation and Tracking , 2022 .

[15]  Barak Fishbain,et al.  Real-time stabilization of long range observation system turbulent video , 2007, Journal of Real-Time Image Processing.

[16]  Ofer Hadar,et al.  Super-resolution mosaicing from MPEG compressed video , 2005, ICIP.

[17]  Kai-Kuang Ma,et al.  A new diamond search algorithm for fast block-matching motion estimation , 2000, IEEE Trans. Image Process..

[18]  Mubarak Shah,et al.  Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Oscar C. Au,et al.  Highly efficient predictive zonal algorithms for fast block-matching motion estimation , 2002, IEEE Trans. Circuits Syst. Video Technol..

[20]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[21]  Mubarak Shah,et al.  Recognizing human actions using multiple features , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Dorit S. Hochbaum,et al.  The Pseudoflow Algorithm: A New Algorithm for the Maximum-Flow Problem , 2008, Oper. Res..

[23]  Joanna Isabelle Olszewska,et al.  Multi-feature vector flow for active contour tracking , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.