Visual Tracking Based on Dynamic Coupled Conditional Random Field Model

This paper proposes a novel approach to visual tracking of moving objects based on the dynamic coupled conditional random field (DcCRF) model. The principal idea is to integrate a variety of relevant knowledge about object tracking into a unified dynamic probabilistic framework, which is called the DcCRF model in this paper. Under this framework, the proposed approach integrates spatiotemporal contextual information of motion and appearance, as well as the compatibility between the foreground label and object label. An approximate inference algorithm, i.e., loopy belief propagation, is adopted to conduct the inference. Meanwhile, the background model is adaptively updated to deal with gradual background changes. Experimental results show that the proposed approach can accurately track moving objects (with or without occlusions) in monocular video sequences and outperforms some state-of-the-art methods in tracking and segmentation accuracy.

[1]  Andrew McCallum,et al.  An Introduction to Conditional Random Fields , 2010, Found. Trends Mach. Learn..

[2]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[3]  William T. Freeman,et al.  Signal and Image Processing with Belief Propagation , 2008 .

[4]  Yang Wang,et al.  A dynamic conditional random field model for foreground and shadow segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Patrick Pérez,et al.  Track and Cut: Simultaneous Tracking and Segmentation of Multiple Objects with Graph Cuts , 2008, EURASIP J. Image Video Process..

[6]  Martial Hebert,et al.  Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Osama Masoud,et al.  Detection and classification of vehicles , 2002, IEEE Trans. Intell. Transp. Syst..

[8]  Yang Wang,et al.  A dynamic conditional random field model for object segmentation in image sequences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  P. KaewTrakulPong,et al.  An Improved Adaptive Background Mixture Model for Real-time Tracking with Shadow Detection , 2002 .

[10]  Michael J. Black,et al.  A Fully-Connected Layered Model of Foreground and Background Flow , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Shunsuke Kamijo,et al.  Vehicle infrastructure integration system using vision sensors to prevent accidents in traffic flow , 2011 .

[12]  Stanley T. Birchfield,et al.  An Energy Minimization Approach to Automatic Traffic Camera Calibration , 2013, IEEE Transactions on Intelligent Transportation Systems.

[13]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[14]  Andrew McCallum,et al.  Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data , 2004, J. Mach. Learn. Res..

[15]  Guillermo Sapiro,et al.  Video SnapCut: robust video object cutout using localized classifiers , 2009, SIGGRAPH 2009.

[16]  Martin D. Levine,et al.  Automated Real-Time Detection of Potentially Suspicious Behavior in Public Transport Areas , 2013, IEEE Transactions on Intelligent Transportation Systems.

[17]  Andrew Hunter,et al.  A spatially distributed model for foreground segmentation , 2009, Image Vis. Comput..

[18]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Katsushi Ikeuchi,et al.  Traffic monitoring and accident detection at intersections , 2000, IEEE Trans. Intell. Transp. Syst..

[20]  Mohan M. Trivedi,et al.  Trajectory Learning for Activity Understanding: Unsupervised, Multilevel, and Long-Term Adaptive Approach , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Seong G. Kong,et al.  Visual Analysis of Eye State and Head Pose for Driver Alertness Monitoring , 2013, IEEE Transactions on Intelligent Transportation Systems.

[22]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[23]  Antonio Manuel López Peña,et al.  Multiple target tracking for intelligent headlights control , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[24]  Erik Sudderth,et al.  Signal and Image Processing with Belief Propagation [DSP Applications] , 2008, IEEE Signal Processing Magazine.

[25]  Avinash C. Kak,et al.  Computer Vision and Pattern Recognition 2010 A Probabilistic Framework for Joint Segmentation and Tracking , 2022 .

[26]  Mathias Perrollaz,et al.  Probabilistic Integration of Intensity and Depth Information for Part-Based Vehicle Detection , 2013, IEEE Transactions on Intelligent Transportation Systems.

[27]  Nikolaos Papanikolopoulos,et al.  Clustering of Vehicle Trajectories , 2010, IEEE Transactions on Intelligent Transportation Systems.

[28]  Avinash C. Kak,et al.  Tracking Vehicles Through Shadows and Occlusions in Wide-Area Aerial Video , 2014, IEEE Transactions on Aerospace and Electronic Systems.

[29]  Mohan M. Trivedi,et al.  Looking at Vehicles on the Road: A Survey of Vision-Based Vehicle Detection, Tracking, and Behavior Analysis , 2013, IEEE Transactions on Intelligent Transportation Systems.

[30]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[31]  Bo Li,et al.  Component-Based License Plate Detection Using Conditional Random Field Model , 2013, IEEE Transactions on Intelligent Transportation Systems.

[32]  Lin Zhu,et al.  Tracking multiple objects through occlusion with online sampling and position estimation , 2008, Pattern Recognit..

[33]  Bing-Fei Wu,et al.  A New Approach to Video-Based Traffic Surveillance Using Fuzzy Hybrid Information Inference Mechanism , 2013, IEEE Transactions on Intelligent Transportation Systems.

[34]  Mohan M. Trivedi,et al.  Part-Based Pedestrian Detection and Feature-Based Tracking for Driver Assistance: Real-Time, Robust Algorithms, and Evaluation , 2013, IEEE Transactions on Intelligent Transportation Systems.

[35]  Chunhong Pan,et al.  Forward–Backward Mean-Shift for Visual Tracking With Local-Background-Weighted Histogram , 2013, IEEE Transactions on Intelligent Transportation Systems.

[36]  W. Eric L. Grimson,et al.  Learning coupled conditional random field for image decomposition with application on object categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Aurélie Bugeau,et al.  Tracking with Occlusions via Graph Cuts , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Wei Zhang,et al.  Multilevel Framework to Detect and Handle Vehicle Occlusion , 2008, IEEE Transactions on Intelligent Transportation Systems.

[39]  Lu Wang,et al.  Multiple-Human Tracking by Iterative Data Association and Detection Update , 2014, IEEE Transactions on Intelligent Transportation Systems.

[40]  Michael F. Cohen,et al.  Monocular Video Foreground/Background Segmentation by Tracking Spatial-Color Gaussian Mixture Models , 2007, 2007 IEEE Workshop on Motion and Video Computing (WMVC'07).

[41]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[42]  Ivan V. Bajic,et al.  Video Object Tracking in the Compressed Domain Using Spatio-Temporal Markov Random Fields , 2013, IEEE Transactions on Image Processing.