Probabilistic modeling of texture transition for fast tracking and delineation

This thesis presents a probabilistic approach to texture boundary detection for tracking applications. We have developed a novel, fast algorithm for Bayesian estimation of texture transition locations from a short sequence of pixels on a scanline. It combines the speed of edge-based line search with the sophistication of Bayesian texture analysis while requiring only a small set of observations. For cases where the observations are too few for reliable Bayesian estimation of the probability of a texture change, we propose a machine learning technique that generates a probabilistic texture transition model from a training dataset of small patches of blending textures. By including enough examples in the training set to accurately model the texture transitions of interest, we can construct a predictor for object boundary tracking that copes with few observations and with demanding cases such as tracking arbitrarily textured objects against cluttered backgrounds.
Object outlines are then obtained by combining the texture crossing probabilities across a set of scanlines. We show that either a rigid geometric model of the tracked object or, in the absence of such a model, smoothness constraints can be used to coalesce the scanline texture crossing probabilities obtained with the methods above. We propose a Hidden Markov Model that robustly aggregates the sparse transition probabilities of scanlines sampled along the projected hypothesis model contour, so that continuous object contours can be extracted by a posteriori maximization of the texture transition probabilities. Stronger geometric constraints, such as an available rigid model of the target, are instead enforced directly by robust stochastic optimization.
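As an illustration of the scanline estimation step, the following minimal sketch computes a posterior over the location of a single texture change in a short sequence of quantized pixel values. It is not the thesis's actual model: it assumes each texture emits independent pixel values from a categorical distribution with a symmetric Dirichlet prior, and a uniform prior over the change location. The names `log_marginal`, `change_posterior`, `num_levels`, and `alpha` are hypothetical.

```python
import math
from collections import Counter


def log_marginal(segment, num_levels, alpha=1.0):
    """Log evidence of a pixel segment under an i.i.d. categorical model
    whose parameters are integrated out against a symmetric Dirichlet prior
    (an assumed texture model, chosen for simplicity)."""
    counts = Counter(segment)
    total = num_levels * alpha
    out = math.lgamma(total) - math.lgamma(total + len(segment))
    for c in counts.values():
        out += math.lgamma(alpha + c) - math.lgamma(alpha)
    return out


def change_posterior(pixels, num_levels, alpha=1.0):
    """Posterior over the change index k (1 <= k < n), where pixels[:k]
    come from one texture and pixels[k:] from another.
    A uniform prior over k is assumed."""
    n = len(pixels)
    ks = range(1, n)
    logs = [
        log_marginal(pixels[:k], num_levels, alpha)
        + log_marginal(pixels[k:], num_levels, alpha)
        for k in ks
    ]
    # Normalize in a numerically stable way (subtract the max log value).
    m = max(logs)
    weights = [math.exp(v - m) for v in logs]
    z = sum(weights)
    return {k: w / z for k, w in zip(ks, weights)}


# Toy scanline: values {0, 1} on the left, {2, 3} on the right.
scanline = [0, 1, 0, 0, 1, 0, 3, 2, 3, 3, 2, 3]
post = change_posterior(scanline, num_levels=4)
best = max(post, key=post.get)
print(best, round(post[best], 3))
```

Because the output is a full distribution rather than a single edge location, it can be passed on to a later stage that combines evidence across several scanlines.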
In addition to being fast, the proposed probabilistic framework provides a single infrastructure for tracking heterogeneous objects: it uses the machine learning-based predictor and the Bayesian estimator interchangeably, in conjunction with robust optimization, to extract object contours. We apply the developed methods to tracking textured and non-textured rigid objects, deformable body outlines, and monocular articulated human motion in challenging conditions. Finally, because it is fast, our method can also serve as an interactive texture segmentation tool.
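The aggregation of per-scanline transition probabilities into a continuous contour can be sketched with a standard Viterbi pass over a chain of scanlines. This is a toy illustration under stated assumptions, not the thesis's implementation: each hidden state is a candidate boundary position on a scanline, and the transition term is an assumed penalty proportional to the jump between neighbouring scanlines. The function `map_contour` and the parameter `smoothness` are hypothetical.

```python
import math


def map_contour(log_probs, smoothness=0.5):
    """Viterbi decoding: choose one boundary position per scanline so that
    the sum of per-scanline log-probabilities, minus smoothness * |jump|
    between neighbouring scanlines, is maximal."""
    n_states = len(log_probs[0])
    score = list(log_probs[0])
    back = []  # back[t][j] = best predecessor of state j at scanline t + 1
    for obs in log_probs[1:]:
        new_score, pointers = [], []
        for j in range(n_states):
            best_i = max(
                range(n_states),
                key=lambda i: score[i] - smoothness * abs(i - j),
            )
            pointers.append(best_i)
            new_score.append(
                score[best_i] - smoothness * abs(best_i - j) + obs[j]
            )
        back.append(pointers)
        score = new_score
    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda i: score[i])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Toy transition probabilities for 4 scanlines with 5 candidate positions.
# The third scanline has a misleading peak at position 0.
probs = [
    [0.05, 0.10, 0.70, 0.10, 0.05],
    [0.05, 0.10, 0.70, 0.10, 0.05],
    [0.40, 0.05, 0.35, 0.10, 0.10],
    [0.05, 0.05, 0.20, 0.60, 0.10],
]
log_probs = [[math.log(p) for p in row] for row in probs]
path = map_contour(log_probs)
print(path)  # -> [2, 2, 2, 3]
```

Taking the most probable position on each scanline independently would give `[2, 2, 0, 3]`. The smoothness term overrides the outlier on the third scanline, which is the kind of robustness the chain model is meant to supply.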
