Probabilistic and Voting Approaches to Cue Integration for Figure-Ground Segmentation

This paper describes techniques for fusing the output of multiple cues to robustly and accurately segment foreground objects from the background in image sequences. Two different methods for cue integration are presented and tested. The first is a probabilistic approach which at each pixel computes the likelihood of observations over all cues before assigning pixels to foreground or background layers using Bayes Rule. The second method allows each cue to make a decision independent of the other cues before fusing their outputs with a weighted sum. A further important contribution of ourwork concerns demonstrating how models for some cues can be learnt and subsequently adapted online. In particular, regions of coherent motion are used to train distributions for colour and for a simple texture descriptor. An additional aspect of our framework is in providing mechanisms for suppressing cues when they are believed to be unreliable, for instance during training or when they disagree with the general consensus. Results on extended video sequences are presented.

[1]  Michael J. Black,et al.  The Robust Estimation of Multiple Motions: Parametric and Piecewise-Smooth Flow Fields , 1996, Comput. Vis. Image Underst..

[2]  Bernt Schiele,et al.  Towards robust multi-cue integration for visual tracking , 2001, Machine Vision and Applications.

[3]  Ying Wu,et al.  A co-inference approach to robust visual tracking , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[4]  Jitendra Malik,et al.  Color- and texture-based image segmentation using EM and its application to content-based image retrieval , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[5]  Richard Szeliski,et al.  An integrated Bayesian approach to layer extraction from image sequences , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Ying Wu,et al.  Bootstrap Initialization of Nonparametric Texture Models for Tracking , 2000, ECCV.

[7]  James J. Clark,et al.  Data Fusion for Sensory Information Processing Systems , 1990 .

[8]  Shaogang Gong,et al.  Colour Model Selection and Adaption in Dynamic Scenes , 1998, ECCV.

[9]  E. Hayman Figure-ground segmentation of image sequences from multiple cues , 2002 .

[10]  Henrik I. Christensen,et al.  A Model-Free Voting Approach for Integrating Multiple Cues , 1998, ECCV.

[11]  Mubarak Shah,et al.  Object based segmentation of video using color, motion and spatial information , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Eric Horvitz,et al.  Bayesian Modality Fusion: Probabilistic Integration of Multiple Vision Algorithms for Head Tracking , 1999 .

[13]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[14]  Shaogang Gong,et al.  Continuous global evidence-based Bayesian modality fusion for simultaneous tracking of multiple objects , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[15]  Edward H. Adelson,et al.  Spatio-temporal segmentation of video data , 1994, Electronic Imaging.

[16]  Harpreet S. Sawhney,et al.  Layered representation of motion video using robust maximum-likelihood estimation of mixture models and MDL encoding , 1995, Proceedings of IEEE International Conference on Computer Vision.

[17]  Harpreet S. Sawhney,et al.  Independent motion detection in 3D scenes , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  Hai Tao,et al.  Dynamic layer representation with applications to tracking , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[19]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  C. Malsburg,et al.  Self-organized integration of adaptive visual cues for face tracking , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[21]  Hayit Greenspan,et al.  Color- and Texture-based Image Segmentation Using the Expectation-Maximization Algorithm and its Application to Content-Based Image Retrieval. , 1998, ICCV 1998.

[22]  Jitendra Malik,et al.  Textons, contours and regions: cue integration in image segmentation , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[23]  P. Anandan,et al.  A unified approach to moving object detection in 2D and 3D scenes , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[24]  Danica Kragic Visual Servoing for Manipulation : Robustness and Integration Issues , 2001 .

[25]  A. Murat Tekalp,et al.  Region-Based Parametric Motion Segmentation Using Color Information , 1998, Graph. Model. Image Process..