Figure-Ground Separation by Cue Integration

This letter presents an improved cue integration approach to reliably separate coherent moving objects from their background scene in video sequences. The proposed method uses a probabilistic framework to unify bottom-up and top-down cues in a parallel, democratic fashion. The algorithm makes use of a modified Bayes rule where each pixel's posterior probabilities of figure or ground layer assignment are derived from likelihood models of three bottom-up cues and a prior model provided by a top-down cue. Each cue is treated as independent evidence for figure-ground separation. They compete with and complement each other dynamically by adjusting relative weights from frame to frame according to cue quality measured against the overall integration. At the same time, the likelihood or prior models of individual cues adapt toward the integrated result. These mechanisms enable the system to organize under the influence of visual scene structure without manual intervention. A novel contribution here is the incorporation of a top-down cue. It improves the system's robustness and accuracy and helps handle difficult and ambiguous situations, such as abrupt lighting changes or occlusion among multiple objects. Results on various video sequences are demonstrated and discussed. (Video demos are available at http://organic.usc.edu:8376/tangx/neco/index.html.)

[1]  Jochen Triesch,et al.  Democratic Integration: Self-Organized Integration of Adaptive Cues , 2001, Neural Computation.

[2]  J. Cutting,et al.  Minimodularity and the perception of layout. , 1988, Journal of experimental psychology. General.

[3]  Jan-Olof Eklundh,et al.  Probabilistic and Voting Approaches to Cue Integration for Figure-Ground Segmentation , 2002, ECCV.

[4]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[5]  Robert A. Jacobs,et al.  Modeling the Combination of Motion, Stereo, and Vergence Angle Cues to Visual Depth , 1999, Neural Computation.

[6]  R. Kohler A segmentation system based on thresholding , 1981 .

[7]  B. Stein,et al.  The Merging of the Senses , 1993 .

[8]  Heiko Neumann,et al.  A neural model of the temporal dynamics of figure-ground segregation in motion perception , 2010, Neural Networks.

[9]  Mi-Suen Lee,et al.  A Computational Framework for Segmentation and Grouping , 2000 .

[10]  Mary J. Bravo,et al.  Top-down and bottom-up processes for object segmentation , 2010 .

[11]  Chris A. Glasbey,et al.  An Analysis of Histogram-Based Thresholding Algorithms , 1993, CVGIP Graph. Model. Image Process..

[12]  Chin-Hwa Lee Recursive region splitting at hierarchical scope views , 1986, Comput. Vis. Graph. Image Process..

[13]  M. Wertheimer Laws of organization in perceptual forms. , 1938 .

[14]  Jochen Triesch Self-organized integration of adaptive visual cues for face tracking , 2000, SPIE Defense + Commercial Sensing.

[15]  M. Farah,et al.  Is visual image segmentation a bottom-up or an interactive process? , 1997, Perception & psychophysics.

[16]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  James M. Hillis,et al.  Slant from texture and disparity cues: optimal cue combination. , 2004, Journal of vision.

[18]  Michael J. Black,et al.  The Robust Estimation of Multiple Motions: Parametric and Piecewise-Smooth Flow Fields , 1996, Comput. Vis. Image Underst..

[19]  Steven W. Zucker,et al.  Region growing: Childhood and adolescence* , 1976 .

[20]  Jochen Triesch,et al.  Democratic Integration: A Theory of Adaptive Sensory Integration , 2000 .

[21]  John Daugman Relaxation Neural Network For Complete Discrete 2-D Gabor Transforms , 1988, Other Conferences.

[22]  Ralph Gross,et al.  Concurrent Object Recognition and Segmentation by Graph Partitioning , 2002, NIPS.

[23]  Robin R. Murphy,et al.  Biological and cognitive foundations of intelligent sensor fusion , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[24]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[25]  Gérard G. Medioni,et al.  Motion segmentation with accurate boundaries - a tensor voting approach , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[26]  R. Gregory The intelligent eye , 1970 .

[27]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  R A Kirsch,et al.  Computer determination of the constituent structure of biological images. , 1971, Computers and biomedical research, an international journal.

[29]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Xiaolin Wu,et al.  Adaptive Split-and-Merge Segmentation Based on Piecewise Least-Square Approximation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Shimon Ullman,et al.  Combining Top-Down and Bottom-Up Segmentation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[32]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[33]  Pascal Mamassian,et al.  Interaction of visual prior constraints , 2001, Vision Research.

[34]  Xiangyu Tang,et al.  A model for figure-ground segmentation by self-organized cue integration , 2005 .

[35]  Daniel Cremers,et al.  Diffusion Snakes: Introducing Statistical Shape Knowledge into the Mumford-Shah Functional , 2002, International Journal of Computer Vision.

[36]  Christoph von der Malsburg,et al.  Self-organized figure-ground segmentation by multiple-cue integration , 2005, SIP.

[37]  B. S. Manjunath,et al.  Color image segmentation , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).