论文信息 - Figure/Ground Assignment in Natural Images

Figure/Ground Assignment in Natural Images

Figure/ground assignment is a key step in perceptual organization which assigns contours to one of the two abutting regions, providing information about occlusion and allowing high-level processing to focus on non-accidental shapes of figural regions. In this paper, we develop a computational model for figure/ground assignment in complex natural scenes. We utilize a large dataset of images annotated with human-marked segmentations and figure/ground labels for training and quantitative evaluation. We operationalize the concept of familiar configuration by constructing prototypical local shapes, i.e. shapemes, from image data. Shapemes automatically encode mid-level visual cues to figure/ground assignment such as convexity and parallelism. Based on the shapeme representation, we train a logistic classifier to locally predict figure/ground labels. We also consider a global model using a conditional random field (CRF) to enforce global figure/ground consistency at T-junctions. We use loopy belief propagation to perform approximate inference on this model and learn maximum likelihood parameters from ground-truth labels. We find that the local shapeme model achieves an accuracy of 64% in predicting the correct figural assignment. This compares favorably to previous studies using classical figure/ground cues [1]. We evaluate the global model using either a set of contours extracted from a low-level edge detector or the set of contours given by human segmentations. The global CRF model significantly improves the performance over the local model, most notably when using human-marked boundaries (78%). These promising experimental results show that this is a feasible approach to bottom-up figure/ground assignment in natural images.

Jitendra Malik | Charless C. Fowlkes | Xiaofeng Ren | Xiaofeng Ren | Jitendra Malik

[1] Edgar Rubin. Visuell wahrgenommene Figuren : Studien in psychologischer Analyse , 1921 .

[2] Geoffrey E. Hinton,et al. Separating Figure from Ground with a Parallel Network , 1986, Perception.

[3] Rüdiger von der Heydt,et al. A computational model of neural contour processing: Figure-ground segregation and illusory contours , 1993, 1993 (4th) International Conference on Computer Vision.

[4] B. Gibson,et al. Must Figure-Ground Organization Precede Object Recognition? An Assumption in Peril , 1994 .

[5] Victor A. F. Lamme. The neurophysiology of figure-ground segregation in primary visual cortex , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[6] Laxmi Parida,et al. Visual organization for figure/ground separation , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] Eric Saund. Perceptual Organization of Occluding Contours of Opaque Surfaces , 1999, Comput. Vis. Image Underst..

[8] Nava Rubin,et al. Measuring convexity for figure/ground separation , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[9] R. von der Heydt,et al. Coding of Border Ownership in Monkey Visual Cortex , 2000, The Journal of Neuroscience.

[10] H. Barlow. Vision Science: Photons to Phenomenology by Stephen E. Palmer , 2000, Trends in Cognitive Sciences.

[11] Jitendra Malik,et al. Shape contexts enable efficient retrieval of similar shapes , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12] Jitendra Malik,et al. Geometric blur for template matching , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13] Takeo Kanade,et al. A Hierarchical Markov Random Field Model for Figure-Ground Segregation , 2001, EMMCVPR.

[14] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[15] Jitendra Malik,et al. Learning to Detect Natural Image Boundaries Using Brightness and Texture , 2002, NIPS.

[16] Martial Hebert,et al. Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[17] R. Zemel,et al. Multiscale conditional random fields for image labeling , 2004, CVPR 2004.

[18] Miguel Á. Carreira-Perpiñán,et al. Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[19] Josh H. McDermott,et al. Psychophysics with junctions in real images. , 2010, Perception.

[20] Jitendra Malik,et al. Recovering human body configurations: combining segmentation and recognition , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[21] Jitendra Malik,et al. Scale-invariant contour completion using conditional random fields , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[22] Jitendra Malik,et al. On Measuring* the Ecological Validity of Local Figure-Ground Cues , 2005 .