In Search of Perceptually Salient Groupings

Finding meaningful groupings of image primitives has been a long-standing problem in computer vision. This paper studies how salient groupings can be produced using established theories in the field of visual perception alone. The major contribution is a novel definition of the Gestalt principle of Prägnanz, based upon Koffka's definition that image descriptions should be both stable and simple. Our method is global in the sense that it operates over all primitives in an image at once. It works regardless of the type of image primitives and is generally independent of image properties such as intensity, color, and texture. A novel experiment is designed to quantitatively evaluate the groupings outputs by our method, which takes human disagreement into account and is generic to outputs of any grouper. We also demonstrate the value of our method in an image segmentation application and quantitatively show that segmentations deliver promising results when benchmarked using the Berkeley Segmentation Dataset (BSDS).

[1]  Steven W. Zucker,et al.  Local Scale Control for Edge Detection and Blur Estimation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Joachim M. Buhmann,et al.  A Compositionally Architecture for Perceptual Feature Grouping , 2003, EMMCVPR.

[3]  Yuhong Yang,et al.  Information Theory, Inference, and Learning Algorithms , 2005 .

[4]  David W. Jacobs,et al.  Robust and Efficient Detection of Salient Convex Groups , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Jitendra Malik,et al.  From contours to regions: An empirical evaluation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  M. Wertheimer Laws of organization in perceptual forms. , 1938 .

[7]  A. Witkin,et al.  On the Role of Structure in Vision , 1983 .

[8]  O. Reiser,et al.  Principles Of Gestalt Psychology , 1936 .

[9]  Jitendra Malik,et al.  Learning Probabilistic Models for Contour Completion in Natural Images , 2008, International Journal of Computer Vision.

[10]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[11]  J. Elder,et al.  Ecological statistics of Gestalt laws for the perceptual organization of contours. , 2002, Journal of vision.

[12]  Cordelia Schmid,et al.  Bandit Algorithms for Tree Search , 2007, UAI.

[13]  Steven W. Zucker,et al.  Radial Projection: An Efficient Update Rule for Relaxation Labeling , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Pablo Andrés Arbeláez,et al.  Finding Semantic Structures in Image Hierarchies Using Laplacian Graph Energy , 2010, ECCV.

[15]  David Sinclair,et al.  Language-based querying of image collections on the basis of an extensible ontology , 2004, Image Vis. Comput..

[16]  Andrew Zisserman,et al.  A Boundary-Fragment-Model for Object Detection , 2006, ECCV.

[17]  Antonio Robles-Kelly,et al.  Graph edit distance from spectral seriation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Steven W. Zucker,et al.  Trace Inference, Curvature Consistency, and Curve Detection , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Jianbo Shi,et al.  Segmentation given partial grouping constraints , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Yee-Hong Yang,et al.  Multiresolution Color Image Segmentation , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Sudeep Sarkar,et al.  Supervised Learning of Large Perceptual Organization: Graph Spectral Partitioning and Learning Automata , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Jacob Feldman,et al.  Perceptual Grouping by Selection of a Logically Minimal Model , 2003, International Journal of Computer Vision.

[23]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[24]  Andrew Blake,et al.  Contour-based learning for object detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[25]  Andrew P. Witkin,et al.  Scale-Space Filtering , 1983, IJCAI.

[26]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[27]  Gérard G. Medioni,et al.  Inferring global perceptual contours from local features , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[29]  Allan D. Jepson,et al.  Perceptual grouping for contour extraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[30]  Majid Mirmehdi,et al.  Perceptual primitives from an extended 4D Hough transform , 2002, Image Vis. Comput..

[31]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[32]  H. Barlow Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .

[33]  Richard S. Weiss,et al.  Perceptual Grouping Of Curved Lines , 1989, Other Conferences.

[34]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[35]  Frédéric Jurie,et al.  Groups of Adjacent Contour Segments for Object Detection , 2008, IEEE Trans. Pattern Anal. Mach. Intell..