Scene analysis by integrating primitive segmentation and associative memory

Scene analysis is a major aspect of perception and continues to challenge machine perception. This paper addresses the scene-analysis problem by integrating a primitive segmentation stage with a model of associative memory. The model is a multistage system that consists of an initial primitive segmentation stage, a multimodule associative memory, and a short-term memory (STM) layer. Primitive segmentation is performed by a locally excitatory globally inhibitory oscillator network (LEGION), which segments the input scene into multiple parts that correspond to groups of synchronous oscillations. Each segment triggers memory recall and multiple recalled patterns then interact with one another in the STM layer. The STM layer projects to the LEGION network, giving rise to memory-based grouping and segmentation. The system achieves scene analysis entirely in phase space, which provides a unifying mechanism for both bottom-up analysis and top-down analysis. The model is evaluated with a systematic set of three-dimensional (3-D) line drawing objects, which are arranged in an arbitrary fashion to compose input scenes that allow object occlusion. Memory-based organization is responsible for a significant improvement in performance. A number of issues are discussed, including input-anchored alignment, top-down organization, and the role of STM in producing context sensitivity of memory recall.

[1]  Stephen Grossberg,et al.  A neural network architecture for figure-ground separation of connected scenic figures , 1991, Neural Networks.

[2]  H. Pashler The Psychology of Attention , 1997 .

[3]  D C Van Essen,et al.  Shifter circuits: a computational strategy for dynamic aspects of visual processing. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Carlos Lourenço,et al.  Pattern segmentation in a binary/analog world: unsupervised learning versus memory storing , 2000, Neural Networks.

[5]  Christoph von der Malsburg,et al.  The Correlation Theory of Brain Function , 1994 .

[6]  Joachim M. Buhmann,et al.  Pattern Segmentation in Associative Memory , 1990, Neural Computation.

[7]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[8]  J. Mattingley,et al.  Preattentive Filling-in of Visual Surfaces in Parietal Extinction , 1997, Science.

[9]  W Singer,et al.  Visual feature integration and the temporal correlation hypothesis. , 1995, Annual review of neuroscience.

[10]  R. Parasuraman The attentive brain in aging and Alzheimer's disease. , 1998 .

[11]  Ch. von der Malsburg,et al.  A neural cocktail-party processor , 1986, Biological Cybernetics.

[12]  Matthias M. Müller,et al.  Human Gamma Band Activity and Perception of a Gestalt , 1999, The Journal of Neuroscience.

[13]  M. Just,et al.  From the SelectedWorks of Marcel Adam Just 1992 A capacity theory of comprehension : Individual differences in working memory , 2017 .

[14]  P. S. Lindsey,et al.  Fast numerical integration of relaxation oscillator networks based on singular limit solutions , 1996 .

[15]  DeLiang Wang,et al.  Locally excitatory globally inhibitory oscillator networks , 1995, IEEE Transactions on Neural Networks.

[16]  L. Finkel,et al.  Extraction of perceptually salient contours by striate cortical networks , 1998, Vision Research.

[17]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[18]  DeLiang Wang,et al.  Relaxation Oscillators and Networks , 1999 .

[19]  John J. Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities , 1999 .

[20]  Shinsuke Shimojo,et al.  Visual surface representation: a critical link between lower-level and higher level vision , 1995 .

[21]  Michael D. Byrne,et al.  ACT-R/PM and menu selection: applying a cognitive architecture to HCI , 2001, Int. J. Hum. Comput. Stud..

[22]  DeLiang Wang,et al.  Pattern recognition: neural networks in perspective , 1993, IEEE Expert.

[23]  L. Stark,et al.  Dissertation Abstract , 1994, Journal of Cognitive Education and Psychology.

[24]  R Blake,et al.  Visual form created solely from temporal structure. , 1999, Science.

[25]  Brian D. Ripley,et al.  Pattern Recognition and Neural Networks , 1996 .

[26]  M. Usher,et al.  Parallel Activation of Memories in an Oscillatory Neural Network , 1991, Neural Computation.

[27]  C. L. M. The Psychology of Attention , 1890, Nature.

[28]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[29]  Tony Lindeberg,et al.  Feature Detection with Automatic Scale Selection , 1998, International Journal of Computer Vision.

[30]  Qing Ma Adaptive associative memories capable of pattern segmentation , 1996, IEEE Trans. Neural Networks.

[31]  Haim Sompolinsky,et al.  Segmentation by a Network of Oscillators with Stored Memories , 1994, Neural Computation.

[32]  Milan Sonka,et al.  Image processing analysis and machine vision [2nd ed.] , 1999 .

[33]  M. Just,et al.  Computational modeling of high‐level cognition and brain function , 1999, Human brain mapping.

[34]  J. Wolfe,et al.  Guided Search 2.0 A revised model of visual search , 1994, Psychonomic bulletin & review.

[35]  Laxmi Parida,et al.  Junctions: Detection, Classification, and Reconstruction , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  DeLiang Wang,et al.  Image Segmentation Based on Oscillatory Correlation , 1997, Neural Computation.

[37]  Deliang Wang,et al.  Global competition and local cooperation in a network of neural oscillators , 1995 .

[38]  D E Kieras,et al.  A computational theory of executive cognitive processes and multiple-task performance: Part 1. Basic mechanisms. , 1997, Psychological review.

[39]  Marius Usher,et al.  Visual synchrony affects binding and segmentation in perception , 1998, Nature.

[40]  H. Müller,et al.  Synchronous Information Presented in 40-HZ Flicker Enhances Visual Feature Binding , 1998 .

[41]  DeLiang Wang,et al.  Object selection based on oscillatory correlation , 1999, Neural Networks.

[42]  DeLiang Wang,et al.  Boundary detection by contextual non-linear smoothing , 2000, Pattern Recognit..

[43]  DeLiang Wang,et al.  Segmentation of medical images using LEGION , 1999, IEEE Transactions on Medical Imaging.

[44]  M. Livingstone Oscillatory firing and interneuronal correlations in squirrel monkey striate cortex. , 1996, Journal of neurophysiology.