Pyramid Vision Using Key Features to Integrate Image-Driven Bottom-Up and Model-Driven Top-Down Processes

Pyramidlike parallel hierarchical structures have been shown to be suitable for many computer vision tasks and have the potential for achieving the speeds needed for the real-time processing of real-world images. Algorithms are being developed to explore the pyramid's massively parallel and shallowly serial-hierarchical computing ability in an integrated system that combines both low-level and higher level vision tasks. Micromodular transforms are used to embody the program's knowledge of the different objects it must recognize. Pyramid vision programs are described that, starting with the image, use transforms that assess key features to dynamically imply other feature-detecting and characterizing transforms and additional top-down model-driven processes to apply. Program performance is presented for four real-world images of buildings. The use of key features in pyramid vision programs and the related search and control issues are discussed. To expedite the detection of various key features, feature-adaptable windows are developed. In addition to image-driven bottom-up and model-driven top-down processing, lateral search is used and is shown to be helpful, efficient, and feasible. The results indicate that with the use of key features and the combination of a variety of powerful search patterns, the pyramidlike structure is effective and efficient for supporting parallel and hierarchical object recognition algorithms.

[1]  佐藤 孝紀,et al.  A Hierarchical Data Structure for Picture Processing , 1976 .

[2]  Azriel Rosenfeld,et al.  Multiresolution image processing and analysis , 1984 .

[3]  Russ Miller,et al.  Pyramid computer algorithms for determining geometric properties of images , 1985, SCG '85.

[4]  Kenneth E. Batcher,et al.  Design of a Massively Parallel Processor , 1980, IEEE Transactions on Computers.

[5]  R. Bolles,et al.  Recognizing and Locating Partially Visible Objects: The Local-Feature-Focus Method , 1982 .

[6]  Leonard Uhr,et al.  Layered "Recognition Cone" Networks That Preprocess, Classify, and Describe , 1972, IEEE Transactions on Computers.

[7]  Leonard Uhr,et al.  The Several Steps from Icon to Symbol Using Structured Cone/Pyramids , 1984 .

[8]  Robert C. Bolles,et al.  Locating Partially Visible Objects: The Local Feature Focus Method , 1980, AAAI.

[9]  Daniel Sabbah,et al.  Design Of A Highly Parallel Visual Recognition System , 1981, IJCAI.

[10]  Kendall Preston,et al.  Evaluation of Multicomputers for Image Processing , 1987 .

[11]  横矢 直和,et al.  Quadtrees and Pyramids for Pattern Recognition and Image Processing , 1981 .

[12]  Charles R. Dyer,et al.  Two-dimensional object recognition using multiresolution models , 1986, Comput. Vis. Graph. Image Process..

[13]  Leonard Uhr,et al.  A parallel-serial recognition cone system for perception: some test results , 1979, Pattern Recognit..

[14]  Ze-Nian Li,et al.  Evidential Reasoning in Parallel Hierarchical Vision Programs , 1986, UAI 1986.

[15]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[16]  Leonard Uhr,et al.  A pyramidal approach for the recognition of neurons using key features , 1986, Pattern Recognit..

[17]  John H. R. Maunsell,et al.  Hierarchical organization and functional streams in the visual cortex , 1983, Trends in Neurosciences.

[18]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[19]  Leonard M. Uhr Algorithm-structured computer arrays and networks , 1984 .

[20]  D. Hubel,et al.  Anatomy and physiology of a color system in the primate visual cortex , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[21]  Azriel Rosenfeld,et al.  Segmentation and Estimation of Image Region Properties through Cooperative Hierarchial Computation , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[22]  John D. Lowrance,et al.  Reasoning About Control: An Evidential Approach , 1984 .

[23]  Ze-Nian Li Pyramid vision: using key features and evidential reasoning , 1986 .

[24]  M. Mesulam,et al.  The functional anatomy and hemispheric specialization for directed attention The role of the parietal lobe and its connectivity , 1983, Trends in Neurosciences.