Robust classification of arbitrary object classes based on hierarchical spatial feature-matching

Abstract. We present a novel approach to the robust classification of arbitrary object classes in complex, natural scenes. Starting from a re-appraisal of Marr's ‘primal sketch’, we develop an algorithm that (1) employs local orientations as the fundamental picture primitives, rather than the more usual edge locations, (2) retains and exploits the local spatial arrangement of features of different complexity in an image and (3) is hierarchically arranged so that the level of feature abstraction increases at each processing stage. The resulting, simple technique is based on the accumulation of evidence in binary channels, followed by a weighted, non-linear sum of the evidence accumulators. The steps involved in designing a template for recognizing a simple object are explained. The practical application of the algorithm is illustrated, with examples taken from a broad range of object classification problems. We discuss the performance of the algorithm and describe a hardware implementation. First successful attempts to train the algorithm, automatically, are presented. Finally, we compare our algorithm with other object classification algorithms described in the literature.

[1]  Michael Brady,et al.  The Curvature Primal Sketch , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Ellen C. Hildreth,et al.  The detection of intensity changes by computer and biological vision systems , 1983, Comput. Vis. Graph. Image Process..

[3]  Peter Seitz,et al.  Minimum class entropy: A maximum information approach to layered networks , 1989, Neural Networks.

[4]  G. Turin,et al.  An introduction to matched filters , 1960, IRE Trans. Inf. Theory.

[5]  Theodosios Pavlidis An asynchronous thinning algorithm , 1982, Comput. Graph. Image Process..

[6]  Alex Pentland,et al.  Automatic extraction of deformable part models , 1990, International Journal of Computer Vision.

[7]  Richard A. Volz,et al.  Recognizing Partially Occluded Parts , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Peter J. Burt,et al.  Smart sensing within a pyramid vision machine , 1988, Proc. IEEE.

[9]  J G Daugman,et al.  Pattern and motion vision without Laplacian zero crossings. , 1988, Journal of the Optical Society of America. A, Optics and image science.

[10]  Peter Seitz,et al.  Fast Contour Detection Algorithm for High Precision Quantitative CT , 1983, IEEE Transactions on Medical Imaging.

[11]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[12]  D. Hubel Eye, brain, and vision , 1988 .

[13]  Peter Seitz,et al.  The Robust Recognition of Traffic Signs from a Moving Car , 1991, DAGM-Symposium.

[14]  R.P. Lippmann,et al.  Pattern classification using neural networks , 1989, IEEE Communications Magazine.

[15]  Peter Seitz,et al.  Using Local Orientational Information As Image Primitive For Robust Object Recognition , 1989, Other Conferences.

[16]  Axel Korn,et al.  Toward a Symbolic Representation of Intensity Changes in Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Berthold K. P. Horn Robot vision , 1986, MIT electrical engineering and computer science series.

[18]  Roland T. Chin,et al.  On the Detection of Dominant Points on Digital Curves , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[20]  Edward H. Adelson,et al.  PYRAMID METHODS IN IMAGE PROCESSING. , 1984 .

[21]  W. Eric L. Grimson,et al.  On the recognition of curved objects , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[22]  M. Bichsel Strategies of robust object recognition for the automatic identification of human faces , 1991 .

[23]  Rangasami L. Kashyap,et al.  Using Polygons to Recognize and Locate Partially Occluded Objects , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Olaf Kübler,et al.  Simulation of neural contour mechanisms: from simple to end-stopped cells , 1992, Vision Research.

[25]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  T. Pavlidis Algorithms for Graphics and Image Processing , 1981, Springer Berlin Heidelberg.

[27]  David Marr,et al.  Visual information processing: artificial intelligence and the sensorium of sight , 1987 .

[28]  Gary E. Ford,et al.  The Position-Orientation Masking Approach to Parametric Search for Template Matching , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[30]  Martin A. Fischler,et al.  Locating Perceptually Salient Points on Planar Curves , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Tomaso Poggio,et al.  Learning to see , 1996 .

[32]  L. Zadeh,et al.  Fuzzy Logic for the Management of Uncertainty , 1992 .

[33]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[34]  J. V. Vleck,et al.  The spectrum of clipped noise , 1966 .

[35]  Thomas O. Binford,et al.  Ignorance, myopia, and naiveté in computer vision systems , 1991, CVGIP Image Underst..

[36]  Peter Seitz,et al.  The robust recognition of object primitives using local axes of symmetry , 1989 .