Geometric feature distributions for shape representation and recognition

One of the fundamental problems in computer vision is the identification of objects from their shape. The research reported in this thesis is directed toward the development of a scheme for representing the shape of an object which allows it to be recognised both quickly and robustly across a wide range of viewing conditions. Given a shape described by a set of primitive elements, eg. straight line segments, the proposed scheme involves using a histogram to record the distribution of geometric features, eg. angle and distance, measured between pairs of primitives. This form of shape representation has a number advantages over previously proposed schemes. Foremost among these is the fact that it is able to produce local representations of shape, based on individual line segments. Recognition based on such representation is robust to the problems arising in cluttered scenes. Representations produced by the scheme are also invariant to certain object transformations, they degrade gracefully as the shape is fragmented and are strong enough to support discrimination between dissimilar objects. By treating the histogram recording a geometric feature distribution as a feature vector it is possible to match shapes using techniques from statistical pattern classification. This has the advantage that optimal matching accuracy can be achieved using processing which is both simple and uniform. The approach is therefore ideally suited to implementation in dedicated hardware. A detailed analysis is undertaken of the effect on recognition of changes in the description of a shape caused by fragmentation noise, scene clutter and sensor error. It is found that the properties of both the representation and matching components of the system combine to ensure that recognition is, in theory, unaffected by fragmentation noise, while it is maintained to very high levels of scene clutter. The factors which determine the effect of sensor error on the performance of the recognition system are fully analysed. The ability of the representational scheme to support object recognition is demonstrated in a number of different domains. The recognition of both 2D and 3D objects from a fixed viewpoint is demonstrated in conditions of severe fragmentation noise, occlusion and clutter. The scheme is then shown to extend straightforwardly to the representation of 3D shape. This is exploited to perform recognition and localisation of 3D objects from an arbitrary viewpoint, based on the matching of 3D scene and ,model shape descriptions. Finally, the use of the scheme within a multiple view-based approach to 3D object recognition is demonstrated.

[1]  John Porrill,et al.  Matching geometrical descriptions in three-space , 1987, Image Vis. Comput..

[2]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Steven J. Rak,et al.  Performance of a neural-network-based 3-D object recognition system , 1991, Defense, Security, and Sensing.

[4]  David Harwood,et al.  An iterative hough procedure for three-dimensional object recognition , 1984, Pattern Recognit..

[5]  D. Forsyth,et al.  Fast recognition using algebraic invariants , 1992 .

[6]  A. J. Mistlin,et al.  Visual neurones responsive to faces , 1987, Trends in Neurosciences.

[7]  Anil K. Jain,et al.  Chord distributions for shape matching , 1982, Comput. Graph. Image Process..

[8]  W. Grimson,et al.  Model-Based Recognition and Localization from Sparse Range or Tactile Data , 1984 .

[9]  Ralph Roskies,et al.  Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[10]  Larry S. Davis,et al.  Object recognition using oriented model points , 1986 .

[11]  Andrew Zisserman,et al.  Recognizing general curved objects efficiently , 1992 .

[12]  J P Frisby,et al.  PMF: A Stereo Correspondence Algorithm Using a Disparity Gradient Limit , 1985, Perception.

[13]  Harry Wechsler,et al.  2-D Invariant Object Recognition Using Distributed Associative Memory , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Andrew M. Wallace,et al.  Interpretation of 2D Scenes Using a General Relational Model , 1987, Alvey Vision Conference.

[15]  Rama Chellappa,et al.  Classification of Partial 2-D Shapes Using Fourier Descriptors , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  D. W. Murray Model-based recognition using 3D shape alone , 1987, Comput. Vis. Graph. Image Process..

[17]  Tony P. Pridmore,et al.  TINA: A 3D Vision System for Pick and Place , 1987, Alvey Vision Conference.

[18]  Keinosuke Fukunaga,et al.  Three-Dimensional Shape Analysis Using Local Shape Descriptors , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Gabriel Taubin,et al.  Estimation of Planar Curves, Surfaces, and Nonplanar Space Curves Defined by Implicit Equations with Applications to Edge and Range Image Segmentation , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Anthony P. Reeves,et al.  Three-Dimensional Shape Analysis Using Moments and Fourier Descriptors , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  A. J. Mistlin,et al.  Visual cells in the temporal cortex sensitive to face view and gaze direction , 1985, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[22]  Yehezkel Lamdan,et al.  Affine invariant model-based object recognition , 1990, IEEE Trans. Robotics Autom..

[23]  Haim J. Wolfson,et al.  Transformation invariant indexing , 1992 .

[24]  S Ullman,et al.  Low-level aspects of segmentation and recognition. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[25]  David J. Parker,et al.  Analysis of global pattern features , 1974, Pattern Recognit..

[26]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[27]  Robert M. Haralick Performance Characterization in Computer Vision , 1992, BMVC.

[28]  V. F. F. Leavers Shape Detection in Computer Vision Using the Hough Transform , 2011 .

[29]  Yehezkel Lamdan,et al.  Geometric Hashing: A General And Efficient Model-based Recognition Scheme , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[30]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[31]  Gérard G. Medioni,et al.  Structural Indexing: Efficient 2D Object Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Owen Robert Mitchell,et al.  Partial Shape Recognition Using Dynamic Programming , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Neil A. Thacker,et al.  Designing a layered network for context sensitive pattern classification , 1990, Neural Networks.

[34]  W. Eric L. Grimson,et al.  Localizing Overlapping Parts by Searching the Interpretation Tree , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[36]  John E. W. Mayhew,et al.  Hypothesis and Verification in 3D Model Matching , 1989, Alvey Vision Conference.

[37]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[38]  M. Tarr,et al.  Mental rotation and orientation-dependence in shape recognition , 1989, Cognitive Psychology.

[39]  Stephen Grossberg,et al.  Competitive Learning: From Interactive Activation to Adaptive Resonance , 1987, Cogn. Sci..

[40]  Gösta H. Granlund,et al.  Fourier Preprocessing for Hand Print Character Recognition , 1972, IEEE Transactions on Computers.

[41]  P. Wintz,et al.  An efficient three-dimensional aircraft recognition algorithm using normalized fourier descriptors , 1980 .

[42]  G. Humphrey,et al.  Recognizing novel views of three-dimensional objects. , 1992, Canadian journal of psychology.

[43]  Allen M. Waxman,et al.  Adaptive 3-D Object Recognition from Multiple Views , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  I. Rock,et al.  A case of viewer-centered object perception , 1987, Cognitive Psychology.

[45]  David Zipser,et al.  Feature Discovery by Competive Learning , 1986, Cogn. Sci..

[46]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[47]  Clarence L. Coates,et al.  Visual Learning from Multiple Views , 1975, IEEE Transactions on Computers.

[48]  S. Marshall,et al.  Review of shape coding techniques , 1989, Image Vis. Comput..

[49]  Allen M. Waxman,et al.  Learning Aspect Graph Representations from View Sequences , 1989, NIPS.

[50]  Michael C. Seibert,et al.  Neural networks for machine vision: learning three-dimensional object representations , 1991 .

[51]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[52]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[53]  Glyn W. Humphreys,et al.  Normal and pathological processes in visual object constancy. , 1987 .

[54]  Tony P. Pridmore,et al.  Segmentation and description of binocularly viewed contours , 1987, Image Vis. Comput..

[55]  John R. Kender,et al.  What is a 'Degenerate' View? , 1987, IJCAI.

[56]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[57]  Gérard G. Medioni,et al.  Structural Indexing: Efficient 3-D Object Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  Werner von Seelen,et al.  Contextual feature similarities for model-based object recognition , 1993, 1993 (4th) International Conference on Computer Vision.