Recognition and pose estimation of unoccluded three-dimensional objects from a two-dimensional perspective view by banks of neural networks

This paper describes a neural network (NN) based system for recognition and pose estimation of an unoccluded three-dimensional (3-D) object from any single two-dimensional (2-D) perspective view. The approach is invariant to translation, orientation, and scale. First, the binary silhouette of the object is obtained and normalized for translation and scale. Then, the object is represented by a set of rotation-invariant features derived from the complex orthogonal pseudo-Zernike moments of the image. The recognition scheme combines the decisions of a bank of multilayer perceptron NN classifiers operating in parallel on the same data. These classifiers have different topologies and internal parameters, but are trained on the same set of exemplar perspective views of the objects. Next, two pose parameters, the elevation and aspect angles, are obtained by a novel two-stage NN system consisting of a quadrant classifier followed by NN angle estimators. Performance is tested on clean and noisy databases of military ground vehicles. Comparative studies with three other classifiers (a single NN, the weighted nearest-neighbor classifier, and a binary decision tree) are carried out.
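The idea of combining the decisions of a bank of classifiers that all see the same feature vector can be illustrated with a minimal sketch. The abstract does not state which combination rule is used, so the majority vote below is an assumption chosen for simplicity, not the paper's method. The function name `combine_by_majority`, the `min_agreement` parameter, the toy threshold classifiers, and the labels "tank" and "truck" are all hypothetical stand-ins for trained multilayer perceptrons and real vehicle classes.

```python
from collections import Counter
from typing import Callable, Hashable, Optional, Sequence

# A classifier maps a feature vector to a class label (hypothetical interface).
Classifier = Callable[[Sequence[float]], Hashable]


def combine_by_majority(
    classifiers: Sequence[Classifier],
    features: Sequence[float],
    min_agreement: float = 0.5,
) -> Optional[Hashable]:
    """Return the label backed by more than `min_agreement` of the bank.

    Assumed rule: simple majority vote. Returns None (reject) when no
    label reaches the required share of votes.
    """
    if not classifiers:
        raise ValueError("the bank must contain at least one classifier")
    # Every member of the bank sees the same feature vector.
    votes = Counter(clf(features) for clf in classifiers)
    label, count = votes.most_common(1)[0]
    if count / len(classifiers) > min_agreement:
        return label
    return None


# Toy stand-ins for trained networks: each thresholds the features differently.
toy_bank = [
    lambda f: "tank" if f[0] > 0.5 else "truck",
    lambda f: "tank" if f[1] > 0.2 else "truck",
    lambda f: "tank" if sum(f) > 1.5 else "truck",
]

if __name__ == "__main__":
    # Two of the three toy classifiers vote "tank" for this vector.
    print(combine_by_majority(toy_bank, [0.9, 0.3]))
```

With a reject option like this, a view on which the bank disagrees yields no decision instead of a forced label; whether the system described here rejects such views is not stated in the abstract.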
