Information-conserving object recognition

Following the theory of statistical estimation, the problem of recognizing objects imaged in complex real-world scenes is examined from a parametric perspective. A scalar measure of an object's complexity, which is invariant under affine transformation and changes in image noise level, is extracted from the object's Fisher information. The volume of Fisher information is shown to provide an overall statistical measure of the object's recognizability in a particular image, while the complexity provides an intrinsically physical measure that characterizes the object in any image. An information-conserving method is then developed for recognizing an object imaged in a complex scene. Here the term information-conserving means that the method uses all the measured data pertinent to the object's recognizability, attains the theoretical lower bound on estimation error for any unbiased estimate, and therefore is statistically optimal. This method is then successfully applied to finding objects imaged in thousands of complex real-world scenes.

[1]  Dennis Gabor,et al.  Theory of communication , 1946 .

[2]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[3]  K. Torrance,et al.  Theory for off-specular reflection from roughened surfaces , 1967 .

[4]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[5]  W. Siebert Circuits, Signals and Systems , 1985 .

[6]  J. Goodman Statistical Optics , 1985 .

[7]  Berthold K. P. Horn Robot vision , 1986, MIT electrical engineering and computer science series.

[8]  H. Szu Fast simulated annealing , 1987 .

[9]  Gil J. Ettinger,et al.  Large hierarchical object recognition using libraries of parameterized model sub-parts , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  David B. Cooper,et al.  Toward a Model-Based Bayesian Theory for Estimating and Recognizing Parameterized 3-D Objects Using Two or More Images Taken from Different Positions , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[12]  Charles A. Poynton,et al.  Gamma and Its Disguises : The Nonlinear Mappings of Intensity in Perception, CRTs, Film, and Video , 1993 .

[13]  Marco Campani,et al.  A robust method for road sign detection and recognition , 1994, ECCV.

[14]  Nasser Kehtarnavaz,et al.  Traffic sign recognition in noisy outdoor scenes , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[15]  S K Nayar,et al.  Visual appearance of matte surfaces , 1995, Science.

[16]  Lutz Priese,et al.  Ideogram identification in a realtime traffic sign recognition system , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[17]  N. Makris,et al.  A foundation for logarithmic measures of fluctuating intensity in pattern recognition. , 1995, Optics letters.

[18]  Dhanistha Panyasak,et al.  Circuits , 1995, Annals of the New York Academy of Sciences.

[19]  M. Aoki,et al.  Route guidance sign recognition , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[20]  Margrit Betke,et al.  Fast object recognition in noisy images using simulated annealing , 1995, Proceedings of IEEE International Conference on Computer Vision.

[21]  Margrit Betke,et al.  Mobile robot localization using landmarks , 1997, IEEE Trans. Robotics Autom..