View-based dynamic object recognition based on human perception

Psychophysical studies have shown that humans actively exploit temporal information such as contiguity of images in object recognition. We have recently developed a recognition system which uses temporal contiguity to learn extensible representations of objects on-line. The system performs well both on real-world and synthetic data and shows robustness under illumination changes. In this paper, we present results which compare the proposed representation against simple image-based representations of the same complexity using Minkowski minimum distance classifiers and support vector machine classifiers. Recognition results for all classifiers show large improvements with incorporated temporal information.

[1]  L. Kaufman,et al.  Spontaneous fixation tendencies for visual forms , 1969 .

[2]  Stephen K. Reed,et al.  Pattern recognition and categorization , 1972 .

[3]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[4]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[5]  Y. Miyashita Neuronal correlate of visual associative long-term memory in the primate temporal cortex , 1988, Nature.

[6]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[7]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  H. C. Longuet-Higgins,et al.  An algorithm for associating the features of two images , 1991, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[9]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Heinrich H. Bülthoff,et al.  Psychophysical support for a 2D view interpolation theory of object recognition , 1991 .

[11]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[13]  Keiji Tanaka,et al.  Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex. , 1994, Journal of neurophysiology.

[14]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[15]  Paul A. Beardsley,et al.  3D Model Acquisition from Extended Image Sequences , 1996, ECCV.

[16]  Bartlett W. Mel SEEMORE: Combining Color, Shape, and Texture Histogramming in a Neurally Inspired Approach to Visual Object Recognition , 1997, Neural Computation.

[17]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Maurizio Pilu,et al.  A direct method for stereo correspondence based on singular value decomposition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[20]  Koen Lamberts,et al.  Knowledge, Concepts, and Categories , 1997 .

[21]  Steffen Schmalz,et al.  Combining Multiple Views and Temporal Associations for 3-D object Recognition , 1998, ECCV.

[22]  James V. Stone,et al.  Object recognition: view-specificity and motion-specificity , 1999, Vision Research.

[23]  Heinrich H. Bülthoff,et al.  Object recognition in man, monkey, and machine , 1999 .

[24]  Matthew Turk,et al.  A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[25]  B. Schölkopf,et al.  Advances in kernel methods: support vector learning , 1999 .

[26]  Thomas Vetter,et al.  A morphable model for the synthesis of 3D faces , 1999, SIGGRAPH.

[27]  Pietro Perona,et al.  Unsupervised Learning of Models for Recognition , 2000, ECCV.

[28]  Heiko Hecht,et al.  Mental rota-tion of facial components and configurations , 2000 .

[29]  G. Hauske,et al.  Object and scene analysis by saccadic eye-movements: an investigation with higher-order statistics. , 2000, Spatial vision.

[30]  David G. Lowe,et al.  Towards a Computational Model for Object Recognition in IT Cortex , 2000, Biologically Motivated Computer Vision.

[31]  Alex M. Andrew,et al.  Object Recognition in Man, Monkey, and Machine , 2000 .

[32]  Rahul Sukthankar,et al.  Memory-based face recognition for visitor identification , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[33]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[34]  Shimon Ullman,et al.  Object Classification Using a Fragment-Based Representation , 2000, Biologically Motivated Computer Vision.

[35]  U. Hahn,et al.  Similarity and categorization , 2001 .

[36]  Heinrich H. Bülthoff,et al.  View-based recognition under illumination changes using local features , 2001, CVPR 2001.

[37]  H. Bülthoff,et al.  Effects of temporal association on recognition memory , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Heinrich H. Bülthoff,et al.  Automatic acquisition of exemplar-based representations for recognition from image sequences , 2001, CVPR 2001.

[39]  Silvio Borer,et al.  Normalization in Support Vector Machines , 2001, DAGM-Symposium.

[40]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[41]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[42]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[43]  Bernt Schiele,et al.  Recognition without Correspondence using Multidimensional Receptive Field Histograms , 2004, International Journal of Computer Vision.