Object Recognition in Humans and Machines

The question of how humans learn, represent and recognize objects has been one of the core questions in cognitive research. With the advent of the field of computer vision — most notably through the seminal work of David Marr — it seemed that the solution lay in a three-dimensional (3D) reconstruction of the environment (Marr 1982, see also one of the first computer vision systems built by Roberts et al. 1965). The success of this approach, however, was limited both in terms of explaining experimental results emerging from cognitive research as well as in enabling computer systems to recognize objects with a performance similar to humans.

[1]  G. Hauske,et al.  Object and scene analysis by saccadic eye-movements: an investigation with higher-order statistics. , 2000, Spatial vision.

[2]  Markus Graf Form, Space and Object: Geometrical Transformations in Object Recognition and Categorization , 2002 .

[3]  Heinrich H. Bülthoff,et al.  Acquiring Robust Representations for Recognition from Image Sequences , 2001, DAGM-Symposium.

[4]  Heinrich H. Bülthoff,et al.  Object recognition in man, monkey, and machine , 1999 .

[5]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[6]  Michel Vidal-Naquet,et al.  Visual features of intermediate complexity and their use in classification , 2002, Nature Neuroscience.

[7]  Giulio Sandini,et al.  Babybot: a biologically inspired developing robotic agent , 2000, SPIE Optics East.

[8]  G. Wallis The role of object motion in forging long-term representations of objects , 2002 .

[9]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[10]  James V. Stone,et al.  Object recognition: view-specificity and motion-specificity , 1999, Vision Research.

[11]  H. Bülthoff,et al.  Face recognition under varying poses: The role of texture and shape , 1996, Vision Research.

[12]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  James T. Tippett,et al.  OPTICAL AND ELECTRO-OPTICAL INFORMATION PROCESSING, , 1965 .

[14]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[15]  Heinrich H. Bülthoff,et al.  Psychophysical support for a 2D view interpolation theory of object recognition , 1991 .

[16]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  J. Koenderink,et al.  The internal representation of solid shape with respect to vision , 1979, Biological Cybernetics.

[18]  Y. Miyashita Neuronal correlate of visual associative long-term memory in the primate temporal cortex , 1988, Nature.

[19]  H. Bülthoff,et al.  Viewpoint Dependence in Visual and Haptic Object Recognition , 2001, Psychological science.

[20]  Heinrich H. Bülthoff,et al.  Automatic acquisition of exemplar-based representations for recognition from image sequences , 2001, CVPR 2001.

[21]  D. Foster,et al.  Recognizing novel three–dimensional objects by summing signals from parts and views , 2002, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[22]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[23]  H. Bülthoff,et al.  Effects of temporal association on recognition memory , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Thomas Vetter,et al.  A morphable model for the synthesis of 3D faces , 1999, SIGGRAPH.

[25]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.