Computer Vision: Past and Future

“What does it mean to see? The plain man’s answer (and Aristotle’s too) would be to know what is where by looking.” These introductory words in the seminal book of David Marr [54] capture the essence of what researchers in computer vision have been trying to make computers do for almost half a century. In this paper we will outline the development of the field, emphasising the last ten years, and the discuss what the challenges in the field are.

[1]  Ulf Grenander,et al.  A Unified Approach to Pattern Analysis , 1970, Adv. Comput..

[2]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[3]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[4]  I Biederman,et al.  To what extent can matching algorithms based on direct outputs of spatial filters account for human object recognition? , 1996, Spatial vision.

[5]  Emanuele Trucco,et al.  Introductory techniques for 3-D computer vision , 1998 .

[6]  Timothy F. Cootes,et al.  A mixture model for representing shape variation , 1999, Image Vis. Comput..

[7]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[8]  Ernst D. Dickmanns,et al.  Vehicles Capable of Dynamic Vision: A New Breed of Technical Beings? , 1998, Artif. Intell..

[9]  Andrew P. Witkin,et al.  Scale-Space Filtering , 1983, IJCAI.

[10]  Olivier D. Faugeras,et al.  Variational principles, surface evolution, PDEs, level set methods, and the stereo problem , 1998, IEEE Trans. Image Process..

[11]  S. Zeki A vision of the brain , 1993 .

[12]  T. Poggio A theory of how the brain might work. , 1990, Cold Spring Harbor symposia on quantitative biology.

[13]  Martha J. Farah [Visual agnosia]. , 1971, Shinkei kenkyu no shimpo. Advances in neurological sciences.

[14]  Jitendra Malik,et al.  Determining Three-Dimensional Shape from Orientation and Spatial Frequency Disparities , 1991, ECCV.

[15]  Yehezkel Lamdan,et al.  Object recognition by affine invariant matching , 2011, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Kokichi Sugihara,et al.  An Algebraic Approach to Shape-from-Image Problems , 1984, Artif. Intell..

[17]  S. Grossberg,et al.  Neural networks for vision and image processing , 1992 .

[18]  Peter Johansen,et al.  Gaussian Scale-Space Theory , 1997, Computational Imaging and Vision.

[19]  Takeo Watanabe,et al.  Neural networks for vision and image processing , 1993 .

[20]  Claus B. Madsen,et al.  A Viewpoint Planning Strategy for Determining True Angles on Polyhedral Objects by Camera Alignment , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Thomas S. Huang,et al.  Image processing , 1971 .

[22]  Gunnar Sparr,et al.  Perspective Area-Invariants , 1985 .

[23]  S. Mallat A wavelet tour of signal processing , 1998 .

[24]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[25]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[26]  Andrea J. van Doorn,et al.  Invariant Properties of the Motion Parallax Field due to the Movement of Rigid Bodies Relative to an Observer , 1975 .

[27]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[28]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[29]  R. Hetherington The Perception of the Visual World , 1952 .

[30]  Michael Arbib,et al.  From Vision to Action via Distributed Computation , 1997 .

[31]  Robert L. Lillestrand,et al.  Techniques ror Change Detection , 1972, IEEE Transactions on Computers.

[32]  S Edelman,et al.  A model of visual recognition and categorization. , 1997, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[33]  James J. Clark,et al.  Modal Control Of An Attentive Vision System , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[34]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[35]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[36]  Y. Bar-Shalom Tracking and data association , 1988 .

[37]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[39]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch , 1981 .

[40]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[41]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[42]  S. Nayar,et al.  Early Visual Learning , 1996 .

[43]  Jan-Olof Eklundh,et al.  Integrating primary ocular processes , 1992, Image Vis. Comput..

[44]  Thomas S. Huang,et al.  Correction to "Estimating 3-D motion parameters of a rigid planar patch,II: Singular value decomposition" , 1983 .

[45]  Allen M. Waxman,et al.  Contour Evolution, Neighborhood Deformation, and Global Image Flow: Planar Surfaces in Motion , 1985 .

[46]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[48]  Michael Isard,et al.  Active Contours , 2000, Springer London.

[49]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[50]  Hans-Hellmut Nagel,et al.  Image sequence evaluation: 30 years and still going strong , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[51]  Richard I. Hartley,et al.  Estimation of Relative Camera Positions for Uncalibrated Cameras , 1992, ECCV.

[52]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[53]  Dana H. Ballard,et al.  Eye Fixation And Early Vision: Kinetic Depth , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[54]  Peter J. Burt,et al.  Smart sensing within a pyramid vision machine , 1988, Proc. IEEE.

[55]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[56]  Nikola Kasabov,et al.  Brain-like Computing and Intelligent Information Systems , 1998 .

[57]  Berthold K. P. Horn Understanding Image Intensities , 1977, Artif. Intell..

[58]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[59]  Refractor Vision , 2000, The Lancet.

[60]  Shimon Edelman,et al.  Representation and recognition in vision , 1999 .

[61]  James L. Crowley,et al.  Vision as Process , 1995 .

[62]  Béla Julesz,et al.  Visual Pattern Discrimination , 1962, IRE Trans. Inf. Theory.

[63]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[64]  D. Gabor INFORMATION THEORY IN ELECTRON MICROSCOPY. , 1965, Laboratory investigation; a journal of technical methods and pathology.

[65]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[66]  R. Cipolla,et al.  Motion planning using image divergence and deformation , 1993 .

[67]  J. Sethian Level set methods : evolving interfaces in geometry, fluid mechanics, computer vision, and materials science , 1996 .

[68]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[69]  H. Wilson Psychophysical Evidence for Spatial Channels , 1983 .

[70]  Stuart C. Shapiro,et al.  Encyclopedia of artificial intelligence, vols. 1 and 2 (2nd ed.) , 1992 .

[71]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[72]  Thomas O. Binford,et al.  Inferring Surfaces from Images , 1981, Artif. Intell..