Binocular Shape Reconstruction: Psychological Plausibility of the 8-Point Algorithm

A similarity structure (shape) of a 3-D object can be reconstructed from two perspective views of the object obtained by a calibrated camera. One of the best-known methods for performing such a reconstruction is the 8-point algorithm. To evaluate the psychological plausibility of this algorithm, we compared its performance to that of human subjects in a task of binocular shape reconstruction. Results show that the reconstructions produced by the 8-point algorithm are substantially less accurate than those produced by human subjects. We then modified the 8-point algorithm by incorporating natural constraints that are present in human binocular vision. Specifically, the new algorithm simulates a fixating system with no torsion. The performance of this algorithm is more robust in the presence of noise compared to that of the 8-point algorithm and is similar to the performance of human subjects.

[1]  R. Hartley Triangulation, Computer Vision and Image Understanding , 1997 .

[2]  Tony Lindeberg,et al.  Direct Estimation of Local Surface Shape in a Fixating Binocular Vision System , 1994, ECCV.

[3]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[4]  Shahriar Negahdaripour,et al.  Multiple Interpretations of the Shape and Motion of Objects from Two Perspective Images , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Du Q. Huynh,et al.  Direct Methods for Self-Calibration of a Moving Stereo Head , 1996, ECCV.

[6]  M. Ali Taalebinezhaad Direct Recovery of Motion and Shape in the General Case by Fixation , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[8]  Michael Brady,et al.  Fast Computation of the Fundamental Matrix for an Active Stereo Vision System , 1996, ECCV.

[9]  R. Watt Scanning from coarse to fine spatial scales in the human visual system after the onset of a stimulus. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[10]  Daniel Raviv,et al.  A Unified Approach to Camera Fixation and Vision-Based Road Following , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[11]  E. Johnston Systematic distortions of shape from stereopsis , 1991, Vision Research.

[12]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[13]  D R Proffitt,et al.  Comparing depth from motion with depth from binocular disparity. , 1995, Journal of experimental psychology. Human perception and performance.

[14]  Arun N. Netravali,et al.  Experience in using homotopy methods to solve motion estimation problems , 1990, Other Conferences.

[15]  Michael S. Landy,et al.  Psychophysical estimation of the human depth combination rule , 1991, Other Conferences.

[16]  James J. Clark,et al.  Data Fusion for Sensory Information Processing Systems , 1990 .

[17]  J. Porrill,et al.  Stereopsis, vertical disparity and relief transformations , 1995, Vision Research.

[18]  Michael S. Landy,et al.  A Statistical Framework for Robust Fusion of Depth Information , 1989, Other Conferences.

[19]  S. McKee,et al.  The imprecision of stereopsis , 1990, Vision Research.

[20]  Olivier D. Faugeras,et al.  A comparison of projective reconstruction methods for pairs of views , 1995, Proceedings of IEEE International Conference on Computer Vision.

[21]  Pietro Perona,et al.  Motion from fixation , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  C. Wheatstone XVIII. Contributions to the physiology of vision. —Part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision , 1962, Philosophical Transactions of the Royal Society of London.

[23]  Azriel Rosenfeld,et al.  Recognition of planar shapes from perspective images using contour-based invariants , 1992, CVGIP Image Underst..

[24]  J. Aloimonos,et al.  Optimal motion estimation , 1989, [1989] Proceedings. Workshop on Visual Motion.

[25]  A. GLENNERSTER,et al.  Stereoscopic Depth Constancy Depends on the Subject's Task , 1996, Vision Research.

[26]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Zygmunt Pizlo,et al.  Perception of 3D scenes from pictures , 1998, Electronic Imaging.

[28]  R. Fox,et al.  The computation of retinal disparity , 1985, Perception & psychophysics.

[29]  Narendra Ahuja,et al.  Motion and Structure From Two Perspective Views: Algorithms, Error Analysis, and Error Estimation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Rajiv Gupta,et al.  Stereo from uncalibrated cameras , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[32]  M F Bradshaw,et al.  Vertical disparities, differential perspective and binocular stereopsis , 1993, Nature.

[33]  Karen K. De Valois,et al.  Discrimination of relative spatial position , 1990, Vision Research.

[34]  J S Tittle,et al.  The visual perception of three-dimensional length. , 1996, Journal of experimental psychology. Human perception and performance.

[35]  Zygmunt Pizlo,et al.  SHAPE RECONSTRUCTION BY A BINOCULAR FIXATING SYSTEM , 1996 .

[36]  Zygmunt Pizlo,et al.  3-D shape perception , 1995, Perception & psychophysics.

[37]  Richard I. Hartley,et al.  Estimation of Relative Camera Positions for Uncalibrated Cameras , 1992, ECCV.

[38]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[39]  J. E. W. Mayhew,et al.  A computational model of binocular depth perception , 1982, Nature.

[40]  Robert H. Thouless,et al.  Phenomenal Regression to the Real Object , 1931, Nature.