Accurate vergence control in complex scenes

In binocular visual systems, vergence is the process of directing the gaze so that the optical axes intersect at a surface point. Correlation-based methods of disparity analysis provide fast estimates of the vergence error. Unfortunately most correlation techniques do not provide mechanisms to determine which image locations contributed to a given correlation peak. The result is that large correlation peaks may have contributions from image arena not relevant to the vergence task. This paper presents a vergence system that applies a cepstral filter to multiscale images obtained from a dominant-eye binocular sensor. As used by this system, the cepstral filter has two main advantages: it enhances targets through narrow-band signal suppression, and it supports a back-projection operation to determine the image locations associated with particular correlation peaks. The use of multiscale images allows the system to have both high resolution for precision in the final vergence and a large field of view for a wide range of initial camera orientations without undue computational cost.<<ETX>>

[1]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[2]  A. Lynn Abbott,et al.  Surface Reconstruction By Dynamic Integration Of Focus, Camera Vergence, And Stereo , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[3]  B. P. Bogert,et al.  The quefrency analysis of time series for echoes : cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking , 1963 .

[4]  Christopher M. Brown,et al.  Real-time smooth pursuit tracking for a moving binocular robot , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Christopher M. Brown,et al.  Gaze controls cooperating through prediction , 1990, Image Vis. Comput..

[6]  Yehezkel Yeshurun,et al.  Cepstral Filtering on a Columnar Image Architecture: A Fast Algorithm for Binocular Stereo Segmentation , 2011, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Ralf Kories,et al.  Stereo Ranging with Verging Cameras , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Jake K. Aggarwal,et al.  Structure from stereo-a review , 1989, IEEE Trans. Syst. Man Cybern..

[9]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[10]  Thomas J. Olson,et al.  Fixation-based filtering , 1992, Other Conferences.

[11]  James J. Clark,et al.  Modal Control Of An Attentive Vision System , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[12]  James L. Crowley,et al.  Gaze Control for a Binocular Camera Head , 1992, ECCV.

[13]  Thomas J. Olson,et al.  Stereopsis for verging systems , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[14]  D. Whitteridge Movements of the eyes R. H. S. Carpenter, Pion Ltd, London (1977), 420 pp., $27.00 , 1979, Neuroscience.

[15]  Dana H. Ballard,et al.  Reference Frames for Animate Vision , 1989, IJCAI.