FOVEATION FOR 3D VISUALIZATION AND STEREO IMAGING

Even though computer vision and digital photogrammetry share a number of goals, techniques, and methods, the potential for cooperation between these fields is not fully exploited. In attempt to help bridging the two, this work brings a well-known computer vision and image processing technique called foveation and introduces it to photogrammetry, creating a hybrid application. The results may be beneficial for both fields, plus the general stereo imaging community, and virtual reality applications. Foveation is a biologically motivated image compression method that is often used for transmitting videos and images over networks. It is possible to view foveation as an area of interest management method as well as a compression technique. While the most common foveation applications are in 2D there are a number of binocular approaches as well. For this research, the current state of the art in the literature on level of detail, human visual system, stereoscopic perception, stereoscopic displays, 2D and 3D foveation, and digital photogrammetry were reviewed. After the review, a stereo-foveation model was constructed and an implementation was realized to demonstrate a proof of concept. The conceptual approach is treated as generic, while the implementation was conducted under certain limitations, which are documented in the relevant context. A stand-alone program called Foveaglyph is created in the implementation process. Foveaglyph takes a stereo pair as input and uses an image matching algorithm to find the parallax values. It then calculates the 3D coordinates for each pixel from the geometric relationships between the object and the camera configuration or via a parallax function. Once 3D coordinates are obtained, a 3D image pyramid is created. Then, using a distance dependent level of detail function, spherical volume rings with varying resolutions throughout the 3D space are created. The user determines the area of interest. The result of the application is a user controlled, highly compressed non-uniform 3D anaglyph image. 2D foveation is also provided as an option. This type of development in a photogrammetric visualization unit is beneficial for system performance. The research is particularly relevant for large displays and head mounted displays. Although, the implementation, because it is done for a single user, would possibly be best suited to a head mounted display (HMD) application. The resulting stereo-foveated image can be loaded moderately faster than the uniform original. Therefore, the program can potentially be adapted to an active vision system and manage the scene as the user glances around, given that an eye tracker determines where exactly the eyes accommodate. This exploration may also be extended to robotics and other robot vision applications. Additionally, it can also be used for attention management and the viewer can be directed to the object(s) of interest the demonstrator would like to present (e.g. in 3D cinema). Based on the literature, we also believe this approach should help resolve several problems associated with stereoscopic displays such as the accommodation convergence problem and diplopia. While the available literature provides some empirical evidence to support the usability and benefits of stereo foveation, further tests are needed. User surveys related to the human factors in using stereo foveated images, such as its possible contribution to prevent user discomfort and virtual simulator sickness (VSS) in virtual environments, are left as future work.

[1]  Patrik Ottoson Geographic Indexing and Data Management for 3D-Visualisation , 2001 .

[2]  S. B. Kang,et al.  An Active Multibaseline Stereo System with Real-Time Image Acquisition , 1994 .

[3]  H. Carr An introduction to space perception , 1935 .

[4]  Donald D. Hoffman,et al.  Visual Intelligence: How We Create What We See , 1998 .

[5]  David Kortenkamp,et al.  Integrating a Behavior-Based Approach to Active Stereo Vision With an Intelligent Control Architectu , 1995 .

[6]  HIROYUKI YAMAMOTO,et al.  An Active Foveated Vision System: Attentional Mechanisms and Scan Path Covergence Measures , 1996, Comput. Vis. Image Underst..

[7]  Alan C. Bovik,et al.  Real-time foveation techniques for H.263 video encoding in software , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[8]  Joel Pokorny,et al.  Sight and Mind. An Introduction to Visual Perception , 1974 .

[9]  Tetsuri Inoue,et al.  Measurement of the human factors of 3-D images on a large screen , 1990, Other Conferences.

[10]  M W Morgan,et al.  Accommodation and vergence. , 1968, American journal of optometry and archives of American Academy of Optometry.

[11]  S. Ellis Pictorial communication in virtual and real environments , 1991 .

[12]  Anup Basu,et al.  Prioritized region of interest coding in JPEG2000 , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Jan Paul Siebert,et al.  A fast foveated stereo matcher , 2000 .

[14]  Carlo Tomasi,et al.  A Pixel Dissimilarity Measure That Is Insensitive to Image Sampling , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  S. A. Talbot,et al.  Physiological Studies on Neural Mechanisms of Visual Localization and Discrimination , 1941 .

[16]  Chandrajit L. Bajaj,et al.  Active visualization in a multidisplay immersive environment , 2002, Comput. Graph..

[17]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[18]  Michael W. McGreevy,et al.  Methods for user-based reduction of model complexity for virtual planetary exploration , 1993, Electronic Imaging.

[19]  Mark Mon-Williams,et al.  Binocular Virtual Reality Displays: When Problems Do and Don't Occur , 1998, Hum. Factors.

[20]  Henrik Haggrén Stereoscopy application of spherical imaging , 2005 .

[21]  Active Stereo Vision System with Foveated Wide Angle Lenses , 1995, ACCV.

[22]  Ken Nakayama,et al.  Properties of early motion processing: Implications for the sensing of ego motion , 1990 .

[23]  Didier Arquès,et al.  Simulation of Blur in Stereoscopic Image Synthesis for Virtual Reality , 2005 .

[24]  Wilfried Linder,et al.  Digital Photogrammetry , 2003 .

[25]  Rawatee Maharaj-Sharma Online Lecture Notes , 2005 .

[26]  Stanley Fok Foveated Stereo Video Compression for Visual Telepresence , 2002 .

[27]  Jan Paul Siebert,et al.  Foveated vision for space-variant scene reconstruction , 2004 .

[28]  S M Anstis,et al.  Letter: A chart demonstrating variations in acuity with retinal position. , 1974, Vision research.

[29]  Patrick Rives,et al.  A new approach to visual servoing in robotics , 1992, IEEE Trans. Robotics Autom..

[30]  Ian van der Linde Multiresolution image compression using image foveation and simulated depth of field for stereoscopic displays , 2004 .

[31]  James H. Clark,et al.  Hierarchical geometric models for visible surface algorithms , 1976, CACM.

[32]  Anthony E. Cawkell,et al.  Understanding Virtual Reality , 2003, J. Documentation.

[33]  Martin D. Levine,et al.  A Review of Biologically Motivated Space-Variant Data Reduction Models for Robotic Vision , 1998, Comput. Vis. Image Underst..

[35]  Martin Reddy,et al.  Specification and evaluation of level of detail selection criteria , 1998, Virtual Reality.

[36]  Barbara Gillam,et al.  The Perception of Spatial Layout from Static Optical Information , 1995 .

[37]  Eugenia M. Kolasinski,et al.  Simulator Sickness in Virtual Environments. , 1995 .

[38]  Yasuo Kuniyoshi,et al.  Calibration of a foveated wide-angle lens on an active vision head , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[39]  M. Obeng,et al.  The American Heritage Stedman's Medical Dictionary , 2003 .

[40]  Ee-Chien Chang,et al.  Foveation Techniques and Scheduling Issues in Thinwire Visualization , 1998 .

[41]  Benjamin Watson,et al.  A User Study Evaluating Level of Detail Degradation in the Periphery of Head-Mounted Displays , 1995 .

[42]  A. Tescher Reconstruction of quadric surfaces from disparity measurements , 1994 .

[43]  Anup Basu,et al.  Prioritized region of interest coding in JPEG2000 , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[44]  Patrick Rives,et al.  A new approach to visual servoing in robotics , 1992, IEEE Trans. Robotics Autom..

[45]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[46]  M. Halle Autostereoscopic displays and computer graphics , 1997, COMG.

[47]  E. Schwartz The development of specific visual connections in the monkey and the goldfish: outline of a geometric theory of receptotopic structure. , 1977, Journal of theoretical biology.

[48]  Umberto Castellani,et al.  On-line Compendium of Computer Vision , 2002 .

[49]  Jirí Zára,et al.  Fast depth of field rendering with surface splatting , 2003, Proceedings Computer Graphics International 2003.

[50]  D. G. Green,et al.  Monocular versus Binocular Visual Acuity , 1965, Nature.

[51]  Ivan E. Sutherland,et al.  A head-mounted three dimensional display , 1968, AFIPS Fall Joint Computing Conference.

[52]  Carolina Cruz-Neira,et al.  Surround-Screen Projection-Based Virtual Reality: The Design and Implementation of the CAVE , 2023 .

[53]  Nick Holliman,et al.  3D Display Systems , 2003 .

[54]  Bent Dalgaard Larsen,et al.  Real-time Terrain Rendering using Smooth Hardware Optimized Level of Detail , 2003, WSCG.

[55]  P. R. ANDREWS,et al.  Images at the blind spot , 1991, Nature.

[56]  A. Hendrickson,et al.  The morphological development of the human fovea. , 1984, Ophthalmology.

[57]  Martin D. Levine,et al.  A Real-Time Foveated Sensor with Overlapping Receptive Fields , 1997, Real Time Imaging.

[58]  Jonathan D. Pfautz,et al.  Depth Perception in Computer Graphics , 2000 .

[59]  Colin Ware,et al.  Dynamic stereo displays , 1995, CHI '95.

[60]  L D Silverstein,et al.  Spatial Judgments with Monoscopic and Stereoscopic Presentation of Perspective Displays , 1992, Human factors.

[61]  M. H. Pirenne,et al.  VISION AND THE EYE , 1949 .

[62]  Avishai Henik,et al.  On stereo image coding , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[63]  A. Hendrickson,et al.  Distribution of cones in human and monkey retina: individual variability and radial asymmetry. , 1987, Science.

[64]  Ee-Chien Chang,et al.  A wavelet approach to foveating images , 1997, SCG '97.

[65]  R. Clark Visual astronomy of the deep sky , 1990 .

[66]  P. Howarth,et al.  The occurrence of virtual simulation sickness symptoms when an HMD was used as a personal viewing system , 1997 .

[67]  R. Haber,et al.  Visual Perception , 2018, Encyclopedia of Database Systems.

[68]  Hideyuki Tamura,et al.  Gaze-directed adaptive rendering for interacting with virtual space , 1996, Proceedings of the IEEE 1996 Virtual Reality Annual International Symposium.

[69]  Ee-Chien Chang,et al.  Realtime visualization of large images over a thinwire , 1997 .

[70]  G.E. Moore,et al.  Cramming More Components Onto Integrated Circuits , 1998, Proceedings of the IEEE.

[71]  James A. Ferwerda,et al.  Elements of Early Vision for Computer Graphics (Tutorial) , 2001, IEEE Computer Graphics and Applications.

[72]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[73]  G. Hung Quantitative analysis of the accommodative convergence to accommodation ratio: linear and nonlinear static models , 1997, IEEE Transactions on Biomedical Engineering.

[74]  Robert Skerjanc,et al.  New generation of 3D desktop computer interfaces , 1997, Electronic Imaging.

[75]  Irene Cheng,et al.  Foveated online 3D visualization , 2002, Object recognition supported by user interaction for service robots.

[76]  William L. Martens,et al.  Physiological approach to optimal stereographic game programming: a technical guide , 1996, Electronic Imaging.

[77]  Percy W. Cobb,et al.  The Retinal Processes Concerned with Visual Acuity and Color Vision , 1932 .

[78]  P K Ahnelt,et al.  Identification of a subtype of cone photoreceptor, likely to be blue sensitive, in the human retina , 1987, The Journal of comparative neurology.

[79]  Helmut Gernsheim,et al.  The history of photography: From the camera obscura to the beginning of the modern era, , 1969 .

[80]  Gregg Podnar,et al.  Geometry of binocular imaging , 1994, Electronic Imaging.

[81]  Oliver Schreer,et al.  Segmentation-Based Postprocessing in Real-Time Immersive Video Conferencing , 2001, VMV.

[82]  Wilson S. Geisler,et al.  Variable‐Resolution Displays for Visual Communication and Simulation , 1999 .

[83]  Benjamin B. Bederson,et al.  Voice-bandwidth visual communication through logmaps: the Telecortex , 1992, [1992] Proceedings IEEE Workshop on Applications of Computer Vision.

[84]  E. Chang Wavelet Foveation , 1999 .

[85]  Chee Yap,et al.  Responsive Thinwire Visualization of Large Geographic Datasets , 2002 .

[86]  Wilson S. Geisler,et al.  Implementation of a foveated image coding system for image bandwidth reduction , 1996, Electronic Imaging.

[87]  Mark Mon-Williams,et al.  Natural problems for stereoscopic depth perception in virtual environments , 1995, Vision Research.

[88]  Paul Debevec,et al.  Modeling and Rendering Architecture from Photographs , 1996, SIGGRAPH 1996.

[89]  Lawrence W. Stark,et al.  Visual enhancements in pick-and-place tasks: Human operators controlling a simulated cylindrical manipulator , 1987, IEEE Journal on Robotics and Automation.

[90]  Martin Reddy,et al.  Perceptually modulated level of detail for virtual environments , 1997 .

[91]  Geoffrey S. Hubona,et al.  The relative contributions of stereo, lighting, and background scenes in promoting 3D depth visualization , 1999, TCHI.

[92]  Woodrow Barfield,et al.  Judgments of Azimuth and Elevation as a Function of Monoscopic and Binocular Depth Cues Using a Perspective Display , 1995, Hum. Factors.

[93]  Paul Milgram,et al.  Positioning accuracy of a virtual stereographic pointer in a real stereoscopic video world , 1991, Electronic Imaging.

[94]  Arnulf Remote,et al.  PICTORIAL COMMUNICATION IN VIRTUAL AND REAL ENVIRONMENTS , 1992 .

[95]  Carlo Tomasi,et al.  Depth Discontinuities by Pixel-to-Pixel Stereo , 1999, International Journal of Computer Vision.

[96]  Shojiro Nagata,et al.  How to reinforce perception of depth in single two-dimensional pictures , 1991 .

[97]  J. Bullinaria,et al.  Incorporating Developmental Factors into Models of Accommodation and Vergence , 2002 .

[98]  Giulio Sandini,et al.  Visuo-inertial stabilization in space-variant binocular systems , 2000, Robotics Auton. Syst..

[99]  M. Smith Close range photogrammetry and machine vision , 1997 .

[100]  E. L. Schwartz,et al.  Spatial mapping in the primate sensory projection: Analytic structure and relevance to perception , 1977, Biological Cybernetics.

[102]  Wilson S. Geisler,et al.  Gaze-contingent real-time simulation of arbitrary visual fields , 2002, IS&T/SPIE Electronic Imaging.

[103]  Paul Sharkey,et al.  Operator performance evaluation of controlled depth of field in a stereographically displayed virtual environment , 2001, IS&T/SPIE Electronic Imaging.

[104]  Lenny Lipton,et al.  Foundations of the stereoscopic cinema : a study in depth , 1984 .

[105]  Anup Basu,et al.  Variable resolution teleconferencing , 1993, Proceedings of IEEE Systems Man and Cybernetics Conference - SMC.

[106]  Richard Alan Peters,et al.  Centering peripheral features in an indoor environment using a binocular log-polar 4 DOF camera head , 1996, Robotics Auton. Syst..

[107]  Arzu Çöltekin,et al.  Stereo-foveation for anaglyph imaging , 2005 .

[108]  S. Pastoor,et al.  Stereoscopic image representation with synthetic depth of field , 1997 .

[109]  Rajesh P. N. Rao,et al.  Eye Movements in Visual Cognition: A Computational Study , 1997 .

[110]  Colin Ware,et al.  Dynamic adjustment of stereo display parameters , 1998, IEEE Trans. Syst. Man Cybern. Part A.

[111]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[112]  Benjamin Watson,et al.  Effectiveness of spatial level of detail degradation in the periphery of head-mounted displays , 1996, CHI Conference Companion.

[113]  N. Drasdo The neural representation of visual space , 1977, Nature.

[114]  A. Bernardino,et al.  Binocular Visual Tracking : Integration of Perception and Control , 1999 .

[115]  Alan C. Bovik,et al.  High quality, low delay foveated visual communications over mobile channels , 2005, J. Vis. Commun. Image Represent..

[116]  Sovira Tan,et al.  Performance of three recursive algorithms for fast space-variant Gaussian filtering , 2003, Real-time imaging.

[117]  Robert van Liere,et al.  Fast perception-based depth of field rendering , 2000, VRST '00.

[118]  G. F. McLean,et al.  Teleoperated system performance evaluation , 1994 .

[119]  Herbert Couchman McKay,et al.  Three-dimensional photography : principles of stereoscopy , 1951 .

[120]  M. E. Cox Handbook of Optics , 1980 .

[121]  Whitman Richards,et al.  Stereopsis and stereoblindness , 2006, Experimental Brain Research.

[122]  Zoran Constantinescu,et al.  Levels of Detail: An Overview , 2000 .

[123]  Peter M. Atkinson,et al.  Modelling scale in geographical information science , 2001 .

[124]  Dawid Pajak General-Purpose Computation Using Graphics Hardware for Fast HDR Image Processing , 2007 .

[125]  Scott S. Fisher,et al.  Stereoscopic Displays and Applications III , 1992 .

[126]  Matthias Wöpking,et al.  3-D displays: A review of current technologies , 1997 .

[127]  Chih-Ping Yeh,et al.  Cyclopean stereo vision for depth perception , 1993, Electronic Imaging.

[128]  Wilson S. Geisler,et al.  Real-time foveated multiresolution system for low-bandwidth video communication , 1998, Electronic Imaging.

[129]  Marc Levoy,et al.  Zippered polygon meshes from range images , 1994, SIGGRAPH.

[130]  John L. Barron,et al.  3D foveated visualization on the Web , 2000, IS&T/SPIE Electronic Imaging.

[131]  Alan C. Bovik,et al.  Real-time foveation techniques for low bit rate video coding , 2003, Real Time Imaging.

[132]  Alexandre Bernardino,et al.  A Binocular Stereo Algorithm for Log-Polar Foveated Systems , 2002, Biologically Motivated Computer Vision.

[133]  Elaine Nicpon Marieb,et al.  Essentials of Human Anatomy and Physiology , 1981 .