Digital Stereo Video: display, compression and transmission

Digital stereo video is a digitised stream of stereo image pairs which, when observed by a human with binocular vision, conveys a great deal of information about a particular scene. The main benefit over traditional digital video is that with stereo, depth in an image can be perceived with greater accuracy. With the growth of the Internet in the last decade, we are beginning to have enough bandwidth for the transmission of high quality digital video across the world in near realtime. This growth is set to accelerate, and the quality and feature set of digital video will subsequently increase. It is inevitable that the transmission of digital stereo video across the Internet will be commonplace in the near future. This dissertation presents several techniques that can be applied now to facilitate the production, transmission and display of digital stereo video. The results from different implementations indicate that many goals can be achieved cheaply with commodity components, and more importantly, that high quality digital stereo video can be transmitted over the Internet with bandwidth and processing power not much greater than available today.

[1]  A. Churchill The Physiology of the Eye , 1949, The British journal of ophthalmology.

[2]  Jon Postel,et al.  User Datagram Protocol , 1980, RFC.

[3]  Ieee Standards Board IEEE Standard for a High Performance Serial Bus-Amendment 1 , 2000 .

[4]  J. Goodman Introduction to Fourier optics , 1969 .

[5]  John R. Aschenbrenner,et al.  Open Systems Interconnection , 1986, IBM Syst. J..

[6]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[7]  B. Duval Commission internationale de l’éclairage (CIE) , 2001, Optique Photonique.

[8]  David Wettergreen,et al.  Developing Nomad for robotic exploration of the Atacama Desert , 1999, Robotics Auton. Syst..

[9]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[10]  S. A. Talbot Physiology of the retina and the visual pathway , 1961 .

[11]  Katsuya Matsunaga,et al.  Evaluation of stereoscopic video cameras synchronized with the movement of an operator's head on the teleoperation of the actual backhoe shovel , 1999, Electronic Imaging.

[12]  Blake Hannaford,et al.  Quantitative Evaluation of Perspective and Stereoscopic Displays in Three-Axis Manual Tracking Tasks , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Tomás F. Pena,et al.  Parallel Computation of Wavelet Transforms Using the Lifting Scheme , 2001, The Journal of Supercomputing.

[14]  Jörg Ott,et al.  RTP Payload Format for the 1998 Version of ITU-T Rec. H.263 Video (H.263+) , 1998, RFC.

[15]  Andrew J. Woods,et al.  Image distortions in stereoscopic video systems , 1993, Electronic Imaging.

[16]  John C. Hart,et al.  The CAVE: audio visual experience automatic virtual environment , 1992, CACM.

[17]  Henning Schulzrinne,et al.  RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.

[18]  Ross L. Pepper,et al.  Stereo TV Improves Operator Performance Under Degraded Visibility Conditions , 1981 .

[19]  Larry H. Matthies,et al.  A photo-realistic 3-D mapping system for extreme nuclear environments: Chernobyl , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[20]  Kazunori Shidoji,et al.  Comparison of operation efficiency for the insert task when using stereoscopic images with additional lines, stereoscopic images, and a manipulator with force feedback , 1999, Electronic Imaging.

[21]  Y. Yeh,et al.  Limits of Fusion and Depth Judgment in Stereoscopic Color Displays , 1990, Human factors.

[22]  Wim Sweldens,et al.  Building your own wavelets at home , 2000 .

[23]  Mandy Porter,et al.  Standardisation , 1971, Nature.

[24]  Van Jacobson,et al.  TCP Extension for High-Speed Paths , 1990, RFC.

[25]  A. Robert Calderbank,et al.  Lossless image compression using integer to integer wavelet transforms , 1997, Proceedings of International Conference on Image Processing.

[26]  Itu-T Video coding for low bitrate communication , 1996 .

[27]  Tim Edwards,et al.  Discrete Wavelet Transforms: Theory and Implementation , 1991 .

[28]  Boyce Nemec,et al.  Society of Motion Picture and Television Engineers , 1954 .

[29]  W. Richard Stevens,et al.  TCP/IP Illustrated, Volume 1: The Protocols , 1994 .

[30]  RTP Payload Format for JPEG-compressed Video , 1996, RFC.

[31]  W. Richard Stevens,et al.  UNIX Network Programming: Networking APIs: Sockets and XTI , 1997 .

[32]  H. S. Osborne,et al.  The international electrotechnical commission , 1953, Electrical Engineering.

[33]  Bjarne Stroustrup,et al.  The C++ Programming Language, Second Edition , 1991 .

[34]  Wim Sweldens,et al.  The lifting scheme: a construction of second generation wavelets , 1998 .

[35]  Koiti Motokawa,et al.  Physiology of Color and Pattern Vision , 1970 .

[36]  Gregory K. Wallace,et al.  The JPEG Still Image Compression Standard , 1991 .

[37]  R. L. Claypoole,et al.  Nonlinear wavelet transforms for image coding , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).

[38]  John Roby Benson Baron Charnwood An essay on binocular vision , 1965 .

[39]  C. Valens The Fast Lifting Wavelet Transform , 2004 .

[40]  Hongyang Chao,et al.  An Approach to Fast Integer Reversible Wavelet Transforms for Image Compression , 1996 .

[41]  Lothar Mühlbach,et al.  Telepresence in Videocommunications: A Study on Stereoscopy and Individual Eye Contact , 1995, Hum. Factors.

[42]  Gabriel Fernandez,et al.  LIFTPACK: a software package for wavelet transforms using lifting , 1996, Optics & Photonics.

[43]  Amnon Yariv,et al.  Optical Waves in Crystals: Propagation and Control of Laser Radiation , 1983 .

[44]  Bjarne Stroustrup,et al.  C++ Programming Language , 1986, IEEE Softw..

[45]  Hugh Davson Physiology of the Eye , 1951 .

[46]  Wim Sweldens,et al.  Lifting scheme: a new philosophy in biorthogonal wavelet constructions , 1995, Optics + Photonics.

[47]  J. Pokorny The Perception of Light and Colour , 1975 .

[48]  Henning Schulzrinne,et al.  Real Time Streaming Protocol (RTSP) , 1998, RFC.

[49]  W. Sweldens Wavelets and the lifting scheme : A 5 minute tour , 1996 .

[50]  Christian Huitema,et al.  RTP Payload Format for H.261 Video Streams , 1996, RFC.

[51]  I. Daubechies,et al.  Factoring wavelet transforms into lifting steps , 1998 .

[52]  Charles A. Poynton,et al.  A technical introduction to digital video , 1996 .

[53]  Gregg Podnar,et al.  Geometry of binocular imaging II: the augmented eye , 1995, Electronic Imaging.

[54]  I. Daubechies,et al.  Wavelet Transforms That Map Integers to Integers , 1998 .

[55]  Alexander Zelinsky,et al.  Development of a visually-guided autonomous underwater vehicle , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[56]  W. Sweldens The Lifting Scheme: A Custom - Design Construction of Biorthogonal Wavelets "Industrial Mathematics , 1996 .

[57]  Frank E. Schneider,et al.  Teleoperation with compressed motion picture sequences , 1996 .

[58]  Gregg Podnar,et al.  Geometry of binocular imaging , 1994, Electronic Imaging.

[59]  G. Brink,et al.  What is the diplopia threshold? , 1981, Perception & psychophysics.

[60]  W. Richard Stevens Networking APIs : sockets and XTI , 1998 .