Extrafoveal Video Extension for an Immersive Viewing Experience

With the recent popularity of virtual reality (VR) and the development of 3D, immersion has become an integral part of entertainment. Head-mounted display (HMD) devices are often used to give users a feeling of immersion in the environment. Another technique is to project additional material surrounding the viewer, as in CAVE systems. As a continuation of this technique, it would be interesting to extend surrounding projection to current television or cinema screens. The idea is to entirely fill the viewer's field of vision, thus providing a more complete feeling of being in the scene and part of the story. The appropriate content can be captured using large field-of-view (FoV) technology, using a rig of cameras for 110° to 360° capture, or created using computer-generated images. The FoV is, however, rather limited for existing (legacy) content, covering only 36° to 90° depending on the distance from the screen. This paper seeks to overcome this FoV limitation by proposing computer vision techniques to extend such legacy content into peripheral (extrafoveal) vision without changing the original creative intent or damaging the viewer's experience. A new methodology is also proposed for performing user tests in order to evaluate the quality of the experience and confirm that the sense of immersion has been increased.
This paper thus presents: i) an algorithm that spatially extends the video based on characteristics of human vision, ii) a subjective comparison of its results with state-of-the-art techniques, iii) the protocol required to evaluate the quality of experience (QoE), and iv) the results of the user tests.
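To make the idea of extrafoveal extension concrete, the sketch below shows a deliberately naive baseline, not the paper's algorithm: it mirror-pads each frame outward and radially attenuates the padded region, exploiting the fact that peripheral vision has reduced acuity and contrast sensitivity, so coarse, dimmed content is sufficient there. The function name, the padding scheme, and the attenuation factor are all illustrative assumptions.

```python
import numpy as np

def extend_frame(frame: np.ndarray, pad: int) -> np.ndarray:
    """Naive extrafoveal extension of one video frame.

    Mirror-pads the frame by `pad` pixels on each side, then fades the
    padded band toward the outer edge to mimic reduced peripheral acuity.
    `pad` must be smaller than each spatial dimension of `frame`.
    """
    h, w, _ = frame.shape
    # Mirror the border content outward (spatial axes only).
    out = np.pad(frame.astype(np.float32),
                 ((pad, pad), (pad, pad), (0, 0)), mode="reflect")
    # Normalized distance into the padded band: 0 inside the original
    # frame, 1 at the outermost pixel (Chebyshev distance to the frame).
    yy, xx = np.mgrid[0:h + 2 * pad, 0:w + 2 * pad]
    dy = np.clip(np.maximum(pad - yy, yy - (pad + h - 1)), 0, pad)
    dx = np.clip(np.maximum(pad - xx, xx - (pad + w - 1)), 0, pad)
    dist = np.maximum(dy, dx) / pad
    # Linear fade: full brightness inside, ~30% at the outer edge.
    out *= (1.0 - 0.7 * dist)[..., None]
    return out.astype(np.uint8)
```

The original frame is left untouched at the center, so the creative intent of the foveal region is preserved; only the synthesized surround is altered. A practical system would replace the mirror step with a content-aware extrapolation and add temporal smoothing to avoid flicker in the periphery.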