Subjective quality assessment of multichannel audio accompanied with video in representative broadcasting genres

Immersive broadcasting applications have received considerable attention in recent years. In this context, advanced HDTV and 3DTV formats are being successfully adopted by the consumer market and are having a strong impact on the way traditional broadcast content is displayed to end users. Alongside these advances in video technology, multichannel spatial audio has also gained considerable momentum within the audiovisual industry. However, the need for specific production tools and loudspeaker setups for multiple competing audio formats seems to be an important factor affecting their adoption by consumers. Moreover, it is well known that perceived audio quality is strongly influenced by the reproduction context, in which the multimodal interaction between audio and video plays a very important role. This paper presents a formal evaluation of the perceived sound quality provided by several spatial audio formats accompanied by video in the context of television broadcasting. Stereo, advanced surround formats, and 3D binaural sound are evaluated on a set of representative broadcast content (sports, movies, music, and animation) to assess their impact on the perceptual attributes defined in international recommendations.
