Perceptual studies on spatial sound reproduction systems

Author Nick Zacharov Title Perceptual Studies on Spatial Sound Reproduction Systems This study considers research on spatial sound perception in the context of multichannel and other spatial sound reproduction systems. Issues associated with multichannel level alignment are discussed. Additionally aspects relating to the quality of spatial sound reproduction associated with loudspeaker directivity and 3D sound algorithms are considered. Lastly, a study of spatial sound perception is unfolded. This thesis contributes to the understanding of level alignment techniques from both a subjective and objective standpoint in multichannel sound reproduction schemes. The aim of these subjective experiments was to obtain a database of subject responses for a level alignment task under a wide range of normal usage situations, accounting for different source locations, distances, directivities, sensitivities, bandwidths and absolute reproduction levels. The data was analysed and correlated with a set of objective metrics measured for each test condition. From this analysis it is possible to ascertain superior test signal/metric combinations for perceptually motivated level alignment. Results may be directly applied to automated level calibration systems. Tools for binaural real-time loudness measurement are introduced allowing for the assessment of directional loudness characteristics and multiple source loudness alignment within arbitrary reveberation spaces. The second set of experiments focuses on issues influencing the quality of spatial sound reproduction under different multichannel sound reproduction scenarios. Firstly, the influence of loudspeaker directivity is subjectively assessed in a discrete five channel multichannel scenario. A set of experiments are presented that assess the influence of directivity both in the frontal and surround loudspeakers under idealised listening conditions. A range of virtual home theater systems are reviewed and a benchmark experiment is presented that compares their performance with respect to a discrete five channel multichannel reproduction. Evaluations are performed in an idealised “standard listening room” and real room conditions. Relative performances of each system are discussed in terms of relative spatial and timbral degradation. Lastly, initial studies are presented into the multidimensional perceptual unfolding of spatially processed speech reproduced over headphones. This study provides the basis for further studies in this area in an attempt to unravel or “unfold” the perceptual dimensions associated with spatial sound reproduction. UDC 534.8:681.1

[1]  Robert O. Gjerdingen,et al.  The psychology of music , 2002 .

[2]  M. Goldstein,et al.  Multivariate Analysis: Methods and Applications , 1984 .

[3]  John Allnatt,et al.  Transmitted-picture assessment , 1983 .

[4]  Manfred R. Schroeder,et al.  Toward better acoustics for concert halls , 1980 .

[5]  Nick Zacharov,et al.  GuineaPig - A Generic Subjective Test System for Multichannel Audio , 1999 .

[6]  Manfred R. Schroeder,et al.  Comparative study of European concert halls: correlation of subjective preference with geometric and acoustic parameters , 1974 .

[7]  S. Winsberg,et al.  A multidimentional technique for sound quality assessment , 1999 .

[8]  Georg Plenge,et al.  Localization of Lateral Phantom Sources , 1976 .

[9]  David G. Kirby,et al.  Programme Origination of 5-Channel Surround Sound , 1997 .

[10]  Duane H. Cooper,et al.  Discrete-Matrix Multichannel Stereo , 1972 .

[11]  Gary W. Elko,et al.  Effect of loudspeaker position on the robustness of acoustic crosstalk cancellation , 1999, IEEE Signal Processing Letters.

[12]  Morten Meilgaard,et al.  Sensory Evaluation Techniques , 2020 .

[13]  C. H. Chen,et al.  Signal processing handbook , 1988 .

[14]  A. Gabrielsson,et al.  Perceived sound quality of sound-reproducing systems. , 1979, The Journal of the Acoustical Society of America.

[15]  Durand R. Begault,et al.  3-D Sound for Virtual Reality and Multimedia Cambridge , 1994 .

[16]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[17]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[18]  David J. Meares Perceptual Attributes of Multichannel Sound , 1993 .

[19]  Brian R. Shelton,et al.  Comparison of three adaptive psychophysical procedures , 1982 .

[20]  B Hagerman,et al.  Perceived sound quality of reproductions with different frequency responses and sound levels. , 1990, The Journal of the Acoustical Society of America.

[21]  Søren Bech Calibration of Relative Level Differences of a Domestic Multichannel Sound Reproduction System , 1998 .

[22]  Dermot Furlong,et al.  Improved Spectral Stereo Head Model , 1995 .

[23]  Dana S. Hougland,et al.  Concert and Opera Halls: How They Sound , 1996 .

[24]  Søren Bech The Influence of Room Acoustics on Reproduced Sound, Part 1: Selection and Training of Subjects for Listening Tests , 1989 .

[25]  H. Levitt Transformed up-down methods in psychoacoustics. , 1971, The Journal of the Acoustical Society of America.

[26]  Francis Rumsey,et al.  Spatial Attribute Identification and Scaling by Repertory Grid Technique and Other Methods , 1999 .

[27]  M. M. Taylor,et al.  PEST: Efficient Estimates on Probability Functions , 1967 .

[28]  G. Kelly The Psychology of Personal Constructs , 2020 .

[29]  Nick Zacharov Subjective appraisal of loudspeaker directivity for multichannel reproduction : On the loudspeaker directivity considerations for 5.1-channel audio-visual reproduction: A subjective appraisal , 1997 .

[30]  M. Barron The subjective effects of first reflections in concert halls—The need for lateral reflections , 1971 .

[31]  D. Deutsch,et al.  The Psychology of Music , 1983 .

[32]  Tomlinson Holman New Factors in Sound for Cinema and Television , 1991 .

[33]  Hareo Hamada,et al.  The "Stereo Dipole": Binaural Sound Reproduction Using Two Closely Spaced Loudspeakers , 1997 .

[34]  W. G. Gardner,et al.  3-D Audio Using Loudspeakers , 1998 .

[35]  Dermot Furlong,et al.  Spectral Stereo Surround Sound Pan-Pot , 1991 .

[36]  Ronaldus Maria Aarts,et al.  A Comparison of Some Loudness Measures for Loudspeaker Listening Tests , 1992 .

[37]  Søren Bech,et al.  Interaction Between Audio-Visual Factors in a Home Theater System: Definition of Subjective Attributes , 1995 .

[38]  William M. Hartmann,et al.  Psychoacoustics: Facts and Models , 2001 .

[39]  Alf Gabrielsson Dimension analyses of perceived sound quality of sound-reproducing systems , 1979 .

[40]  Nick Zacharov An overview of multichannel level alignment , 1998 .

[41]  Francis Rumsey,et al.  Identification of Perceived Spatial Attributes of Recordings by Repertory Grid Technique and Other Methods , 1999 .

[42]  B. R. Shelton,et al.  Two-alternative versus three-alternative procedures for threshold estimation , 1984, Perception & psychophysics.

[43]  R. Walker,et al.  A Controlled-Reflection Listening Room for Multi-Channel Sound , 1998 .

[44]  Setsu Komiyama,et al.  Subjective Evaluation of Multi-Channel Stereophony for HDTV , 1987, IEEE Transactions on Broadcasting.

[45]  RONALD M. AARTS Calculation of the loudness of loudspeakers during listening tests , 1990 .

[46]  Jerry Bauck,et al.  Generalized transaural stereo and applications , 1996 .

[47]  D. M. Green,et al.  A comparison of method-of-adjustment and forced-choice procedures in frequency discrimination , 1976 .

[48]  Søren Bech Selection and Training of Subjects for Listening Tests on Sound-Reproducing Equipment , 1992 .

[49]  John Vanderkooy,et al.  Transfer-Function Measurement with Maximum-Length Sequences , 1989 .

[50]  Günther Theile,et al.  Enlarging of the Listening Area by Increasing the Number of Loudspeakers , 1990 .

[51]  Paul Smith,et al.  DTS Multi-Channel 96 kHz Audio Compression , 1999 .

[52]  L. N. Kanal,et al.  Handbook of Statistics, Vol. 2. Classification, Pattern Recognition and Reduction of Dimensionality. , 1985 .

[53]  Antonio Bellacicco,et al.  Handbook of statistics 2: Classification, pattern recognition and reduction of dimensionality: P.R. KRISHNAIAH and L.N. KANAL (Eds.) North-Holland, Amsterdam, 1982, xxii + 903 pages, Dfl.275.00 , 1984 .

[54]  Gary W. Elko,et al.  Optimum loudspeaker spacing for robust crosstalk cancellation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[55]  S Buus,et al.  Temporal integration of loudness, loudness discrimination, and the form of the loudness function. , 1997, The Journal of the Acoustical Society of America.

[56]  Masayuki Morimoto The Role of Rear Loudspeakers in Spatial Impression , 1997 .

[57]  J. Robert Stuart,et al.  The MLP Lossless Compression System , 1999 .

[58]  William L. Martens,et al.  Simulating the Cues of Spatial Hearing in Natural Environments , 1984, ICMC.

[59]  Jyri Huopaniemi,et al.  Efficient HRTF synthesis using an interaural transfer function model , 2000, 2000 10th European Signal Processing Conference.

[60]  James O. Ramsay,et al.  The joint analysis of direct ratings, pairwise preferences, and dissimilarities , 1980 .

[61]  William L. Martens,et al.  Multidimensional Perceptual Unfolding of Spatially Processed Speech I: Deriving Stimulus Space Using INDSCAL , 2000 .

[62]  Mendel Kleiner,et al.  Auralization-An Overview , 1993 .

[63]  Søren Bech,et al.  The use of subwoofers in the context of surround sound program reproduction , 1998 .

[64]  Mark Davis The AC-3 Multichannel Coder , 1993 .

[65]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[66]  F E Toole,et al.  In-head localization of acoustic images. , 1970, The Journal of the Acoustical Society of America.

[67]  J. L. Hall Maximum‐Likelihood Sequential Procedure for Estimation of Psychometric Functions , 1968 .

[68]  W. Heiser,et al.  PREFMAP-3 User's Guide , 1986 .

[69]  Günther Theile HDTV Sound Systems: How Many Channels? , 1991 .

[70]  Søren Bech The Influence of Stereophonic Width on the Perceived Quality of an Audiovisual Presentation Using a Multichannel Sound System , 1998 .

[71]  R. Rayleigh The Theory of Sound, Two Volumes In One , 1945 .

[72]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[73]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[74]  Kevin Kotorynski Digital Binaural/Stereo Conversion and Crosstalk Cancelling , 1990 .

[75]  S. S. Stevens Procedure for Calculating Loudness: Mark VI , 1961 .

[76]  J. Grey Multidimensional perceptual scaling of musical timbres. , 1977, The Journal of the Acoustical Society of America.

[77]  Günther Theile Trends and Activities in the Development of Multichannel Sound Systems , 1993 .

[78]  B. Moore,et al.  A revision of Zwicker's loudness model , 1996 .

[79]  Søren Bech,et al.  Methods for Subjective Evaluation of Spatial Characteristics of Sound , 1999 .

[80]  Harry T. Lawless,et al.  Sensory Evaluation of Food , 1999 .

[81]  H. Fletcher,et al.  Loudness, its definition, measurement and calculation. , 1933 .

[82]  Søren Bech,et al.  Interaction Between Audio-Visual Factors in a Home Theater System: Experimental Results , 1995 .

[83]  R. Fisher 014: On the "Probable Error" of a Coefficient of Correlation Deduced from a Small Sample. , 1921 .

[84]  A. Hesse Comparison of several psychophysical procedures with respect to threshold estimates, reproducibility and efficiency , 1986 .

[85]  Søren Bech Multichannel Level Alignment, Part V: The Effects of Reproduction Level, Reproduction Room, Step Size, and Symmetry , 2000 .

[86]  John M. Findlay,et al.  Estimates on probability functions: A more virulent PEST , 1978 .

[87]  S. S. Stevens Perceived Level of Noise by Mark VII and Decibels (E) , 1972 .

[88]  Thomas Baer,et al.  A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[89]  D. C. Howell Statistical Methods for Psychology , 1987 .

[90]  Francis Rumsey Subjective Assessment of the Spatial Attributes of Reproduced Sound , 1998 .

[91]  H. Wallach,et al.  The role of head movements and vestibular and visual cues in sound localization. , 1940 .

[92]  Elizabeth A. Peck,et al.  Introduction to Linear Regression Analysis , 2001 .

[93]  L. Tucker,et al.  An individual differences model for multidimensional scaling , 1963 .

[94]  B. Cardozo Adjusting the Method of Adjustment: SD vs DL , 1965 .

[95]  Gerhard Steinke Surround Sound-The New Phase , 1996 .

[96]  P. Damaske,et al.  Head‐Related Two‐Channel Stereophony with Loudspeaker Reproduction , 1971 .

[97]  Søren Bech,et al.  MULTICHANNEL LEVEL ALIGNMENT, PART IV: THE CORRELATION BETWEEN PHYSICAL MEASURES AND SUBJECTIVE LEVEL CALIBRATION , 2000 .

[98]  Floyd E. Toole Subjective Measurements of Loudspeaker Sound Quality and Listener Performance , 1985 .

[99]  Manfred R. Schroeder,et al.  Computer simulation of sound transmission in rooms , 1963 .

[100]  Tomlinson Holman,et al.  Sound for film and television , 1997 .

[101]  Søren Bech,et al.  Multichannel Level Alignment, Part III: The Influence of Loudspeaker Directivity and Reproduction Bandwidth , 1999 .

[102]  Søren Bech,et al.  Multichannel Level Alignment, Part II: The Influence of Signals and Loudspeaker Placement , 1998 .

[103]  Nick Zacharov,et al.  GLS - A generalised listener selection procedure , 2001 .

[104]  Forrest W. Young,et al.  Nonmetric individual differences multidimensional scaling: An alternating least squares method with optimal scaling features , 1977 .

[105]  W Jesteadt,et al.  Intensity and frequency discrimination in one- and two-interval paradigms. , 1972, The Journal of the Acoustical Society of America.

[106]  J. L. Hall Hybrid adaptive procedure for estimation of psychometric functions. , 1980, The Journal of the Acoustical Society of America.

[107]  P. Moran On the method of paired comparisons. , 1947, Biometrika.

[108]  Roy Martin Christensen,et al.  Compatible FM Broadcasting of Panoramic Sound , 1972 .

[109]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[110]  Tomlinson Holman,et al.  Comments on 'Subjective Appraisal of Loudspeaker Directivity for Multichannel Reproduction' and Author's Reply , 2000 .

[111]  Søren Bech,et al.  Multichannel Level Alignment, Part I: Signals and Methods , 1998 .

[112]  Bert Berlant Loudspeaker Directionality and the Perception of Reality , 1985 .

[113]  Michael J. Gerzon Periphony: With-Height Sound Reproduction , 1973 .

[114]  Nick Zacharov,et al.  A real‐time binaural loudness meter , 2000 .

[115]  Ronaldus Maria Aarts,et al.  On the design and psychophysical assessment of loudspeaker systems , 1995 .

[116]  Matti Karjalainen,et al.  Objective and Subjective Evaluation of Head-Related Transfer Function Filter Design , 1999 .

[117]  Günther Theile,et al.  Multichannel Natural Recording Based on Psychoacoustic Principles , 2000 .

[118]  M. Schroeder Integrated‐impulse method measuring sound decay without using impulses , 1979 .

[119]  M. Gardner Image fusion, broadening, and displacement in sound location. , 1969, The Journal of the Acoustical Society of America.

[120]  Michael Friis Sørensen,et al.  Directional dependence of loudness and binaural summation , 1995 .

[121]  S Buus,et al.  Temporal integration of loudness as a function of level. , 1995, The Journal of the Acoustical Society of America.

[122]  Duane H. Cooper,et al.  Prospects for Transaural Recording , 1989 .

[123]  Benjamin B. Bauer,et al.  Stereophonic Earphones and Binaural Loudspeakers , 1961 .

[124]  David Lubman,et al.  Spatial averaging in sound power measurements , 1971 .

[125]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[126]  Jyri Huopaniemi,et al.  Results of a Round Robin Subjective Evaluation of Virtual Home Theatre Sound Systems , 1999 .

[127]  J Blauert,et al.  Auditory spaciousness: some further psychoacoustic analyses. , 1986, The Journal of the Acoustical Society of America.

[128]  Jyri Huopaniemi,et al.  VIRTUAL ACOUSTICS AND 3-D SOUND IN MULTIMEDIA SIGNAL PROCESSING , 2000 .

[129]  J Kreiman,et al.  Individual differences in voice quality perception. , 1989, Journal of speech and hearing research.

[130]  Björn Lindström,et al.  Perceived Sound Quality of High-Fidelity Loudspeakers , 1985 .

[131]  Francis Rumsey,et al.  In Search of the Spatial Dimensions of Reproduced Sound: Verbal Protocol Analysis and Cluster Analysis of Scaled Verbal Descriptors , 2000 .

[132]  Yoichi Ando,et al.  Concert Hall Acoustics , 1985 .

[133]  Francis Rumsey,et al.  Correlation between Emotive, Descriptive and Naturalness Attributes in Subjective Data Relating to Spatial Sound Reproduction , 2000 .

[134]  Tomlinson Holman 5.1 Surround Sound: Up and Running , 2000 .