Multidimensional Perceptual Unfolding of Spatially Processed Speech I: Deriving Stimulus Space Using INDSCAL

Multidimensional perceptual unfolding was executed for a set of spatially-processed speech samples reproduced via headphones. This paper describes only the first stage of a two-part study employing the analytic technique termed external unfolding. In this first stage, global dissimilarity ratings were made for all pairwise comparisons of the experimental stimuli under each of four listening conditions, which included broadband or bandlimited speech samples presented either simultaneously or sequentially. These four datasets were analyzed independently using INDSCAL (INdividual Differences SCALing), a method for processing inter-stimulus dissimilarity data that has specific advantages over classical MultiDimensional Scaling (MDS) analysis. First, it is designed to characterize quantitatively the individual differences in responses obtained from a group of experimental subjects. Second, the spatial configuration of points derived for the experimental stimuli, termed the Stimulus Space, has an inherently unique orientation that has none of the ambiguity that makes the interpretation of classical MDS results problematic. The Stimulus Space derived in the first stage of this study is one of two inputs required for external unfolding. It is combined with discrete attribute ratings collected in the second stage of this study to reveal the principal perceptual attributes of the set of spatially-processed speech samples, and their relative salience under various conditions.

[1]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[2]  Andrzej Pietrzyk Computer Modeling of the Sound Field in Small Rooms , 1998 .

[3]  H. Colburn,et al.  On the minimum-phase approximation of head-related transfer functions , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[4]  Antonio Bellacicco,et al.  Handbook of statistics 2: Classification, pattern recognition and reduction of dimensionality: P.R. KRISHNAIAH and L.N. KANAL (Eds.) North-Holland, Amsterdam, 1982, xxii + 903 pages, Dfl.275.00 , 1984 .

[5]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[6]  William L. Martens,et al.  Simulating the Cues of Spatial Hearing in Natural Environments , 1984, ICMC.

[7]  M. Gardner Image fusion, broadening, and displacement in sound location. , 1969, The Journal of the Acoustical Society of America.

[8]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[9]  M. Barron The subjective effects of first reflections in concert halls—The need for lateral reflections , 1971 .

[10]  M. Goldstein,et al.  Multivariate Analysis: Methods and Applications , 1984 .

[11]  Nick Zacharov,et al.  GuineaPig - A Generic Subjective Test System for Multichannel Audio , 1999 .

[12]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[13]  T. Næs,et al.  Multivariate analysis of data in sensory science , 1996 .

[14]  L. N. Kanal,et al.  Handbook of Statistics, Vol. 2. Classification, Pattern Recognition and Reduction of Dimensionality. , 1985 .

[15]  Nick Zacharov,et al.  GLS - A generalised listener selection procedure , 2001 .

[16]  安藤 四一,et al.  Music and concert hall acoustics , 1997 .

[17]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[18]  Mendel Kleiner,et al.  Auralization-An Overview , 1993 .

[19]  Henrik Møller,et al.  Audibility of all-pass components in head-related transfer functions , 2000 .

[20]  Durand R. Begault Virtual Acoustic Displays for Teleconferencing: Intelligibility Advantage for 'Telephone-Grade' Audio , 1999 .

[21]  W. Heiser,et al.  PREFMAP-3 User's Guide , 1986 .

[22]  L. Tucker,et al.  An individual differences model for multidimensional scaling , 1963 .

[23]  F E Toole,et al.  In-head localization of acoustic images. , 1970, The Journal of the Acoustical Society of America.

[24]  H. Colburn,et al.  Sensitivity of human subjects to head-related transfer-function phase spectra. , 1999, Journal of the Acoustical Society of America.

[25]  Søren Bech Perception of Timbre of Reproduced Sound in Small Rooms: Influence of Room and Loudspeaker Position , 1994 .

[26]  J Blauert,et al.  Auditory spaciousness: some further psychoacoustic analyses. , 1986, The Journal of the Acoustical Society of America.