Synthesis of Spatially Extended Virtual Sources with Time-Frequency Decomposition of Mono Signals

Synthesizing volumetric virtual sources is a useful technique for auditory displays and virtual worlds. The task can be reduced to synthesizing perceived spatial extent. Previous research on virtual-world Directional Audio Coding has shown that spatial extent can be synthesized from monophonic sources by applying a time-frequency-space decomposition, i.e., by randomly distributing the time-frequency bins of the source signal in space. Although this technique often produced a perception of spatial extent, the effect was not guaranteed, and the timbre could degrade. This article revisits the technique in detail and examines the effect of its parameters, with the aim of achieving optimal quality and perception in all situations. Results from a series of informal and formal experiments are presented; they suggest that the revised method is viable in many cases, although its performance depends to some degree on the signal content, so the parameters must be tuned properly. Furthermore, the method is shown to produce different distribution widths. From a psychoacoustical perspective, it is interesting that spatially distributed narrow frequency bands form a single, spatially extended auditory event with no apparent directional focus.
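The core idea of randomly distributing time-frequency bins can be illustrated with a minimal sketch. It is not the article's implementation: the function names (`stft`, `istft`, `spread_mono`), the Hann window, the frame and hop sizes, the uniform random assignment, and the use of one output channel per direction are all assumptions made for illustration. Each short-time Fourier bin of the mono signal is assigned to exactly one output channel, so the channels sum back to the original signal.

```python
import numpy as np

def stft(x, n_fft, hop):
    """Hann-windowed STFT; the signal is zero-padded by n_fft on both sides."""
    win = np.hanning(n_fft + 1)[:-1]            # periodic Hann window
    xp = np.pad(x, n_fft)                       # padding keeps edge effects outside the signal
    n_frames = 1 + (len(xp) - n_fft) // hop
    frames = np.stack([xp[i * hop:i * hop + n_fft] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)          # shape: (frames, n_fft // 2 + 1)

def istft(X, n_fft, hop, length):
    """Weighted overlap-add inverse of stft(); returns `length` samples."""
    win = np.hanning(n_fft + 1)[:-1]
    frames = np.fft.irfft(X, n=n_fft, axis=1) * win
    y = np.zeros(n_fft + hop * (X.shape[0] - 1))
    norm = np.zeros_like(y)
    for i, frame in enumerate(frames):
        y[i * hop:i * hop + n_fft] += frame
        norm[i * hop:i * hop + n_fft] += win ** 2
    y /= np.maximum(norm, 1e-12)                # normalize by the summed squared window
    return y[n_fft:n_fft + length]              # drop the padding added in stft()

def spread_mono(x, n_channels=8, n_fft=1024, hop=256, seed=0):
    """Assign every time-frequency bin of x to one randomly chosen channel.

    Assumption: channels stand for loudspeaker or virtual-source directions,
    and the assignment is uniform and independent per bin.
    """
    rng = np.random.default_rng(seed)
    X = stft(x, n_fft, hop)
    assign = rng.integers(0, n_channels, size=X.shape)
    channels = [istft(np.where(assign == c, X, 0), n_fft, hop, len(x))
                for c in range(n_channels)]
    return np.stack(channels)                   # shape: (n_channels, len(x))

# Usage: spread one second of noise at 16 kHz over 8 channels.
signal = np.random.default_rng(1).standard_normal(16000)
outputs = spread_mono(signal)
```

Restricting the random assignment to a subset of adjacent channels would be one hypothetical way to obtain a narrower distribution. The parameter choices the article examines, such as time and frequency resolution, are not reproduced here.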
