Machine-learning-based estimation and rendering of scattering in virtual reality.

In this work, a technique to render the acoustic effect of scattering from finite objects in virtual reality is proposed, which aims to provide a perceptually plausible response for the listener, rather than a physically accurate response. The effect is implemented using parametric filter structures and the parameters for the filters are estimated using artificial neural networks. The networks may be trained with modeled or measured data. The input data consist of a set of geometric features describing a large quantity of source-object-receiver configurations, and the target data consist of the filter parameters computed using measured or modeled data. A proof-of-concept implementation is presented, where the geometric descriptions and computationally modeled responses of three-dimensional plate objects are used for training. In a dynamic test scenario, with a single source and plate, the approach is shown to provide a similar spectrogram when compared with a reference case, although some spectral differences remain present. Nevertheless, it is shown with a perceptual test that the technique produces only a slightly lower degree of plausibility than the state-of-the-art acoustic scattering model that accounts for diffraction, and also that the proposed technique yields a prominently higher degree of plausibility than a model that omits diffraction.

[1]  M. Biot,et al.  Formulation of Wave Propagation in Infinite Media by Normal Coordinates with an Application to Diffraction , 1957 .

[2]  Nikunj Raghuvanshi,et al.  Parametric wave field coding for precomputed sound propagation , 2014, ACM Trans. Graph..

[3]  Mark J. Beeson,et al.  Modelling Spatial Sound Occlusion and Diffraction Effects Using the Digital Waveguide Mesh , 2003 .

[4]  Mark D. Plumbley,et al.  Combining Fully Convolutional and Recurrent Neural Networks for Single Channel Audio Source Separation , 2018 .

[5]  U. Svensson,et al.  An analytic secondary source model of edge diffraction impulse responses , 1999 .

[6]  D. Hewett,et al.  Diffraction by a right-angled impedance wedge: an edge source formulation. , 2014, The Journal of the Acoustical Society of America.

[7]  Tomi Huttunen,et al.  Rapid Generation of Personalized HRTFs , 2014 .

[8]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[9]  Stefano Squartini,et al.  Deep Neural Networks for Road Surface Roughness Classification from Acoustic Signals , 2018 .

[10]  Tapio Lokki,et al.  Creating Interactive Virtual Acoustic Environments , 1999 .

[11]  Teuvo Kohonen,et al.  The 'neural' phonetic typewriter , 1988, Computer.

[12]  Nicolas Tsingos,et al.  Fast Rendering of Sound Occlusion and Diffraction Effects for Virtual Acoustic Environments , 1998 .

[13]  U. Svensson,et al.  An integral equation formulation for the diffraction from convex plates and polyhedra. , 2013, The Journal of the Acoustical Society of America.

[14]  Thomas A. Funkhouser,et al.  Modeling acoustics in virtual environments using the uniform theory of diffraction , 2001, SIGGRAPH.

[15]  Wenyu Jiang,et al.  Using Programmable Graphics Hardware for Acoustics and Audio Rendering , 2009 .

[16]  Renato S. Pellegrini,et al.  Quality assessment of auditory virtual environments , 2001 .

[17]  Dinesh Manocha,et al.  Acoustic Classification and Optimization for Multi-Modal Rendering of Real-World Scenes , 2018, IEEE Transactions on Visualization and Computer Graphics.

[18]  Tapio Lokki,et al.  An Efficient Auralization of Edge Diffraction , 2002 .

[19]  Tapio Lokki,et al.  Implementation and Visualization of Edge Diffraction with Image-source Method , 2002 .

[20]  Dinesh Manocha,et al.  High-order diffraction and diffuse reflections for interactive sound propagation in large environments , 2014, ACM Trans. Graph..

[21]  Colin Brown,et al.  Neural network classification of multibeam backscatter and bathymetry data from Stanton Bank (Area IV) , 2009 .

[22]  Heikki Huttunen,et al.  Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[23]  Lauri Savioja,et al.  Overview of geometrical room acoustic modeling techniques. , 2015, The Journal of the Acoustical Society of America.

[24]  M. Vorländer Computer simulations in room acoustics: concepts and uncertainties. , 2013, The Journal of the Acoustical Society of America.

[25]  Modeling sound scattering using a combination of the edge source integral equation and the boundary element method. , 2018, The Journal of the Acoustical Society of America.