Mobile maestro: enabling immersive multi-speaker audio applications on commodity mobile devices

The goal of this work is to provide an abstraction of ideal sound environments to a new emerging class of Mobile Multi-speaker Audio (MMA) applications. Typically, it is challenging for MMA applications to implement advanced sound features (e.g., surround sound) accurately in mobile environments, especially due to unknown, irregular loudspeaker configurations. Towards an illusion that MMA applications run over specific loudspeaker configurations (i.e., speaker type, layout), this work proposes AMAC, a new Adaptive Mobile Audio Coordination system that senses the acoustic characteristics of mobile environments and controls individual loud-speakers adaptively and accurately. The prototype of AMAC implemented on commodity smartphones shows that it provides the coordination accuracy in sound arrival time in several tens of microseconds and reduces the variance in sound level substantially.

[1]  Ajay D. Kshemkalyani,et al.  Clock synchronization for wireless sensor networks: a survey , 2005, Ad Hoc Networks.

[2]  Ganapati Panda,et al.  Advances in active noise control: A survey, with emphasis on recent nonlinear techniques , 2013, Signal Process..

[3]  Floyd E. Toole,et al.  Sound Reproduction: The Acoustics and Psychoacoustics of Loudspeakers and Rooms , 2008 .

[4]  Oliver Hellmuth,et al.  Spatial Audio Object Coding (SAOC) - The Upcoming MPEG Standard on Parametric Object Based Audio Coding , 2008 .

[5]  Young-Tae Kim,et al.  Practical implementation of personal audio in a mobile device , 2013 .

[6]  Yang-Hann Kim,et al.  Generation of an acoustically bright zone with an illuminated region using multiple sources. , 2002, The Journal of the Acoustical Society of America.

[7]  B. Bauer,et al.  Phasor analysis of some stereophonic phenomena , 1962 .

[8]  J. Borish,et al.  An efficient algorithm for measuring the impulse response using pseudorandom noise , 1983 .

[9]  Xiaolin Li,et al.  Guoguo: enabling fine-grained indoor localization via smartphone , 2013, MobiSys '13.

[10]  Ville Pulkki,et al.  Virtual Sound Source Positioning Using Vector Base Amplitude Panning , 1997 .

[11]  David Chu,et al.  SwordFight: enabling a new class of phone-to-phone action games on commodity phones , 2012, MobiSys '12.

[12]  Inseok Hwang,et al.  SocioPhone: everyday face-to-face interaction monitoring platform using multi-phone sensor fusion , 2013, MobiSys '13.

[13]  H. Gaskell The precedence effect , 1983, Hearing Research.

[14]  Abraham Lempel,et al.  On Fast M-Sequence Transforms , 1998 .

[15]  Yang-Hann Kim,et al.  Integral Approach for Reproduction of Virtual Sound Source Surrounded by Loudspeaker Array , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Saurabh Ganeriwal,et al.  Timing-sync protocol for sensor networks , 2003, SenSys '03.

[17]  Marije a. j. Baalman Spatial composition techniques and sound spatialisation technologies , 2010 .

[18]  Benjamin B. Bauer Phasor Analysis of the Stereophonic Phenomena , 1961 .

[19]  Joji Kuriyama,et al.  Adaptive Loudspeaker System , 1989 .

[20]  Keith Barker,et al.  A New Approach to the Assessment of Stereophonic Sound System Performance , 1985 .

[21]  Masato Miyoshi,et al.  Inverse filtering of room acoustics , 1988, IEEE Trans. Acoust. Speech Signal Process..

[22]  Deborah Estrin,et al.  Proceedings of the 5th Symposium on Operating Systems Design and Implementation Fine-grained Network Time Synchronization Using Reference Broadcasts , 2022 .

[23]  Aki Härmä Online Acoustic Measurements in a Networked Audio System , 2006 .

[24]  Guobin Shen,et al.  BeepBeep: a high accuracy acoustic ranging system using COTS mobile devices , 2007, SenSys '07.

[25]  Franz Zotter,et al.  AN AMBISONICS FORMAT FOR FLEXIBLE PLAYBACK LAYOUTS , 2009 .

[26]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[27]  David L. Mills,et al.  Internet time synchronization: the network time protocol , 1991, IEEE Trans. Commun..

[28]  Yang-Hann Kim,et al.  Sound Visualization and Manipulation , 2013 .