Sound source localisation on Android smartphones: A first step to using smartphones as auditory sensors for training A.I systems with Big Data

The ability to estimate positions of sound sources is one that gives animals a 360° awareness of their acoustic environment. This helps complement the visual scene which is restricted to 180° in humans. Unfortunately, deaf people are left out on this ability. Smart phones are rapidly becoming a common tool amongst mobile users in developed and emerging markets. Their processing ability has more than doubled since their introduction to mass consumer markets by Apple in 2007. Top-end smart phones such as the Samsung Galaxy Series; 3, 4, and 5 models, have two microphones with which one can acquire stereo recordings. The purpose of this research project was to establish a feasible Sound source localization algorithm for current top-end smart phones, and to recommend hardware improvements for future smart phones, to pave way for the use of smart phones as advanced auditory sensory devices capable of acting as avatars for intelligent remote systems to learn about different acoustic scenes with help of human users.

[1]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[2]  J.B. Allen,et al.  A unified approach to short-time Fourier analysis and synthesis , 1977, Proceedings of the IEEE.

[3]  Joshua D. Reiss,et al.  Calculating Time Delays of Multiple Active Sources in Live Sound , 2010 .

[4]  Ck Cheng,et al.  The Age of Big Data , 2015 .

[5]  Stanley T. Birchfield,et al.  Acoustic localization by interaural level difference , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[6]  Stefan Wermter,et al.  Robotic sound-source localisation architecture using cross-correlation and recurrent neural networks , 2009, Neural Networks.

[7]  DeLiang Wang,et al.  A computational auditory scene analysis system for speech segregation and robust speech recognition , 2010, Comput. Speech Lang..

[8]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[9]  Jean Rouat,et al.  Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering , 2007, Robotics Auton. Syst..

[10]  J. Gerard Wolff,et al.  Big Data and the SP Theory of Intelligence , 2013, IEEE Access.

[11]  Michael A Akeroyd,et al.  The psychoacoustics of binaural hearing , 2006, International journal of audiology.

[12]  Alex Doboli,et al.  Improved sound-based localization through a network of reconfigurable mixed-signal nodes , 2010, 2010 IEEE International Workshop on Robotic and Sensors Environments.

[13]  W. M. Rabinowitz,et al.  Auditory localization of nearby sources. Head-related transfer functions. , 1999, The Journal of the Acoustical Society of America.

[14]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[15]  Shihab A. Shamma,et al.  MEASUREMENT OF HEAD-RELATED TRANSFER FUNCTIONS BASED ON THE EMPIRICAL TRANSFER FUNCTION ESTIMATE , 2003 .