Infrasonic scene fingerprinting for authenticating speaker location

Ambient infrasound with frequency ranges well below 20 Hz is known to carry robust navigation cues that can be exploited to authenticate the location of a speaker. Unfortunately, many of the mobile devices like smartphones have been optimized to work in the human auditory range, thereby suppressing information in the infrasonic region. In this paper, we show that these ultra-low frequency cues can still be extracted from a standard smartphone recording by using acceleration-based cepstral features. To validate our claim, we have collected smartphone recordings from more than 30 different scenes and used the cues for scene fingerprinting. We report scene recognition rates in excess of 90% and a feature set analysis reveals the importance of the infrasonic signatures towards achieving the state-of-the-art recognition performance.

[1]  Romit Roy Choudhury,et al.  SurroundSense: mobile phone localization using ambient sound and light , 2009, MOCO.

[2]  Søren Krarup Olesen,et al.  Measurement of low-frequency noise in rooms , 2006 .

[3]  Jeffrey A. Nystuen,et al.  Weather Classification Using Passive Acoustic Drifters , 1997 .

[4]  J. Hagstrum,et al.  Infrasound and the Avian Navigational Map , 2001, Journal of Navigation.

[5]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[6]  Gerhard Tröster,et al.  RoomSense: an indoor positioning system for smartphones using active sound probing , 2013, AH.

[7]  Martin Vetterli,et al.  Acoustic echoes reveal room shape , 2013, Proceedings of the National Academy of Sciences.

[8]  Christian Rathgeb,et al.  Efficient two-stage speaker identification based on universal background models , 2014, 2014 International Conference of the Biometrics Special Interest Group (BIOSIG).

[9]  Indranil Saha,et al.  journal homepage: www.elsevier.com/locate/neucom , 2022 .

[10]  François Pachet,et al.  The bag-of-frames approach to audio pattern recognition: a sufficient model for urban soundscapes but not for polyphonic music. , 2007, The Journal of the Acoustical Society of America.

[11]  Paul J. M. Havinga,et al.  Towards Smart Surroundings: Enabling Techniques and Technologies for Localization , 2005, LoCA.

[12]  Hong Zhao,et al.  Audio Recording Location Identification Using Acoustic Environment Signature , 2013, IEEE Transactions on Information Forensics and Security.

[13]  S. Chakrabartty,et al.  Exploiting jump-resonance hysteresis in silicon cochlea for formant trajectory encoding , 2012, 2012 IEEE 55th International Midwest Symposium on Circuits and Systems (MWSCAS).

[14]  Hui Liu,et al.  Catch You as I Can: Indoor Localization via Ambient Sound Signature and Human Behavior , 2013, Int. J. Distributed Sens. Networks.

[15]  Wade Trappe,et al.  Fingerprints in the Ether: Channel-Based Authentication , 2009 .

[16]  Anthony Rowe,et al.  Indoor pseudo-ranging of mobile devices using ultrasonic chirps , 2012, SenSys '12.

[17]  William T. Plummer,et al.  Infrasonic Resonances in Natural Underground Cavities , 1969 .

[18]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[19]  Douglas B. Quine,et al.  Infrasound detection by the homing pigeon: A behavioral audiogram , 1979, Journal of comparative physiology.

[20]  Peter A. Dinda,et al.  Indoor localization without infrastructure using the acoustic background spectrum , 2011, MobiSys '11.

[21]  R. Michael Buehrer,et al.  Characterization and detection of location spoofing attacks , 2012, Journal of Communications and Networks.

[22]  Nei Kato,et al.  Effectively Collecting Data for the Location-Based Authentication in Internet of Things , 2017, IEEE Systems Journal.

[23]  Uri Fehr,et al.  Measurements of infrasound from artificial and natural sources , 1967 .

[24]  Alexander Travis Adams,et al.  Public restroom detection on mobile phone via active probing , 2014, SEMWEB.

[25]  Shantanu Chakrabartty,et al.  Exploiting Jump-Resonance Hysteresis in Silicon Auditory Front-Ends for Extracting Speaker Discriminative Formant Trajectories , 2013, IEEE Transactions on Biomedical Circuits and Systems.

[26]  Peter A. Dinda,et al.  Acoustic sensing of location and user presence on mobile computers , 2011 .