The sound of silence

A list of the dynamically changing group membership of a meeting supports a variety of meeting-related activities. Effortless content sharing might be the most important application, but we can also use it to provide business card information for attendees, feed information into calendar applications to simplify scheduling of follow-up meetings, populate the membership of collaborative editing applications, mailing lists, and social networks, and perform many other tasks. We have developed a system that uses audio sensing to maintain meeting membership automatically. We choose audio since hearing the same conversation provides a human-centric notion of attending the same gathering. It takes into account walls and other sound barriers between otherwise closely situated people. It can sense participants attending remotely by teleconference. It does not require attendees to perform any explicit action when participants leave a meeting for which they should no longer have access to associated content. It works indoors and outdoors and does not require pre-populating databases with mapping information. For sensors, we require only the commonly available microphones on mobile devices. Our system exploits a new technique for matching sensed patterns of relative audio silence, or silence signatures, from mobile devices (mobile phones, tablets, laptops) to determine device co-location. A signature based on simple silence patterns rather than a detailed audio characterization reveals less information about the content of potentially private conversations and is also more robustly compared across devices that are not clock synchronized. We evaluate our method in formal indoor meetings and teleconferences and in ad hoc gatherings outdoors and in a noisy cafeteria. Across all our tests so far, our approach determines audio co-location with a worst-case accuracy of 96%, and recovery from these errors takes only a few seconds. We also describe a content sharing application supported by silence signature matching, the limitations of our approach, current status, and future plans.

[1]  Per K. Enge,et al.  Global positioning system: signals, measurements, and performance [Book Review] , 2002, IEEE Aerospace and Electronic Systems Magazine.

[2]  A Kataoka,et al.  Conjugate Structure and Algebraic CELP(CS-ACELP)Coder for the ITU-T 8-kbit/s Speech Coding Standard , 1995 .

[3]  E. Shlomot,et al.  ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications , 1997, IEEE Commun. Mag..

[4]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[5]  Hari Balakrishnan,et al.  6th ACM/IEEE International Conference on on Mobile Computing and Networking (ACM MOBICOM ’00) The Cricket Location-Support System , 2022 .

[6]  Paramvir Bahl,et al.  RADAR: an in-building RF-based user location and tracking system , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[7]  Cristina V. Lopes,et al.  Acoustic Modems for Ubiquitous Computing , 2003, IEEE Pervasive Comput..

[8]  Ken Hinckley,et al.  Synchronous gestures for multiple persons and computers , 2003, UIST '03.

[9]  Fabrice Labeau,et al.  Discrete Time Signal Processing , 2004 .

[10]  John Krumm,et al.  Accuracy characterization for metropolitan-scale Wi-Fi localization , 2005, MobiSys '05.

[11]  Trevor Darrell,et al.  Nearest-Neighbor Methods in Learning and Vision: Theory and Practice (Neural Information Processing) , 2006 .

[12]  Moustafa Youssef,et al.  The Horus WLAN location determination system , 2005, MobiSys '05.

[13]  Tomi Kinnunen,et al.  Real-time speaker identification and verification , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[15]  Hanan Samet,et al.  Foundations of multidimensional and metric data structures , 2006, Morgan Kaufmann series in data management systems.

[16]  Jeff A. Bilmes,et al.  A Privacy-Sensitive Approach to Modeling Multi-Person Conversations , 2007, IJCAI.

[17]  Guobin Shen,et al.  BeepBeep: a high accuracy acoustic ranging system using COTS mobile devices , 2007, SenSys '07.

[18]  Kave Eshghi,et al.  Locality sensitive hash functions based on concomitant rank order statistics , 2008, KDD.

[19]  Young Sub Song,et al.  MOBILE ROBOT LOCALIZATION IN INDOOR ENVIRONMENT USING SCALE-INVARIANT VISUAL LANDMARKS , 2008 .

[20]  Claudio Soriente,et al.  Using audio in secure device pairing , 2009, Int. J. Secur. Networks.

[21]  Wei Pan,et al.  SoundSense: scalable sound sensing for people-centric applications on mobile phones , 2009, MobiSys '09.

[22]  Romit Roy Choudhury,et al.  SurroundSense: mobile phone localization via ambience fingerprinting , 2009, MobiCom '09.

[23]  René Mayrhofer,et al.  Shake Well Before Use: Intuitive and Secure Pairing of Mobile Devices , 2009, IEEE Transactions on Mobile Computing.

[24]  Romit Roy Choudhury,et al.  EnLoc: Energy-Efficient Localization for Mobile Phones , 2009, IEEE INFOCOM 2009.

[25]  Toyoaki Nishida,et al.  Neary: conversation field detection based on similarity of auditory situation , 2009, HotMobile '09.

[26]  Injong Rhee,et al.  Towards Mobile Phone Localization without War-Driving , 2010, 2010 Proceedings IEEE INFOCOM.

[27]  Venkata N. Padmanabhan,et al.  Indoor localization without the pain , 2010, MobiCom.

[28]  Alec Wolman,et al.  Virtual Compass: Relative Positioning to Sense Mobile Social Interactions , 2010, Pervasive.

[29]  Mitchell D. Trott,et al.  Reference-free audio matching for rendezvous , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[30]  Laurent Amsaleg,et al.  Locality sensitive hashing: A comparison of hash function types and querying mechanisms , 2010, Pattern Recognit. Lett..

[31]  Peter A. Dinda,et al.  Indoor localization without infrastructure using the acoustic background spectrum , 2011, MobiSys '11.

[32]  Hans-Werner Gellersen,et al.  Usability classification for spontaneous device association , 2011, Personal and Ubiquitous Computing.

[33]  Jie Liu,et al.  SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones , 2011, Pervasive.

[34]  Fahim Kawsar,et al.  Spatial co-location for device association: the connected object way , 2011, NoME-IoT '11.

[35]  Ig-Jae Kim,et al.  Indoor location sensing using geo-magnetism , 2011, MobiSys '11.

[36]  Hossein Mobahi,et al.  Toward a Practical Face Recognition System: Robust Alignment and Illumination by Sparse Representation , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Tom Minka,et al.  You are facing the Mona Lisa: spot localization using PHY layer information , 2012, MobiSys '12.

[38]  Prasant Mohapatra,et al.  Improving crowd-sourced Wi-Fi localization systems using Bluetooth beacons , 2012, 2012 9th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON).

[39]  Mary Baker,et al.  Determining co-location using a sequential hypothesis test on patterns of silence , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[40]  Mary Baker,et al.  Line-of-sight signaling and positioning for mobile devices , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).