Speech activity detection using accelerometer

The level of social activity is linked to the overall wellbeing and to various disorders, including stress. In this regard, a myriad of automatic solutions for monitoring social interactions have been proposed, usually including audio data analysis. Such approaches often face legal and ethical issues and they may also raise privacy concerns in monitored subjects thus affecting their natural behaviour. In this paper we present an accelerometer-based speech detection which does not require capturing sensitive data while being an easily applicable and a cost-effective solution.

[1]  Tanzeem Choudhury,et al.  Passive and In-Situ assessment of mental and physical well-being using mobile sensors , 2011, UbiComp '11.

[2]  B. Widrow,et al.  Adaptive noise cancelling: Principles and applications , 1975 .

[3]  Vittorio Murino,et al.  Look at Who's Talking: Voice Activity Detection by Automated Gesture Analysis , 2011, AmI Workshops.

[4]  Tom Chau,et al.  Augmentative Communication Based on Realtime Vocal Cord Vibration Detection , 2010, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[5]  Merryn J Mathie,et al.  Accelerometry: providing an integrated, practical method for long-term, ambulatory monitoring of human movement , 2004, Physiological measurement.

[6]  William M. Campbell,et al.  Exploiting Nonacoustic Sensors for Speech Encoding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  I. Titze Physiologic and acoustic differences between male and female voices. , 1989, The Journal of the Acoustical Society of America.

[8]  Judith C Hays,et al.  Psychosocial and clinical predictors of unipolar depression outcome in older adults , 2002, International journal of geriatric psychiatry.

[9]  Alex Pentland,et al.  Sensing and modeling human networks using the sociometer , 2003, Seventh IEEE International Symposium on Wearable Computers, 2003. Proceedings..

[10]  Jeff A. Bilmes,et al.  Inferring colocation and conversation networks from privacy-sensitive audio with implications for computational social science , 2011, TIST.

[11]  Nathan Eagle,et al.  Machine perception and learning of complex social systems , 2005 .

[12]  Robert Stewart,et al.  Social activity and improvement in depressive symptoms in older people: a prospective community cohort study. , 2009, The American journal of geriatric psychiatry : official journal of the American Association for Geriatric Psychiatry.

[13]  J Sundberg Chest wall vibrations in singers. , 1983, Journal of speech and hearing research.

[14]  J. House,et al.  Social relationships and health. , 1988, Science.

[15]  Andrew T. Campbell,et al.  Bewell: A smartphone application to monitor, model and promote wellbeing , 2011, PervasiveHealth 2011.

[16]  Tsuhan Chen,et al.  Cross-modal prediction in audio-visual communication , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.