Comparison of two methods of voice activity detection in field studies.

PURPOSE: To evaluate and compare the performance of 2 methods of voice activity detection (a neck-attached accelerometer vs. binaural recordings) in field studies conducted in environments where voice activity normally occurs. METHOD: A group of 11 healthy adults wore the recording equipment during their lunch break. The output of each method was treated as a binary classification (voice activity present or absent) and compared with a gold standard obtained through listening tests. From this comparison, the probability of correct detection (sensitivity, Ps) and the probability of a false positive (Pf) were estimated. Both binary classifiers were set to the same sensitivity of 99%, so the method with the lower false positive rate is the one that performs better. RESULTS: The neck-attached accelerometer (Pf = 0.5%) performed significantly better (p < .001) than the binaural method (Pf = 7%). CONCLUSION: The neck-attached accelerometer is more suitable than the binaural method for voice assessments in environments where people speak in close proximity to each other and where the signal-to-noise ratio is moderate to low.
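
The comparison at a fixed sensitivity can be illustrated with a minimal sketch. It assumes that each method yields one numeric detection score per analysis frame and that the gold standard supplies a 0/1 label per frame; the abstract does not describe the detectors at this level, so the function names `threshold_for_sensitivity` and `rates`, the score representation, and the thresholding rule are all hypothetical. The sketch picks the strictest threshold that still reaches the target Ps and then reports the resulting Pf.

```python
import math


def threshold_for_sensitivity(scores, labels, target=0.99):
    """Return the highest threshold whose sensitivity is at least `target`.

    Assumption (not from the paper): a frame is classified as voiced
    when its score is >= threshold.
    """
    voiced = sorted((s for s, y in zip(scores, labels) if y), reverse=True)
    if not voiced:
        raise ValueError("gold standard contains no voiced frames")
    # Number of voiced frames that must be detected to reach the target;
    # the small epsilon guards against floating-point round-up.
    needed = max(1, math.ceil(target * len(voiced) - 1e-9))
    return voiced[needed - 1]


def rates(scores, labels, threshold):
    """Return (Ps, Pf) for the rule `score >= threshold`."""
    tp = fp = pos = neg = 0
    for s, y in zip(scores, labels):
        detected = s >= threshold
        if y:
            pos += 1
            tp += detected
        else:
            neg += 1
            fp += detected
    if pos == 0 or neg == 0:
        raise ValueError("need both voiced and unvoiced frames")
    return tp / pos, fp / neg


def pf_at_sensitivity(scores, labels, target=0.99):
    """Fix Ps at (or just above) `target` and return the resulting Pf."""
    threshold = threshold_for_sensitivity(scores, labels, target)
    _, pf = rates(scores, labels, threshold)
    return pf
```

Calling `pf_at_sensitivity` once per method with the same gold-standard labels gives two Pf values at matched sensitivity; under the criterion stated in the abstract, the method with the lower value performs better. Testing whether the difference is statistically significant is a separate step that this sketch does not cover.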
