Automatic speech and singing classification in ambulatory recordings for normal and disordered voices.

Ambulatory voice monitoring is a promising tool for investigating phonotraumatic vocal hyperfunction (PVH), which is associated with the development of vocal fold lesions. Because many patients with PVH are professional vocalists, a classifier was developed to distinguish speech from singing in ambulatory recordings, so that phonatory mechanisms in each mode can be better understood. Twenty singers with PVH and 20 matched healthy controls were monitored with a neck-surface accelerometer-based ambulatory voice monitor. A logistic regression model was trained on an expert-labeled ground-truth data set from 15 subject pairs, with fundamental frequency and autocorrelation peak amplitude as input features. The classifier achieved an overall accuracy of 94.2% on the held-out test set.
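A two-feature logistic regression of the kind described here can be sketched in a few lines. The example below is only an illustration under stated assumptions, not the study's implementation: the frames are synthetic, the class means and spreads are invented placeholders, and the helper names (`make_synthetic_frames`, `train_logistic_regression`, and so on) are hypothetical. It also splits train and test data by frame for brevity, whereas an evaluation like the one reported above would hold out whole subject pairs.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_scaler(rows):
    """Column means and standard deviations, estimated on training data only."""
    n, dims = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(dims)]
    stds = [
        math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / n) or 1.0
        for j in range(dims)
    ]
    return means, stds


def scale(rows, means, stds):
    """Standardize each feature so gradient descent is well conditioned."""
    return [[(r[j] - means[j]) / stds[j] for j in range(len(means))] for r in rows]


def train_logistic_regression(X, y, lr=0.5, epochs=500):
    """Batch gradient descent on the mean logistic (cross-entropy) loss."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(X, w, b):
    """Label 1 (singing) when the predicted probability is at least 0.5."""
    return [
        1 if sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) >= 0.5 else 0
        for xi in X
    ]


def make_synthetic_frames(n, rng):
    """Synthetic [f0 in Hz, autocorrelation peak] frames.

    ASSUMPTION: all distribution parameters are invented placeholders,
    not measurements from the study.
    """
    frames, labels = [], []
    for _ in range(n):
        singing = rng.random() < 0.5
        if singing:
            frame = [rng.gauss(330, 60), rng.gauss(0.90, 0.04)]
        else:
            frame = [rng.gauss(190, 40), rng.gauss(0.75, 0.08)]
        frames.append(frame)
        labels.append(1 if singing else 0)
    return frames, labels


rng = random.Random(0)
X_train, y_train = make_synthetic_frames(400, rng)
X_test, y_test = make_synthetic_frames(200, rng)

means, stds = fit_scaler(X_train)
w, b = train_logistic_regression(scale(X_train, means, stds), y_train)
y_pred = predict(scale(X_test, means, stds), w, b)

accuracy = sum(p == t for p, t in zip(y_pred, y_test)) / len(y_test)
print(f"held-out accuracy on synthetic data: {accuracy:.3f}")
```

The scaler is fitted on the training frames only and then reused for the test frames, so no information from the held-out data leaks into training. The printed accuracy reflects the invented synthetic distributions and says nothing about the 94.2% figure reported for the real recordings.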
