Zero Frequency Filter Based Analysis of Voice Disorders

Pitch period and amplitude perturbations are widely used parameters to discriminate normal and voice disorder speech. Instantaneous pitch period and amplitude of glottal vibrations directly from the speech waveform may not give an accurate estimation of jitter and shimmer. In this paper, the significance of epochs (glottal closure instants) and strength of excitation (SoE) derived from the zero-frequency filter (ZFF) are exploited to discriminate the voice disorder and normal speech. Pitch epoch derived from ZFF is used to compute the jitter, and SoE derived around each epoch is used compute the shimmer. The derived epoch-based features are analyzed on the some of the voice disorders like Parkinson’s disease, vocal fold paralysis, cyst, and gastroesophageal reflux disease. The significance of proposed epoch-based features for discriminating normal and pathological voices is analyzed and compared with the state-of-the-art methods using a support vector machine classifier. The results show that epoch-based features performed significantly better than other methods both in clean and noisy conditions.

[1]  S. R. Mahadeva Prasanna,et al.  Detection of Glottal Activity Using Different Attributes of Source Information , 2015, IEEE Signal Processing Letters.

[2]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments from text-dependent running speech , 2009, Biomed. Signal Process. Control..

[3]  Yannis Stylianou,et al.  Spectral jitter modeling and estimation , 2009, Biomed. Signal Process. Control..

[4]  C. Hartnick,et al.  Pediatric voice analysis: comparison of 2 computerized analysis systems. , 2014, JAMA otolaryngology-- head & neck surgery.

[5]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[6]  Mohammad Pooyan,et al.  Identification of voice disorders using long-time features and support vector machine with different feature reduction methods. , 2011, Journal of voice : official journal of the Voice Foundation.

[7]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[8]  Bayya Yegnanarayana,et al.  Event-Based Instantaneous Fundamental Frequency Estimation From Speech Signals , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Philip de Chazal,et al.  Telephony-based voice pathology assessment using automated speech analysis , 2006, IEEE Transactions on Biomedical Engineering.

[10]  Yannis Stylianou,et al.  Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech , 2009, Folia Phoniatrica et Logopaedica.

[11]  Bayya Yegnanarayana,et al.  Characterization of Glottal Activity From Speech Signals , 2009, IEEE Signal Processing Letters.

[12]  Bayya Yegnanarayana,et al.  Epoch Extraction From Speech Signals , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[14]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[15]  Pedro Gómez-Vilda,et al.  The effectiveness of the glottal to noise excitation ratio for the screening of voice disorders. , 2010, Journal of voice : official journal of the Voice Foundation.

[16]  Patrick A. Naylor,et al.  Detection of Glottal Closure Instants From Speech Signals: A Quantitative Review , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  J R Orozco-Arroyave,et al.  Automatic detection of Parkinson's disease in running speech spoken in three different languages. , 2016, The Journal of the Acoustical Society of America.

[18]  P. Lieberman Some Acoustic Measures of the Fundamental Periodicity of Normal and Pathologic Larynges , 1963 .

[19]  Ghulam Muhammad,et al.  An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification. , 2017, Journal of voice : official journal of the Voice Foundation.

[20]  K. T. Deepak,et al.  Speech and EGG polarity detection using Hilbert Envelope , 2015, TENCON 2015 - 2015 IEEE Region 10 Conference.