Acoustic Tracking of Pitch, Modal, and Subharmonic Vibrations of Vocal Folds in Parkinson’s Disease and Parkinsonism

The prominent and early presence of dysphonia is considered a valuable marker for differentiation of idiopathic Parkinson’s disease and parkinsonian syndromes. Objective quantification of vibrational regimes represented by modal and subharmonic vibrations may thus be vital for improving accuracy of diagnostic decision. The rationale for analyzing vibrational regimes is that abnormal subharmonic vibrations might be the key factor causing dysphonia in parkinsonian syndromes. This study introduces a new fully automated methodology based on robust pitch tracker for decoupling vibrations controlled by laryngeal muscles from the effect of subharmonics that provides distinguishing features of Parkinson’s disease and atypical parkinsonian syndromes. We tested the method on resynthesized signals with known parameters and demonstrated that vibrations controlled by laryngeal muscles as well as subharmonics can be detected reliably with a precision that outperforms available technologies. We analysed 337 sustained vowels of 22 patients with PD, 21 patients with multiple system atrophy, 18 patients with progressive supranuclear palsy and 22 healthy controls. Our results showed that subharmonics are more prominent in atypical parkinsonian syndromes compared to Parkinson’s disease. Also, increased modulation by laryngeal muscles appears to be a distinctive symptom of multiple system atrophy. Developed algorithm and proposed resynthesized voice signals provide further critical step to understanding and evaluation of dysphonia in Parkinsonism.

[1]  Heiga Zen,et al.  Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis , 2016, SSW.

[2]  Masanori Morise,et al.  Harvest: A High-Performance Fundamental Frequency Estimator from Speech Signals , 2017, INTERSPEECH.

[3]  D. J. Hermes,et al.  Measurement of pitch by subharmonic summation. , 1988, The Journal of the Acoustical Society of America.

[4]  Ingo R. Titze,et al.  Principles of voice production , 1994 .

[5]  S. Hertegård,et al.  Freddie Mercury—acoustic analysis of speaking fundamental frequency, vibrato, and subharmonics , 2017, Logopedics, phoniatrics, vocology.

[6]  Raymond D. Kent The MIT Encyclopedia of Communication Disorders , 2003 .

[7]  Evžen Růžička,et al.  Quantitative assessment of motor speech abnormalities in idiopathic rapid eye movement sleep behaviour disorder. , 2016, Sleep medicine.

[8]  Yi Xu,et al.  Perceived pitch of synthesized voice with alternate cycles. , 2002, Journal of voice : official journal of the Voice Foundation.

[9]  Hermann Ackermann,et al.  Gender-Specific Vocal Dysfunctions in Parkinson's Disease: Electroglottographic and Acoustic Analyses , 1995, The Annals of otology, rhinology, and laryngology.

[10]  D G Jamieson,et al.  A comparison of high precision F0 extraction algorithms for sustained vowels. , 1999, Journal of speech, language, and hearing research : JSLHR.

[11]  David Talkin,et al.  A Robust Algorithm for Pitch Tracking ( RAPT ) , 2005 .

[12]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[13]  L. Ramig,et al.  Acoustic Analyses of Phonation in Patients with Huntington's Disease , 1986, The Annals of otology, rhinology, and laryngology.

[14]  P. Snyder,et al.  Acoustic characteristics of Parkinsonian speech: a potential biomarker of early disease progression and treatment , 2004, Journal of Neurolinguistics.

[15]  Y Ben-Shlomo,et al.  Prevalence of progressive supranuclear palsy and multiple system atrophy: a cross-sectional study , 1999, The Lancet.

[16]  Arturo Camacho Lozano,et al.  SWIPE: A Sawtooth Waveform Inspired Pitch Estimator for Speech and Music , 2011 .

[17]  M. Schroeder Period histogram and product spectrum: new methods for fundamental-frequency measurement. , 1968, The Journal of the Acoustical Society of America.

[18]  A. Aronson,et al.  Clusters of deviant speech dimensions in the dysarthrias. , 1969, Journal of speech and hearing research.

[19]  Mike Brookes,et al.  Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  J. Hughes,et al.  Accuracy of clinical diagnosis of idiopathic Parkinson's disease: a clinico-pathological study of 100 cases. , 1992, Journal of neurology, neurosurgery, and psychiatry.

[21]  Matti Airas,et al.  TKK Aparat: An environment for voice inverse filtering and parameterization , 2008, Logopedics, phoniatrics, vocology.

[22]  M P Karnell,et al.  Comparison of fundamental frequency and perturbation measurements among three analysis systems. , 1995, Journal of voice : official journal of the Voice Foundation.

[23]  Meysam Asgari,et al.  Improvements to harmonic model for extracting better speech features in clinical applications , 2018, Comput. Speech Lang..

[24]  Albert C. Ludolph,et al.  Disease Severity and Progression in Progressive Supranuclear Palsy and Multiple System Atrophy: Validation of the NNIPPS – PARKINSON PLUS SCALE , 2011, PloS one.

[25]  H. K. Schutte,et al.  A subharmonic vibratory pattern in normal vocal folds. , 1996, Journal of speech and hearing research.

[26]  Raymond D. Kent,et al.  Motor speech disorders : essays for Ray Kent , 2007 .

[27]  C Manfredi,et al.  A comparative analysis of fundamental frequency estimation methods with application to pathological voices. , 2000, Medical engineering & physics.

[28]  Kanru Hua Improving YANGsaf F0 Estimator with Adaptive Kalman Filter , 2017, INTERSPEECH.

[29]  Xuejing Sun,et al.  Pitch determination and voice quality analysis using Subharmonic-to-Harmonic Ratio , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[30]  J. Volkmann,et al.  Parkinson disease , 2017, Nature Reviews Disease Primers.

[31]  Abeer Alwan,et al.  Multi-band summary correlogram-based pitch detection for noisy speech , 2013, Speech Commun..

[32]  Michal Novotný,et al.  Speech disorders reflect differing pathophysiology in Parkinson’s disease, progressive supranuclear palsy and multiple system atrophy , 2015, Journal of Neurology.

[33]  J Kreiman,et al.  Comparison of voice analysis systems for perturbation measurement. , 1993, Journal of speech and hearing research.

[34]  M. Hallett,et al.  Clinical research criteria for the diagnosis of progressive supranuclear palsy (Steele-Richardson-Olszewski syndrome) , 1996, Neurology.

[35]  M. Gasior,et al.  POUR LA RECHERCHE NUCLEAIRE CERN – AB DIVISION AB-Note-2004-021 BDI IMPROVING FFT FREQUENCY MEASUREMENT RESOLUTION BY PARABOLIC AND GAUSSIAN INTERPOLATION , 2004 .

[36]  Liming Shi,et al.  Multipitch Estimation Using Block Sparse Bayesian Learning and Intra-Block Clustering , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[37]  P. Boersma ACCURATE SHORT-TERM ANALYSIS OF THE FUNDAMENTAL FREQUENCY AND THE HARMONICS-TO-NOISE RATIO OF A SAMPLED SOUND , 1993 .

[38]  Sabine Skodda,et al.  Phonatory Dysfunction as a Preclinical Symptom of Huntington Disease , 2014, PloS one.

[39]  Hideki Kawahara,et al.  Fast and Reliable F0 Estimation Method Based on the Period Extraction of Vocal Fold Vibration of Singing Voice and Speech , 2009 .

[40]  I. Titze The myoelastic aerodynamic theory of phonation , 2006 .

[41]  John G Harris,et al.  A sawtooth waveform inspired pitch estimator for speech and music. , 2008, The Journal of the Acoustical Society of America.

[42]  George R. Doddington,et al.  An integrated pitch tracking algorithm for speech systems , 1983, ICASSP.

[43]  Gari D Clifford,et al.  Robust fundamental frequency estimation in sustained vowels: detailed algorithmic comparisons and information fusion with adaptive Kalman filtering. , 2014, The Journal of the Acoustical Society of America.

[44]  Mike Brookes,et al.  PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[45]  G. Stebbins,et al.  Factor structure of the unified Parkinson's disease rating scale: Motor examination section , 1998, Movement disorders : official journal of the Movement Disorder Society.

[46]  I R Titze,et al.  Perception of pitch and roughness in vocal signals with subharmonics. , 2001, Journal of voice : official journal of the Voice Foundation.

[47]  Max A. Little,et al.  Robust Bayesian Pitch Tracking Based on the Harmonic Model , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[48]  P Sandroni,et al.  Second consensus statement on the diagnosis of multiple system atrophy , 2008, Neurology.

[49]  A. Aronson,et al.  Differential diagnostic patterns of dysarthria. , 1969, Journal of speech and hearing research.