Voice signal features analysis and classification: looking for new diseases related parameters

Dysphonia is an often-underestimated vocal tract problem caused by irregular vocal cords vibration. It has been proved that dysphonia can be considered as a symptom of vocal tract diseases; also, voice anomalies represent neurological or neurodegenerative diseases triggers. Indeed, dysarthric voice patterns are studied as early detection for Parkinson's syndrome. Voice acoustic features can thus be extracted from simple vocalism (or short reading) and considered as diseases indicators. The interest is also improved thank to the possibility of using mobile and/or portable technologies to analyze voice parameters in a background manner. Nevertheless, voice can be considered as a personal signature, thus the identification of a general purpose features identification is still an open issue. In this work we focus on identifying voice signal features by using a comparative study of different classifiers. The target is to identify frequencies values and noise indexes (or combination of them) to be used as general purpose indicators, aiming to discriminate between healthy and pathological voices. The work is part of a larger project on voice analysis aiming to define a mobile and reliable based system allowing the acquisition and features extraction of vocal signals to be used as healthy monitor.

[1]  João Paulo Teixeira,et al.  Jitter, Shimmer and HNR Classification within Gender, Tones and Vowels in Healthy Voices , 2014 .

[2]  J. A. Wilson,et al.  Is an organic/functional distinction psychologically meaningful in patients with dysphonia? , 1999, Journal of psychosomatic research.

[3]  Carlos Dias Maciel,et al.  Wavelet time-frequency analysis and least squares support vector machines for the identification of voice disorders , 2007, Comput. Biol. Medicine.

[4]  Sabine Skodda,et al.  Acoustic analysis of speech as a promising instrument for monitoring and differential diagnosis of Parkinsons's disease , 2011, MAVEBA.

[5]  Roland Linder,et al.  Artificial neural network-based classification to screen for dysphonia using psychoacoustic scaling of acoustic voice features. , 2008, Journal of voice : official journal of the Voice Foundation.

[6]  Y. Qi,et al.  Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. , 1997, The Journal of the Acoustical Society of America.

[7]  Thomas Murry,et al.  Development and Validation of the Voice Handicap Index‐10 , 2004, The Laryngoscope.

[8]  Ronald J. Baken,et al.  Clinical measurement of speech and voice , 1987 .

[9]  H. Kasuya,et al.  Normalized noise energy as an acoustic measure to evaluate pathologic voice. , 1986, The Journal of the Acoustical Society of America.

[10]  Joseph C. Stemple,et al.  Clinical Voice Pathology: Theory and Management , 1984 .

[11]  Claudia Manfredi,et al.  Adaptive noise energy estimation in pathological speech signals , 2000, IEEE Transactions on Biomedical Engineering.

[12]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[13]  Robert E. Hillman,et al.  Mobile Voice Health Monitoring Using a Wearable Accelerometer Sensor and a Smartphone Platform , 2012, IEEE Transactions on Biomedical Engineering.

[14]  P. Boersma ACCURATE SHORT-TERM ANALYSIS OF THE FUNDAMENTAL FREQUENCY AND THE HARMONICS-TO-NOISE RATIO OF A SAMPLED SOUND , 1993 .

[15]  Steven L. Salzberg,et al.  Book Review: C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993 , 1994, Machine Learning.

[16]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[17]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[18]  Jianglin Wang,et al.  Vocal Folds Disorder Detection using Pattern Recognition Methods , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19]  Claudio Storck,et al.  Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. , 2011, Journal of voice : official journal of the Voice Foundation.

[20]  V. Sellam,et al.  Classification of Normal and Pathological Voice Using SVM and RBFNN , 2014 .

[21]  A. Carullo,et al.  A low-cost platform for voice monitoring , 2013, 2013 IEEE International Instrumentation and Measurement Technology Conference (I2MTC).

[22]  M A Birchall,et al.  Reliability of OperaVOX against Multidimensional Voice Program (MDVP) , 2015, Clinical otolaryngology : official journal of ENT-UK ; official journal of Netherlands Society for Oto-Rhino-Laryngology & Cervico-Facial Surgery.

[23]  Emily Mower Provost,et al.  Ecologically valid long-term mood monitoring of individuals with bipolar disorder using speech , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[25]  Yu Zhang,et al.  Objective Acoustic Analysis of Pathological Voices from Patients with Vocal Nodules and Polyps , 2009, Folia Phoniatrica et Logopaedica.

[26]  Karthikeyan Umapathy,et al.  Discrimination of pathological voices using a time-frequency approach , 2005, IEEE Transactions on Biomedical Engineering.

[27]  B Boyanov,et al.  Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[28]  Tim Ritchings,et al.  Pathological voice quality assesment using artificial neural networks , 2001, MAVEBA.

[29]  Mun Kyung Sunwoo,et al.  Depression and voice handicap in Parkinson disease , 2014, Journal of the Neurological Sciences.

[30]  Jeff Pepper,et al.  Usage results of a mobile app for managing urinary incontinence. , 2015, The Journal of urology.

[31]  Tong Liu,et al.  Author's Personal Copy Biomedical Signal Processing and Control Effective Detection of Parkinson's Disease Using an Adaptive Fuzzy K-nearest Neighbor Approach , 2022 .

[32]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[33]  C. Newman,et al.  The Voice Handicap Index (VHI)Development and Validation , 1997 .

[34]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[35]  Giuseppe Tradigo,et al.  A system for ubiquitous distributed acquisition of voice alteration samples through a mobile application , 2014, BCB.

[36]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[37]  Piero Tortoli,et al.  Voice quality monitoring: A portable device prototype , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[38]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[39]  Marsha J Handel,et al.  mHealth (mobile health)-using Apps for health and wellness. , 2011, Explore.

[40]  Leonardo Bocchi,et al.  A multipurpose user-friendly tool for voice analysis: Application to pathological adult voices , 2009, Biomed. Signal Process. Control..

[41]  Christopher R Watts,et al.  BMC Ear, Nose and Throat Disorders BioMed Central BMC 1 2001, Ear, Nose and Throat Disorders , 2001 .

[42]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[43]  Mario Cannataro,et al.  Experiences of using a DSP based device for vocal signal analysis , 2011, MAVEBA.

[44]  Stefan Hadjitodorov,et al.  ACOUSTIC ANALYSIS OF PATHOLOGICAL VOICES , 1997 .

[45]  Kumara Shama,et al.  Study of Harmonics-to-Noise Ratio and Critical-Band Energy Spectrum of Speech as Acoustic Indicators of Laryngeal and Voice Pathology , 2007, EURASIP J. Adv. Signal Process..

[46]  Mario Cannataro,et al.  Early detection of voice diseases via a web-based system , 2007, MAVEBA.

[47]  Max A. Little,et al.  Novel Speech Signal Processing Algorithms for High-Accuracy Classification of Parkinson's Disease , 2012, IEEE Transactions on Biomedical Engineering.

[48]  Berit Schneider-Stickler,et al.  Biofeedback on voice use in call center agents in order to prevent occupational voice disorders. , 2012, Journal of voice : official journal of the Voice Foundation.

[49]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.