The Application of a Genetic Algorithm in the Noninvasive Assessment of Vocal Nodules in Children

The application of IT solutions in medicine makes it possible to develop new, more accurate, and noninvasive medical diagnostics. The aim of this study was to propose this kind of solution. It enables the accurate assessment of vocal nodules in children while measuring glottal insufficiency. The input data includes voice and electroglottographic recordings of patients’ voices as well as diagnoses made by practitioners. The recordings were parameterized and used to develop a classifier to assess glottal insufficiency of vocal nodules. The classifier was designed with the help of a genetic algorithm. The diagnoses established thanks to the classifier show a 92% agreement with those reached through medical examination. Such effective performance renders the classifier a useful noninvasive screening tool. We compared our method with Deep Neural Network classifier and the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) evolutionary algorithm. The solution that we propose offers a more accurate continuous diagnosis in comparison with the discrete diagnosis of a deep neural network as well as greater accuracy in relation to the CMA-ES algorithm. Another advantage of the proposed solution is the ease with which it can be implemented by healthcare professionals. A Visual Basic for the Applications (VBA) code for LibreOffice macro for the classifier is attached at the end of this paper.

[1]  G. Fant Dept. for Speech, Music and Hearing Quarterly Progress and Status Report the Lf-model Revisited. Transformations and Frequency Domain Analysis the Lf-model Revisited. Transformations and Frequency Domain Analysis* , 2022 .

[3]  M. Cowan,et al.  Airway management in children with mucopolysaccharidoses. , 2009, Archives of otolaryngology--head & neck surgery.

[4]  Mohamed Fezari,et al.  Acoustic Analysis for Detection of Voice Disorders Using Adaptive Features and Classifiers , 2014 .

[5]  Hariharan Muthusamy,et al.  Optimal Selection of Long Time Acoustic Features Using GA for the Assessment of Vocal Fold Disorders , 2012 .

[6]  P. Alku,et al.  Normalized amplitude quotient for parametrization of the glottal flow. , 2002, The Journal of the Acoustical Society of America.

[7]  Paavo Alku,et al.  Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range , 2002, IEEE Trans. Speech Audio Process..

[8]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  R. Martins,et al.  Perceptual and acoustic parameters of vocal nodules in children. , 2014, International journal of pediatric otorhinolaryngology.

[10]  D N Sorenson,et al.  A fundamental frequency investigation of children ages 6-10 years old. , 1989, Journal of communication disorders.

[11]  Babak Seyed Aghazadeh,et al.  Optimal feature selection for the assessment of vocal fold disorders , 2009, Comput. Biol. Medicine.

[12]  Nikolaus Hansen,et al.  A restart CMA evolution strategy with increasing population size , 2005, 2005 IEEE Congress on Evolutionary Computation.

[13]  Muhammad Ghulam,et al.  Voice pathology detection using interlaced derivative pattern on glottal source excitation , 2017, Biomed. Signal Process. Control..

[14]  Nikolaus Hansen,et al.  The CMA Evolution Strategy: A Comparing Review , 2006, Towards a New Evolutionary Computation.

[15]  D G Childers,et al.  Vocal quality factors: analysis, synthesis, and perception. , 1991, The Journal of the Acoustical Society of America.

[16]  I R Titze,et al.  Vocal intensity in speakers and singers. , 1991, The Journal of the Acoustical Society of America.

[17]  E Abberton,et al.  First applications of a new laryngograph. , 1971, Medical & biological illustration.

[18]  John Kane,et al.  Identifying Regions of Non-Modal Phonation Using Features of the Wavelet Transform , 2011, INTERSPEECH.

[19]  Meisam Khalil Arjmandi,et al.  An efficient voice pathology classification scheme based on applying multi-layer linear discriminant analysis to wavelet packet-based features , 2014, Biomed. Signal Process. Control..

[20]  H. K. Schutte,et al.  Development and Application of Videokymography for High-Speed Examination of Vocal-Fold Vibration , 2002 .

[21]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. , 1996, Journal of speech and hearing research.

[22]  A. Laukkanen,et al.  Electroglottographic contact quotient in different phonation types using different amplitude threshold levels , 2012, Logopedics, phoniatrics, vocology.

[23]  R. Gubrynowicz,et al.  Analysis of voice quality in patients with late-onset Pompe disease , 2016, Orphanet Journal of Rare Diseases.

[24]  Nikhil R. Pal,et al.  Genetic programming for simultaneous feature selection and classifier design , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[25]  Zbigniew Michalewicz,et al.  Genetic algorithms + data structures = evolution programs (2nd, extended ed.) , 1994 .

[26]  PAAVO ALKU,et al.  Glottal inverse filtering analysis of human voice production — A review of estimation and parameterization methods of the glottal excitation and their applications , 2011 .

[27]  R. Nuss,et al.  Correlation of Vocal Fold Nodule Size in Children and Perceptual Assessment of Voice Quality , 2010, The Annals of otology, rhinology, and laryngology.

[28]  E. Silverman,et al.  Incidence of chronic hoarseness among school-age children. , 1975, The Journal of speech and hearing disorders.

[29]  G. Lindsey,et al.  Toward the Quantification of Vocal Efficiency , 1990 .

[30]  Axel Röbel,et al.  Phase Minimization for Glottal Model Estimation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[31]  M. Johns Update on the etiology, diagnosis, and treatment of vocal fold nodules, polyps, and cysts , 2003, Current opinion in otolaryngology & head and neck surgery.

[32]  John Kane,et al.  Wavelet Maxima Dispersion for Breathy to Tense Voice Discrimination , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[33]  M. Elif Karsligil,et al.  Classification of laryngeal disorders based on shape and vascular defects of vocal folds , 2015, Comput. Biol. Medicine.

[34]  Paavo Alku,et al.  Parabolic spectral parameter - A new method for quantification of the glottal flow , 1997, Speech Commun..

[35]  R. Nuss,et al.  A grading scale for pediatric vocal fold nodules , 2007, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[36]  J. Liljencrants,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report a Four-parameter Model of Glottal Flow , 2022 .

[37]  T. Hacki [Classification of glottal dysfunctions on the basis of electroglottography]. , 1989, Folia phoniatrica.

[38]  Axel Röbel,et al.  Function of Phase-Distortion for glottal model estimation , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[39]  R. Nuss,et al.  Validation of a Pediatric Vocal Fold Nodule Rating Scale Based on Digital Video Images , 2012, The Annals of otology, rhinology, and laryngology.

[40]  Adnane Cherif,et al.  Artificial Neural Networks and Support Vector Machine for Voice Disorders Identification , 2016 .

[41]  E. Okur,et al.  The prevalence of vocal fold nodules in school age children. , 2004, International journal of pediatric otorhinolaryngology.

[42]  Assessment of effectiveness of acoustic analysis of voice for monitoring the evolution of vocal nodules after vocal treatment , 2014, European Archives of Oto-Rhino-Laryngology.

[43]  J. Stemple,et al.  Description of laryngeal pathologies in children evaluated by otolaryngologists. , 1990, The Journal of speech and hearing disorders.

[44]  D. Howard Variation of electrolaryngographically derived closed quotient for trained and untrained adult female singers. , 1995, Journal of voice : official journal of the Voice Foundation.

[45]  A. Tylki-Szymańska,et al.  Follow-up analysis of voice quality in patients with late-onset Pompe disease , 2018, Orphanet Journal of Rare Diseases.

[46]  Joseana Macêdo Fechine,et al.  Feature Estimation for Vocal Fold Edema Detection Using Short-Term Cepstral Analysis , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[47]  F L Wuyts,et al.  Evolution of vocal fold nodules from childhood to adolescence. , 2007, Journal of voice : official journal of the Voice Foundation.

[48]  R. Gubrynowicz,et al.  Voice alterations in patients with Morquio A syndrome , 2017, Journal of Applied Genetics.

[49]  A. Ysunza,et al.  Voice parameters and videonasolaryngoscopy in children with vocal nodules: a longitudinal study, before and after voice therapy. , 2012, International journal of pediatric otorhinolaryngology.

[50]  Ingo R. Titze,et al.  Principles of voice production , 1994 .