Quantitative assessment of videolaryngostroboscopic images in patients with glottic pathologies

Abstract Introduction: Digital imaging techniques enable exploration of novel visualization modalities of the vocal folds during phonation and definition of parameters, facilitating more precise diagnosis of voice disorders. Aim: Application of computer vision algorithms for analysis of videolaryngostroboscopic (VLS) images aimed at qualitative and quantitative description of phonatory vibrations. Materials and methods: VLS examinations were conducted for 45 females, including 15 subjects with vocal nodules, 15 subjects with glottal incompetence, and 15 normophonic females. The recorded VLS images were preprocessed, the glottis area was segmented out, and the glottal cycles were identified. The glottovibrograms were built, and then the glottal area waveforms (GAW) were quantitatively described by computing the following parameters: open quotient (OQ), closing quotient (CQ), speed quotient (SQ), minimal relative glottal area (MRGA), and a new parameter termed closure difference index (CDI). Results: Profiles of the glottal widths assessed along the glottal length differentiated the study groups (P < 0.001). Moreover, it was shown that the OQ, CQ, CDI, and MRGA indices can be considered as viable parameters for quantifying kinematics of the vocal folds for normophonic subjects and patients with diagnosed vocal nodules and glottal incompetence (P < 0.001). Conclusions: Computer image processing and analysis methods applied to videolaryngostroboscopic images allow for their quantitative assessment. Computation of the size-related and time-related parameters characterizing glottic pathologies is of interest for evidence-based voice diagnostics. Results of the performed ROC curve analysis suggest that the evaluated parameters can distinguish patients with voice disorders from normophonic subjects.

[1]  H. K. Schutte,et al.  Videokymography in Voice Disorders: What to Look For? , 2007, The Annals of otology, rhinology, and laryngology.

[2]  A. Alwan,et al.  Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation. , 2012, The Journal of the Acoustical Society of America.

[3]  Melda Kunduk,et al.  Variability of normal vocal fold dynamics for different vocal loading in one healthy subject investigated by phonovibrograms. , 2009, Journal of voice : official journal of the Voice Foundation.

[4]  Andrew J. McWhorter,et al.  Objective quantification of pre- and postphonosurgery vocal fold vibratory characteristics using high-speed videoendoscopy and a harmonic waveform model. , 2014, Journal of speech, language, and hearing research : JSLHR.

[5]  K. Hajian‐Tilaki,et al.  Receiver Operating Characteristic (ROC) Curve Analysis for Medical Diagnostic Test Evaluation. , 2013, Caspian journal of internal medicine.

[6]  H. K. Schutte,et al.  Videokymography: high-speed line scanning of vocal fold vibration. , 1996, Journal of voice : official journal of the Voice Foundation.

[7]  B Hammarberg,et al.  Glottal closure, transglottal airflow, and voice quality in healthy middle-aged women. , 1995, Journal of voice : official journal of the Voice Foundation.

[8]  Peak Woo,et al.  Objective measures of laryngeal imaging: what have we learned since Dr. Paul Moore. , 2014, Journal of voice : official journal of the Voice Foundation.

[9]  Jack J Jiang,et al.  Mucosal wave measurement and visualization techniques. , 2011, Journal of voice : official journal of the Voice Foundation.

[10]  Ken-Ichi Sakakibara,et al.  Age- and gender-related difference of vocal fold vibration and glottal configuration in normal speakers: analysis with glottal area waveform. , 2014, Journal of voice : official journal of the Voice Foundation.

[11]  R. Colton,et al.  Evaluation of stroboscopic signs. , 2011, Journal of voice : official journal of the Voice Foundation.

[12]  Antoine Giovanni,et al.  Objective Voice Analysis in Dysphonic Patients: New Data Including Nonlinear Measurements , 2006, Folia Phoniatrica et Logopaedica.

[13]  F. Liang,et al.  Correlation among the dysphonia severity index (DSI), the RBH voice perceptual evaluation, and minimum glottal area in female patients with vocal fold nodules. , 2014, Journal of voice : official journal of the Voice Foundation.

[14]  M Moerman,et al.  Implementation of the European Laryngological Society (ELS) basic protocol for assessing voice treatment effect. , 2003, Revue de laryngologie - otologie - rhinologie.

[15]  Tolga Çiloglu,et al.  Automatic Segmentation of High Speed Video Images of Vocal Folds , 2014, J. Appl. Math..

[16]  Johan Sundberg,et al.  Three registers in an untrained female singer analyzed by videokymography, strobolaryngoscopy and sound spectrography. , 2008, The Journal of the Acoustical Society of America.

[17]  N. Tayama,et al.  Vocal Fold Vibration in Vocal Fold Atrophy: Quantitative Analysis With High-Speed Digital Imaging. , 2015, Journal of voice : official journal of the Voice Foundation.

[18]  C.A.J. Prescott,et al.  Diagnosis and treatment of voice disorders , 1997 .

[19]  C. Bohr,et al.  Spatiotemporal analysis of high-speed videolaryngoscopic imaging of organic pathologies in males. , 2014, Journal of speech, language, and hearing research : JSLHR.

[20]  Michael R. Smith,et al.  Evaluation of Several Nonparametric Bootstrap Methods to Estimate Confidence Intervals for Software Metrics , 2003, IEEE Trans. Software Eng..

[21]  Christophe d'Alessandro,et al.  Automatic glottal segmentation using local-based active contours and application to glottovibrography , 2012, Speech Commun..

[22]  F. de Jong An Introduction to the Teacher’s Voice in a Biopsychosocial Perspective , 2010, Folia Phoniatrica et Logopaedica.

[23]  Terri Treman Gerlach,et al.  Vocal fold phase asymmetries in patients with voice disorders: a study across visualization techniques. , 2012, American journal of speech-language pathology.

[24]  Yu Zhang,et al.  Extracting Physiologically Relevant Parameters of Vocal Folds From High-Speed Video Image Series , 2007, IEEE Transactions on Biomedical Engineering.

[25]  Yuling Yan,et al.  Tracing vocal fold vibrations using level set segmentation method , 2015, International journal for numerical methods in biomedical engineering.

[26]  Rita R. Patel,et al.  Measurement of glottal cycle characteristics between children and adults: physiological variations. , 2014, Journal of voice : official journal of the Voice Foundation.

[27]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[28]  Karen M. K. Chan,et al.  Quantitative high-speed laryngoscopic analysis of vocal fold vibration in fatigued voice of young karaoke singers. , 2013, Journal of voice : official journal of the Voice Foundation.

[29]  Dimitar D Deliyski,et al.  Laryngeal High-Speed Videoendoscopy: Rationale and Recommendation for Accurate and Consistent Terminology. , 2015, Journal of speech, language, and hearing research : JSLHR.

[30]  M. Schuster,et al.  Multiparametric analysis of vocal fold vibrations in healthy and disordered voices in high-speed imaging. , 2011, Journal of voice : official journal of the Voice Foundation.

[31]  Chung-Feng Jeffrey Kuo,et al.  Automatic recognizing of vocal fold disorders from glottis images , 2014, Proceedings of the Institution of Mechanical Engineers. Part H, Journal of engineering in medicine.

[32]  M. Döllinger,et al.  Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: Kymographic data from normal subjects , 2013, Logopedics, phoniatrics, vocology.

[33]  Juan Ignacio Godino-Llorente,et al.  Segmentation of the glottal space from laryngeal images using the watershed transform , 2008, Comput. Medical Imaging Graph..

[34]  Ken-Ichi Sakakibara,et al.  Quantitative analysis of digital videokymography: a preliminary study on age- and gender-related difference of vocal fold vibration in normal speakers. , 2015, Journal of voice : official journal of the Voice Foundation.