Generating 3D spatio-temporal models of vocal folds vibrations from high speed digital imaging

The oscillation process of the vocal folds that controls the air flow from the lungs is the underlying mechanism of human voice. Quantitative and qualitative voice assessment, allows for early detection of pathological changes of the vocal folds. In this study we focus on the analysis of high speed image sequences of the larynx during phonation. Using open-source tools such as Insight Segmentation and Registration Toolkit (ITK) and Visualization Toolkit (VTK), we have developed a methodology for automatic segmentation of the vocal folds edges movements and generation of a 3D spatio-temporal representation. The proposed geometrical model enables to calculate time indices (i.e. the opening and closing quotients and the proposed parameter named the Closure Difference Index (CDI)) for quantification of the oscillation process. This study, shows that the established geometrical model can contribute to more objective and accurate diagnosis method of voice disorders.

[1]  C.A.J. Prescott,et al.  Diagnosis and treatment of voice disorders , 1997 .

[2]  Rubén Fraile Muñoz,et al.  Segmentation of the glottal space from laryngeal images using the watershed transform , 2008 .

[3]  Christophe d'Alessandro,et al.  Automatic glottal segmentation using local-based active contours and application to glottovibrography , 2012, Speech Commun..

[4]  Chi Zhu,et al.  Automatic tracing of vocal-fold motion from high-speed digital images , 2006, IEEE Transactions on Biomedical Engineering.

[5]  Michael Döllinger,et al.  Phonovibrography: Mapping High-Speed Movies of Vocal Fold Vibrations Into 2-D Diagrams for Visualizing and Analyzing the Underlying Laryngeal Dynamics , 2008, IEEE Transactions on Medical Imaging.

[6]  Ken-Ichi Sakakibara,et al.  Age- and gender-related difference of vocal fold vibration and glottal configuration in normal speakers: analysis with glottal area waveform. , 2014, Journal of voice : official journal of the Voice Foundation.

[7]  Krzysztof Izdebski,et al.  Mucosal wave characteristics in three voice modes (fry, hiss & overpressure) produced by a female speaker: a preliminary study using stroboscopy, HSDI and analyzed by kymography, P-FFT & Nyquist plots , 2012, Photonics West - Biomedical Optics.

[8]  María J. Ledesma-Carbayo,et al.  Optimal multiresolution 3D level-set method for liver segmentation incorporating local curvature constraints , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[9]  Marimuthu Palaniswami,et al.  Estimation of vocal fold plane in 3D CT images for diagnosis of vocal fold abnormalities , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[10]  Mariola Sliwinska-Kowalska,et al.  Quantitative assessment of videolaryngostroboscopic images in patients with glottic pathologies , 2017, Logopedics, phoniatrics, vocology.

[11]  Robert E. Hillman,et al.  Relationships Between Vocal Function Measures Derived from an Acoustic Microphone and a Subglottal Neck-Surface Accelerometer , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[12]  Manal Abdel Wahed Computer aided recognition of pathological voice , 2014, 2014 31st National Radio Science Conference (NRSC).

[13]  Ron Goldman,et al.  Curvature formulas for implicit curves and surfaces , 2005, Comput. Aided Geom. Des..

[14]  Michael Döllinger,et al.  Three-Dimensional Optical Reconstruction of Vocal Fold Kinematics Using High-Speed Video With a Laser Projection System , 2015, IEEE Transactions on Medical Imaging.

[15]  M. Döllinger,et al.  Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: Kymographic data from normal subjects , 2013, Logopedics, phoniatrics, vocology.

[16]  Pawel Strumillo,et al.  Application of Mel Cepstral processing and Support Vector Machines for diagnosing vocal disorders from voice recordings , 2011, Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2011.

[17]  A. Skalski,et al.  Analysis of vocal folds movement in high speed videoendoscopy based on level set segmentation and image registration , 2008, 2008 International Conference on Signals and Electronic Systems.

[18]  Dimitar D Deliyski,et al.  Utility of Laryngeal High-speed Videoendoscopy in Clinical Voice Assessment. , 2017, Journal of voice : official journal of the Voice Foundation.