Voice Pathology Detection and Multi-classification Using Machine Learning Classifiers

Automatic voice pathology detection can provide objective estimation and prevention in the early stages of voice diseases. A voice pathology detection and multi-classification method using an audio feature set extracted from glottal flow waveform is proposed in this work. In addition, a feature selection method of Fisher discrimination criterion is used to screen more valuable features and eliminate the features with low efficiency. The discrimination ability and effectiveness of the selected features were verified using three dimensional scatter plot and box plot in this study. One of the contributions of this paper is to investigate and evaluate the performance of different machine learning classifiers value for voice pathology detection and multi-classification. All experiments were carried out using the Massachusetts eye and ear infirmary database. Accuracy, sensitivity, specificity and receiver operating characteristic area are used as evaluation indexes to compare the performance of different machine learning classifiers. Each machine learning classifier had a good performance using the proposed feature set with nearly 100% accuracy in detection and higher than 90% accuracy in multi-classification. The experimental results indicate that the proposed method is valuable for voice pathology detection and multi-classification, especially in multi-classification.

[1]  Edson Cataldo,et al.  Analysis and Classification of Voice Pathologies Using Glottal Signal Parameters. , 2016, Journal of voice : official journal of the Voice Foundation.

[2]  Muhammad Ghulam,et al.  Pathological voice detection and binary classification using MPEG-7 audio features , 2014, Biomed. Signal Process. Control..

[3]  Alex Burdorf,et al.  Voice disorders in teachers and their associations with work-related factors: a systematic review. , 2013, Journal of communication disorders.

[4]  S. Jothilakshmi,et al.  Automatic system to detect the type of voice pathology , 2014, Appl. Soft Comput..

[5]  Paavo Alku,et al.  HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Antanas Verikas,et al.  Fusion of voice signal information for detection of mild laryngeal pathology , 2014, Appl. Soft Comput..

[7]  Ghulam Muhammad,et al.  Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions , 2018, IEEE Access.

[8]  R. Martins,et al.  Voice disorders in teachers. A review. , 2014, Journal of voice : official journal of the Voice Foundation.

[9]  Riyanarto Sarno,et al.  Classification of Music Mood Using MPEG-7 Audio Features and SVM with Confidence Interval , 2018, Int. J. Artif. Intell. Tools.

[10]  Matti Airas,et al.  TKK Aparat: An environment for voice inverse filtering and parameterization , 2008, Logopedics, phoniatrics, vocology.

[11]  Muhammad Ghulam,et al.  Voice pathology detection using interlaced derivative pattern on glottal source excitation , 2017, Biomed. Signal Process. Control..

[12]  Joaquín Luque,et al.  Evaluation of MPEG-7-Based Audio Descriptors for Animal Voice Recognition over Wireless Acoustic Sensor Networks , 2016, Sensors.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Marcelo de Oliveira Rosa,et al.  Adaptive estimation of residue signal for voice pathology diagnosis , 2000, IEEE Trans. Biomed. Eng..

[15]  Ghulam Muhammad,et al.  Automatic voice pathology detection and classification using vocal tract area irregularity , 2016 .

[16]  Paavo Alku,et al.  Analysis and Detection of Pathological Voice Using Glottal Source Features , 2020, IEEE Journal of Selected Topics in Signal Processing.

[17]  Ömer Eskidere,et al.  Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features , 2015, Comput. Math. Methods Medicine.