A novel hybrid of genetic algorithm and ANN for developing a high efficient method for vocal fold pathology diagnosis

In this paper, an initial feature vector based on the combination of the wavelet packet decomposition (WPD) and the Mel frequency cepstral coefficients (MFCCs) is proposed. For optimizing the initial feature vector, a genetic algorithm (GA)-based approach is proposed and compared with the well-known principal component analysis (PCA) approach. The artificial neural network (ANN) with the different learning algorithms is used as the classifier. Some experiments are carried out for evaluating and comparing the classification accuracies which are obtained by the use of the different learning algorithms and the different feature vectors (the initial and the optimized ones). Finally, a hybrid of the ANN with the ‘trainscg’ training algorithm and the genetic algorithm is proposed for the vocal fold pathology diagnosis. Also, the performance of the proposed method is compared with the recent works. The experiments' results show a better performance (the higher classification accuracy) of the proposed method in comparison with the others.

[1]  Michael J. Harrison,et al.  Voice analysis for detection of hoarseness due to a local anesthetic procedure , 2009, 2009 3rd International Conference on Signal Processing and Communication Systems.

[2]  Jagadish Nayak,et al.  Identification of voice disorders using speech samples , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[3]  Vahid Majidnezhad,et al.  A HMM-BASED METHOD FOR VOCAL FOLD PATHOLOGY DIAGNOSIS , 2012 .

[4]  Mansour Vali,et al.  Voice disorders identification based on different feature reduction methodologies and support vector machine , 2010, 2010 18th Iranian Conference on Electrical Engineering.

[5]  Rodrigo Capobianco Guido,et al.  Discrete wavelet transform and support vector machine applied to pathological voice signals identification , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[6]  Vahid Majidnezhad,et al.  A Hybrid of Genetic Algorithm and Support Vector Machine for Feature Reduction and Detection of Vocal Fold Pathology , 2013 .

[7]  Marcelo de Oliveira Rosa,et al.  Adaptive estimation of residue signal for voice pathology diagnosis , 2000, IEEE Trans. Biomed. Eng..

[8]  H.L. Rufiner,et al.  Acoustic analysis of speech for detection of laryngeal pathologies , 2000, Proceedings of the 22nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Cat. No.00CH37143).

[9]  M. Hariharan,et al.  Identification of vocal fold pathology based on Mel Frequency Band Energy Coefficients and singular value decomposition , 2009, 2009 IEEE International Conference on Signal and Image Processing Applications.

[10]  Abeer Alwan,et al.  The glottaltopogram: A method of analyzing high-speed images of the vocal folds , 2014, Comput. Speech Lang..

[11]  Kimiko Tsukada,et al.  An acoustic comparison of vowel length contrasts in Arabic, Japanese and Thai: Durational and spectral data , 2009, Int. J. Asian Lang. Process..

[12]  M. Hariharan,et al.  Diagnosis of vocal fold pathology using time-domain features and systole activated neural network , 2009, 2009 5th International Colloquium on Signal Processing & Its Applications.

[13]  G. Vaziri,et al.  Pathological Assessment of Vocal Fold Nodules and Polyp via Fractal Dimension of Patients' Voices , 2008, 2008 2nd International Conference on Bioinformatics and Biomedical Engineering.

[14]  J. Lohscheller Towards evidence based diagnosis of voice disorders using phonovibrograms , 2009, 2009 2nd International Symposium on Applied Sciences in Biomedical and Communication Technologies.

[15]  Joseana Macêdo Fechine,et al.  Pathological voice discrimination using cepstral analysis, vector quantization and Hidden Markov Models , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.

[16]  John H. L. Hansen,et al.  A screening test for speech pathology assessment using objective quality measures , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[17]  Vahid Majidnezhad,et al.  An ANN-based Method for Detecting Vocal Fold Pathology , 2013, ArXiv.

[18]  R. J. Feder,et al.  Videostroboscopic evaluation of the larynx. , 1987, Ear, nose, & throat journal.

[19]  Vahid Majidnezhad,et al.  The SVM-Based Feature Reduction in Vocal Fold Pathology Diagnosis , 2013 .

[20]  Muhammad Ghulam,et al.  Automatic voice disorder classification using vowel formants , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[21]  P Kitzing,et al.  Glottography, the electrophysiological investigation of phonatory biomechanics. , 1986, Acta oto-rhino-laryngologica Belgica.

[22]  D. Mehta,et al.  Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy. , 2011, The Journal of the Acoustical Society of America.

[23]  Zeinab Mahmoudi,et al.  Classification of voice disorder in children with cochlear implantation and hearing aid using multiple classifier fusion , 2011, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010).

[24]  Jacqueline Vaissière,et al.  On the Acoustic and Perceptual Characterization of Reference Vowels in a Cross-language Perspective , 2011, ICPhS.

[25]  Claudia Manfredi,et al.  Adaptive noise energy estimation in pathological speech signals , 2000, IEEE Transactions on Biomedical Engineering.

[26]  Jianglin Wang,et al.  Vocal Folds Disorder Detection using Pattern Recognition Methods , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[27]  F. Almasganj,et al.  Comparison of neural networks and support vector machines applied to optimized features extracted from patients' speech signal for classification of vocal fold inflammation , 2005, Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005..

[28]  Rodney A. Kennedy,et al.  Band-limited signal concentration in time-frequency , 2009, 2009 3rd International Conference on Signal Processing and Communication Systems.

[29]  Minsoo Hahn,et al.  Pathological Voice Detection Using Efficient Combination of Heterogeneous Features , 2008, IEICE Trans. Inf. Syst..

[30]  Carlos Dias Maciel,et al.  Support vector machines and wavelets for voice disorder sorting , 2006, 2006 Proceeding of the Thirty-Eighth Southeastern Symposium on System Theory.

[31]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[32]  Saeed Rahati Quchani,et al.  Classification of voice disorders in children with cochlear implantation and hearing aid using multiple classifier fusion , 2010, ISSPA.

[33]  Minsoo Hahn,et al.  Automatic Assessment of Pathological Voice Quality Using Higher-Order Statistics in the LPC Residual Domain , 2009, EURASIP J. Adv. Signal Process..

[34]  Farshad Almasganj,et al.  Pathological Assesment of Vocal Fold Nodules and Polyp Using Accoustic Perturbation and Phase Space Features , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[35]  R. Martinez,et al.  Spectral perturbation parameters for voice pathology detection , 2005, International Symposium on Signals, Circuits and Systems, 2005. ISSCS 2005..

[36]  T. Ananthakrishna,et al.  k-means nearest neighbor classifier for voice pathology , 2004, Proceedings of the IEEE INDICON 2004. First India Annual Conference, 2004..

[37]  L. Gavidia-Ceballos,et al.  A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment , 1998, IEEE Transactions on Biomedical Engineering.

[38]  Paulo César Cortez,et al.  Wavelet transform and artificial neural networks applied to voice disorders identification , 2011, 2011 Third World Congress on Nature and Biologically Inspired Computing.

[39]  Yannis Stylianou,et al.  Voice Pathology Detection and Discrimination Based on Modulation Spectral Features , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[40]  Thierry Dutoit,et al.  Phase-based information for voice pathology detection , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[41]  Vahid Majidnezhad,et al.  A Novel Method for Feature Extraction in Vocal Fold Pathology Diagnosis , 2012, MobiHealth.

[42]  Yannis Stylianou,et al.  Dysphonia detection based on modulation spectral features and cepstral coefficients , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[43]  Tao Li,et al.  Discrimination of severely noisy pathological voice with spectral slope and HNR , 2004, Proceedings 7th International Conference on Signal Processing, 2004. Proceedings. ICSP '04. 2004..

[44]  Miguel Angel Ferrer-Ballester,et al.  Automatic Detection of Pathologies in The Voice by HOS Based Parameters , 2001, EURASIP J. Adv. Signal Process..

[45]  G. Castellanos-Dominguez,et al.  A new approach to discriminative HMM training for pathological voice classification , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[46]  L. Gavidia-Ceballos,et al.  Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection , 1996, IEEE Transactions on Biomedical Engineering.

[47]  Tao Li,et al.  Estimation of Harmonic and Noise Components from Pathological Voice using Iterative Method , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[48]  Jeffrey G. Andrews,et al.  Mode Switching for the Multi-Antenna Broadcast Channel Based on Delay and Channel Quantization , 2008, EURASIP J. Adv. Signal Process..

[49]  T.W. Berger,et al.  Pathological Voice Assessment , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[50]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Evaluation of neural classifiers using statistic methods for identification of laryngeal pathologies , 1998, Proceedings 5th Brazilian Symposium on Neural Networks (Cat. No.98EX209).

[51]  V.K. Makukha,et al.  Spectrum analysis of vocalization application for voice pathology detection , 2007, EUROCON 2007 - The International Conference on "Computer as a Tool".

[52]  Vahid Majidnezhad,et al.  A Novel GMM-Based Feature Reduction for Vocal Fold Pathology Diagnosis , 2013 .

[53]  Juan Ignacio Godino-Llorente,et al.  Complexity analysis of pathological voices by means of hidden markov entropy measurements , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[54]  Wang Jian,et al.  Pathological speech deformation degree assessment based on integrating feature and neural network , 2008, 2008 27th Chinese Control Conference.

[55]  J.C. Pereira,et al.  Optimum path forest classifier applied to laryngeal pathology detection , 2008, 2008 15th International Conference on Systems, Signals and Image Processing.

[56]  Germán Castellanos-Domínguez,et al.  Automatic Detection of Pathological Voices Using Complexity Measures, Noise Parameters, and Mel-Cepstral Coefficients , 2011, IEEE Transactions on Biomedical Engineering.