An Optimized Cepstral Feature Selection method for Dysfluencies Classification using Tamil Speech Dataset

Speech is the most important and indispensable mode of communication between humans. In communication, the continuous flow of speech gets affected due to the interruption of emotional, panic and psychological factors that cause syllable or word repetition, prolongation and interjection. Speech dysfluency is a primary challenge for speech pathologist to isolate the normal speech from the stuttered speech. The primary objective of this paper is to propose a novel approach through optimized cepstral features selection that improves the classifiers accuracy. In this paper, Particle Swarm Optimization (PSO) and Synergistic Fibroblast Optimization (SFO) were introduced to select optimal features from conventional MFCC (Mel-Frequency Cepstrum Coefficients). The optimized cepstral features from PSO and SFO of pre-processed Tamil speech data is used to discriminate among different categories of speech signals like Normal, Moderate and Sever stutter through machine learning classification methods such as Support Vector Machine (SVM) and Naive Bayes (NB). From the experimental results, the optimal selection of cepstral features using SFO algorithm has achieved high accuracy of 96.08% employed with NB which outperforms well to the feature selection of PSO and classical MFCC. The evaluation of the proposed methodology is done by using performance metrics like sensitivity, specificity, precision, f-score and accuracy.

[1]  S. S. Awad The application of digital speech processing to stuttering therapy , 1997, IEEE Instrumentation and Measurement Technology Conference Sensing, Processing, Networking. IMTC Proceedings.

[2]  Arti V. Bang,et al.  Emotion recognition on the basis of audio signal using Naive Bayes classifier , 2016, 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[3]  Mohammad Ehsan Basiri,et al.  Particle Swarm Optimization for Feature Selection in Speaker Verification , 2010, EvoApplications.

[4]  K. M. Ravikumar,et al.  Automatic Detection of Syllable Repetition in Read Speech for Objective Assessment of Stuttered Disfluencies , 2008 .

[5]  You-ming Ma Predicting ratings of severity of stuttering. , 1961 .

[6]  Wiesława Kuniszyk-Jóźkowiak,et al.  Automatic detection of prolonged fricative phonemes with the Hidden Markov Models approach , 2007 .

[7]  I. M. Yassin,et al.  Optimization of MFCC parameters using Particle Swarm Optimization for diagnosis of infant hypothyroidism using Multi- Layer Perceptron , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[8]  M. Shivakumar,et al.  Adaptive optimization based neural network for classification of stuttered speech , 2019, ICCSP.

[9]  P. Subashini,et al.  Synergistic fibroblast optimization based improved reinforcement learning for intelligent assistive device , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[10]  J. Saini,et al.  Prevalence of communication disorders in a group of islands in India , 2017 .

[11]  Sazali Yaacob,et al.  Comparison of speech parameterization techniques for the classification of speech disfluencies , 2013 .

[12]  M. Hariharan,et al.  MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA , 2009, 2009 IEEE Student Conference on Research and Development (SCOReD).

[13]  Uttam Kumar Roy,et al.  Emotion recognition using prosodie and spectral features of speech and Naïve Bayes Classifier , 2017, 2017 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET).

[14]  M. Young Predicting ratings of severity of stuttering. , 1961, The Journal of speech and hearing disorders.

[15]  P. Mahesha,et al.  Support vector machine-based stuttering dysfluency classification using GMM supervectors , 2015, Int. J. Grid Util. Comput..

[16]  Indra Ganesan,et al.  Speech Recognition with Advanced Feature Extraction Methods Using Adaptive Particle Swarm Optimization , 2016 .

[17]  D. E. Williams,et al.  Comparison of procedures for scaling severity of stuttering. , 1963, Journal of speech and hearing research.

[18]  Bozena Kostek,et al.  Comparative Analysis of Spectral and Cepstral Feature Extraction Techniques for Phoneme Modelling , 2018, MISSI.

[19]  P. Mahesha,et al.  Feature based classification of dysfluent and normal speech , 2012, CCSEIT '12.

[20]  M Krishnaveni,et al.  Synergistic Fibroblast Optimization , 2017 .

[21]  Turker Tuncer,et al.  Turkish vowel classification based on acoustical and decompositional features optimized by Genetic Algorithm , 2019 .