Speech stuttering assessment using sample entropy and Least Square Support Vector Machine

This work is intended to discuss the performance of sample entropy feature for the recognition of stuttered events. The data for the analysis is taken from the UCLASS database. Manual segmentation is performed to identify the stuttered events prior to the feature extraction process. Wavelet packet decomposition is performed, and the sample entropy features are extracted using three different filter banks, Bark scale, Mel scale and Erb scale. The extracted features are tested using Least Square Support Vector Machine (LS SVM) for the identification of repetition and prolongation. Ten fold cross validation method is used to ensure the reliability of the results. The experimental investigations reveal that the proposed method shows promising results in distinguishing between the two stuttering events.

[1]  Andrzej Czyzewski,et al.  Intelligent Processing of Stuttered Speech , 2003, Journal of Intelligent Information Systems.

[2]  Kemal Polat,et al.  A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine , 2008, Expert Syst. Appl..

[3]  Sazali Yaacob,et al.  Discrimination of vision impairments using single trial VEPs , 2011, 2011 IEEE International Conference on Control System, Computing and Engineering.

[4]  Y. V. Geetha,et al.  Classification of childhood disfluencies using neural networks , 2000 .

[5]  P Howell,et al.  Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: I. Psychometric procedures appropriate for selection of training material for lexical dysfluency classifiers. , 1997, Journal of speech, language, and hearing research : JSLHR.

[6]  M. Hariharan,et al.  Automatic detection of prolongations and repetitions using LPCC , 2009, 2009 International Conference for Technical Postgraduates (TECHPOS).

[7]  S. S. Awad The application of digital speech processing to stuttering therapy , 1997, IEEE Instrumentation and Measurement Technology Conference Sensing, Processing, Networking. IMTC Proceedings.

[8]  Jasmy Yunus,et al.  Overview of a Computer-based Stuttering Therapy , 2006 .

[9]  Peter Howell,et al.  The University College London Archive of Stuttered Speech (UCLASS). , 2009, Journal of speech, language, and hearing research : JSLHR.

[10]  C. Burrus,et al.  Introduction to Wavelets and Wavelet Transforms: A Primer , 1997 .

[11]  Wiesława Kuniszyk-Jóźkowiak,et al.  The application of Kohonen and Multilayer Perceptron Networks in the speech nonfluency analysis , 2014 .

[12]  Chee-Ming Ting,et al.  Application of Malay speech technology in Malay Speech Therapy Assistance Tools , 2007, 2007 International Conference on Intelligent and Advanced Systems.

[13]  K. M. Ravikumar,et al.  Automatic Detection of Syllable Repetition in Read Speech for Objective Assessment of Stuttered Disfluencies , 2008 .

[14]  Peter Howell,et al.  Facilities to assist people to research into stammered speech. , 2004, Stammering research : an on-line journal published by the British Stammering Association.

[15]  H. C. Nagaraj,et al.  An Approach for Objective Assessment of Stuttered Speech Using MFCC Features , 2009 .

[16]  Elmar Nöth,et al.  Automatic stuttering recognition using hidden Markov models , 2000, INTERSPEECH.

[17]  Eric Achten,et al.  fMRI of developmental stuttering: A pilot study , 2003, Brain and Language.

[18]  M. Hariharan,et al.  MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA , 2009, 2009 IEEE Student Conference on Research and Development (SCOReD).

[19]  P Howell,et al.  Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: II. ANN recognition of repetitions and prolongations with supplied word segment markers. , 1997, Journal of speech, language, and hearing research : JSLHR.

[20]  Johan A. K. Suykens,et al.  Multiclass least squares support vector machines , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[21]  I. Daubechies,et al.  Biorthogonal bases of compactly supported wavelets , 1992 .

[22]  Sazali Yaacob,et al.  Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques , 2012, Journal of Medical Systems.

[23]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[24]  Sazali Yaacob,et al.  Classification of speech dysfluencies with MFCC and LPCC features , 2012, Expert Syst. Appl..

[25]  Ajit S. Bopardikar,et al.  Wavelet transforms - introduction to theory and applications , 1998 .

[26]  Alex Acero,et al.  Spoken Language Processing , 2001 .

[27]  M. Hariharan,et al.  Mental tasks classifications using S-transform for BCI applications , 2011, 2011 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT).