A Comparative Study of Recognition Technique Used for Development of Automatic Stuttered Speech Dysfluency Recognition System

Objectives: This paper is an attempt to compare the work done around the world for development of stuttered speech database and approaches for analysis of stuttered speech and recognition system. Methods/Statistical Analysis: In particular we have compared the different methods adopted by the researchers around the world for development of speech database and the techniques implemented on these developed databases. We have compared the databases on the basis of utterances, gender, age group, speech dysfluencies and type of samples. The recognition systems are compared on the basis of feature used, classification techniques and the accuracy. Findings: Speech recognition based application is getting more popularized and now being implemented at various places. However, the developed speech recognition systems cannot handle the speech dysfluencies. Very less work had been carried out till date for stuttered speech recognition system. The work for Indian languages is very negligible. The only work carried out is for Kannada. There is no major contribution for other Indian Languages. This paper shows the current status and the notable work carried in other languages. Application/Improvements: There is a need to develop more such systems for other Indian languages which will be very helpful for multilingual society like India.

[1]  Wieslawa Kuniszyk-Józkowiak,et al.  Hierarchical ANN system for stuttering identification , 2013, Comput. Speech Lang..

[2]  Andrzej Czyzewski,et al.  Intelligent Processing of Stuttered Speech , 2003, Journal of Intelligent Information Systems.

[3]  Sazali Yaacob,et al.  Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques , 2012, Journal of Medical Systems.

[4]  Lalit Jain,et al.  Digital Audio Watermarking using Frequency Masking Technique , 2015 .

[5]  Elizabeth Shriberg,et al.  Phonetic Consequences of Speech Disfluency , 1999 .

[6]  P Howell,et al.  Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: I. Psychometric procedures appropriate for selection of training material for lexical dysfluency classifiers. , 1997, Journal of speech, language, and hearing research : JSLHR.

[7]  Jasmy Yunus,et al.  Overview of a Computer-based Stuttering Therapy , 2006 .

[8]  T. Callister,et al.  Intensive stuttering modification therapy: a multidimensional assessment of treatment outcomes. , 2005, Journal of speech, language, and hearing research : JSLHR.

[9]  G. Krishnan,et al.  Revisiting the acquired neurogenic stuttering in the light of developmental stuttering , 2011, Journal of Neurolinguistics.

[10]  Wieslawa Kuniszyk-Józkowiak,et al.  Speech nonfluency detection using Kohonen networks , 2009, Neural Computing and Applications.

[11]  D. Ward Sudden onset stuttering in an adult: Neurogenic and psychogenic perspectives , 2010, Journal of Neurolinguistics.

[12]  Lisa M. D. Archibald,et al.  The relationship between stuttering severity and kinesthetic acuity for jaw movements in adults who stutter , 1999 .

[13]  Peter A. Heeman,et al.  Using a Uniform-Weight Grammar to Model Disfluencies in Stuttered Read Speech : A Pilot Study , 2004 .

[14]  S. S. Awad,et al.  Computer assisted treated for motor speech disorders , 1999, IMTC/99. Proceedings of the 16th IEEE Instrumentation and Measurement Technology Conference (Cat. No.99CH36309).

[15]  Y. V. Geetha,et al.  Classification of childhood disfluencies using neural networks , 2000 .

[16]  Simon E. Fisher,et al.  Localisation of a gene implicated in a severe speech and language disorder , 1997, Nature Genetics.

[17]  R. Ingham,et al.  Time-interval analysis of interjudge and intrajudge agreement for stuttering event judgments. , 1992, Journal of speech and hearing research.

[18]  Tetsuya Takiguchi,et al.  Multimodal speech recognition of a person with articulation disorders using AAM and MAF , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[19]  Sin-Horng Chen,et al.  A speech recognition method based on the sequential multi-layer perceptrons , 1996, Neural Networks.

[20]  M. Hariharan,et al.  Automatic detection of prolongations and repetitions using LPCC , 2009, 2009 International Conference for Technical Postgraduates (TECHPOS).

[21]  P Howell,et al.  Exchange of stuttering from function words to content words with age. , 1999, Journal of speech, language, and hearing research : JSLHR.

[22]  Eric Achten,et al.  fMRI of developmental stuttering: A pilot study , 2003, Brain and Language.

[23]  E. Yairi,et al.  Identification of traits associated with stuttering. , 2006, Journal of communication disorders.

[24]  Katherine E Henson,et al.  Risk of Suicide After Cancer Diagnosis in England , 2018, JAMA psychiatry.

[25]  A. Packman,et al.  Altered auditory feedback and the treatment of stuttering: a review. , 2006, Journal of fluency disorders.

[26]  M. Wingate,et al.  Foundations of Stuttering , 2001 .

[27]  O. P. Skljarov,et al.  An internet system of partner-learning special type , 2003, 2003 IEEE International Workshop on Workload Characterization (IEEE Cat. No.03EX775).

[28]  Marek Wisniewski,et al.  Automatic Detection of Disorders in a Continuous Speech with the Hidden Markov Models Approach , 2008, Computer Recognition Systems 2.

[29]  Ratnadeep R. Deshmukh,et al.  Analysis of Variations in Speech in Different Age Groups using Prosody Technique , 2015 .

[30]  Peter Howell,et al.  Exchange of disfluency with age from function words to content words in spanish speakers who stutter. , 2003, Journal of speech, language, and hearing research : JSLHR.

[31]  Wiesława Kuniszyk-Jóźkowiak,et al.  The application of Kohonen and Multilayer Perceptron Networks in the speech nonfluency analysis , 2014 .

[32]  P. Mahesha,et al.  Gaussian Mixture Model Based Classification of Stuttering Dysfluencies , 2016, J. Intell. Syst..

[33]  S. S. Awad The application of digital speech processing to stuttering therapy , 1997, IEEE Instrumentation and Measurement Technology Conference Sensing, Processing, Networking. IMTC Proceedings.

[34]  Thiemo Voigt,et al.  Smartphone Support for Persons Who Stutter , 2014, IPSN 2014.

[35]  Peter Howell,et al.  Assessment of Some Contemporary Theories of Stuttering That Apply to Spontaneous Speech. , 2004, Contemporary issues in communication science and disorders : CICSD.

[36]  Sazali Yaacob,et al.  Classification of speech dysfluencies with MFCC and LPCC features , 2012, Expert Syst. Appl..

[37]  Proceedings of the 17th IEEE instrumentation and measurement technology conference , 2000, Proceedings of the 17th IEEE Instrumentation and Measurement Technology Conference [Cat. No. 00CH37066].

[38]  O. Skljarov,et al.  Chaos and speech rhythm , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[39]  Elmar Nöth,et al.  Automatic stuttering recognition using hidden Markov models , 2000, INTERSPEECH.

[40]  O. Bloodstein A handbook on stuttering , 1969 .

[41]  R. Fabus,et al.  A Review of Stuttering Intervention Approaches for Preschool-Age and Elementary School-Age Children , 2010 .

[42]  D. Dewey,et al.  What Is Developmental Dyspraxia , 1995, Brain and Cognition.

[43]  Chee-Ming Ting,et al.  Application of Malay speech technology in Malay Speech Therapy Assistance Tools , 2007, 2007 International Conference on Intelligent and Advanced Systems.

[44]  Sazali Yaacob,et al.  Comparison of speech parameterization techniques for the classification of speech disfluencies , 2013 .

[45]  J. Kalinowski,et al.  The need for self-report data in the assessment of stuttering therapy efficacy: repetitions and prolongations of speech. The stuttering syndrome. , 2006, International journal of language & communication disorders.

[46]  Jiri Pospichal,et al.  Pattern search in dysfluent speech , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.

[47]  Selim S. Awad,et al.  Speech therapy software on an open web platform , 2014, 2014 10th International Computer Engineering Conference (ICENCO).

[48]  E. Boberg,et al.  An investigation of interclinic agreement in the identification of fluent and stuttered syllables , 1988 .

[49]  Ratnadeep R. Deshmukh,et al.  DEVELOPMENT OF ISOLATED WORDS SPEECH DATABASE OF MARATHI WORDS FOR AGRICULTURE PURPOSE , 2012 .

[50]  M. D. Shieh,et al.  Automatic Recognition of Repetitions in Stuttered Speech: Using End-Point Detection and Dynamic Time Warping , 2015 .

[51]  W Kuniszyk-Jóźkowiak,et al.  Effect of acoustical, visual and tactile echo on speech fluency of stutterers. , 1996, Folia phoniatrica et logopaedica : official organ of the International Association of Logopedics and Phoniatrics.

[52]  J. Pálfy,et al.  Analysis of Dysfluencies by Computational Intelligence , 2014 .

[53]  M. Hariharan,et al.  MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA , 2009, 2009 IEEE Student Conference on Research and Development (SCOReD).

[54]  G. Janvale,et al.  Emotion Recognition System from Artificial Marathi Speech using MFCC and LDA Techniques , 2014 .

[55]  Thomas S. Huang,et al.  Hmm-Based and Svm-Based Recognition of the Speech of Talkers With Spastic Dysarthria , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[56]  K. M. Ravikumar,et al.  Automatic Detection of Syllable Repetition in Read Speech for Objective Assessment of Stuttered Disfluencies , 2008 .

[57]  H. C. Nagaraj,et al.  An Approach for Objective Assessment of Stuttered Speech Using MFCC Features , 2009 .

[58]  Wieslawa Kuniszyk-Józkowiak,et al.  Artificial Neural Networks in the Disabled Speech Analysis , 2009, Computer Recognition Systems 3.

[59]  E. Szabelska,et al.  Computer-based speech analysis in stutter , 2013 .

[60]  Wiesława Kuniszyk-Jóźkowiak,et al.  Automatic detection of prolonged fricative phonemes with the Hidden Markov Models approach , 2007 .

[61]  Peter Howell,et al.  Predicting stuttering from phonetic complexity in German. , 2004, Journal of fluency disorders.

[62]  P. Zebrowski Duration of the speech disfluencies of beginning stutterers. , 1991, Journal of speech and hearing research.