Specific Language Impairment Detection Through Voice Analysis

Specific Language Impairment is a communication disorder regarding the mastery of language and conversation that impacts children. The system proposed aims to provide an alternative diagnosis method that does not rely on specific assessment tools. The system will accept a voice sample from the child and then detect indicators that differentiate individuals with specific language impairment from that voice sample. These indicators were based on the timbre and pitch characteristics of sound. Three different feature spaces are calculated, followed by derived features, with three different classifiers to determine the most accurate combination. The three feature spaces are Chroma, Mel-frequency cepstral coefficients (MFCC), and Tonnetz and the three classifiers are Support Vector Machines, Random Forest and a Recurrent Neural Network. MFCC, representing the timbre characteristic, was found to be the most accurate feature vector across all classifiers and Random Forest being the most accurate classifier across all feature spaces. The most accurate combination found was the MFCC feature vector with the Random Forest classifier with an accuracy level of 99%. The MFCC feature vector has the most features that are extracted giving the reason for the high accuracy. However, this accuracy decreases when the recorded word is three syllables or longer. The system proposed has proven to be a valid method that can detect SLI.

[1]  Ashwin Nair Anil Kumar,et al.  Text dependent voice recognition system using MFCC and VQ for security applications , 2017, 2017 International conference of Electronics, Communication and Aerospace Technology (ICECA).

[2]  Chee Kyun Ng,et al.  Animal voice recognition for identification (ID) detection system , 2011, 2011 IEEE 7th International Colloquium on Signal Processing and its Applications.

[3]  Djemili Rafik,et al.  Text-Independent Speaker Identification by Combining MFCC and MVA Features , 2018, 2018 International Conference on Signal, Image, Vision and their Applications (SIVA).

[4]  Tamara Sztynda,et al.  Loss of BAP1 Expression Is Very Rare in Pancreatic Ductal Adenocarcinoma , 2016, PloS one.

[5]  Stephen Grossberg,et al.  Recurrent neural networks , 2013, Scholarpedia.

[6]  Anssi Klapuri,et al.  Signal Processing Methods for Music Transcription , 2006 .

[7]  P. Schulz,et al.  Specific Language Impairment and Early Second Language Acquisition: The Risk of Over- and Underdiagnosis , 2014 .

[8]  Voula C. Georgopoulos,et al.  Development of intelligent method for differential diagnosis of Specific Language Impairment , 2001, 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[9]  Zhihua Wang,et al.  Bowel Sound Detection Based on MFCC Feature and LSTM Neural Network , 2018, 2018 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[10]  Pavel Grill,et al.  Speech Databases of Typical Children and Children with SLI , 2016, PloS one.