Analysis on Mel Frequency Cepstral Coefficients and Linear Predictive Cepstral Coefficients as Feature Extraction on Automatic Accents Identification

Automatic Accents Identification is very important for discussion especially within scope of speaker recognition. Some contribution of appropriate techniques uses in Music Recognition and Accent Identification may contributes in improving the recognition rate. Techniques in discussing on music genre identification or accents automatic identification and the combination of both processes still in ambiguous for this field. This paper investigates mainly the processes involved in speech processing or identification includes: acoustic/speech signal, pre-processing, feature extraction, pattern classification and accuracy results. Process of automatic accents identification through speech signals starts with general pre-processing techniques, feature extraction; which in this studies too, comparing within two techniques; Mel-Frequency Cepstral Coefficients (MFCC) and Linear Predictive Cepstral Coefficients (LPCC).While for vocal tract with musical characteristic in used for musical genre identification and the usage of pattern classification for three methods which includes; Hidden Markov Model (HMM), Support Vector Machine (SVM) and Probabilistic Principal Component Analysis. Thus, this paper investigates the feature extraction techniques used in identifying accents to be implemented in Quranic Accents identification and proposed MFCC as better techniques for feature extraction and getting higher accuracies for 93.33% while 86.67% if compared to LPCC.

[1]  S. Maity,et al.  IITKGP-MLILSC speech database for language identification , 2012, 2012 National Conference on Communications (NCC).

[2]  J. Gronsbell,et al.  A Comparison of Feature Selection and Classification Algorithms in Identifying Baseball Pitches , 2013 .

[3]  P. Dhanalakshmi,et al.  EEG Signal Classification using Linear Predictive Cepstral Coefficient Features , 2013 .

[4]  Juraj Kacur,et al.  Speaker identification by K-nearest neighbors: Application of PCA and LDA prior to KNN , 2011, 2011 18th International Conference on Systems, Signals and Image Processing.

[5]  Salina Abdul Samad,et al.  Modified Adaptive Line Enhancer in Variable Noise Environments using Set-Membership Adaptive Algorithm , 2014 .

[6]  Wasfi G. Al-Khatib,et al.  Identification of Question and Non-Question Segments in Arabic Monologue Based on Prosodic Features Using Type-2 Fuzzy Logic Systems , 2010, 2010 Second International Conference on Computational Intelligence, Modelling and Simulation.

[7]  Vinod DS,et al.  Vector Quantization and MFCC based Classification of Dysfluencies in Stuttered Speech , 2012 .

[8]  Tien Ping Tan,et al.  Analysis of Malay Speech Recognition for Different Speaker Origins , 2012, 2012 International Conference on Asian Language Processing.

[9]  Trisiladevi C. Nagavi,et al.  Overview of automatic Indian music information recognition, classification and retrieval systems , 2011, 2011 International Conference on Recent Trends in Information Systems.

[10]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[11]  Sivaji Bandyopadhyay,et al.  Music Genre Classification: A Semi-supervised Approach , 2013, MCPR.

[12]  Ricardo Gutierrez-Osuna,et al.  Foreign Accent Conversion Through Concatenative Synthesis in the Articulatory Domain , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Samir A. Elsagheer Mohamed,et al.  Virtual Learning System (Miqra’ah) for Quran Recitations for Sighted and Blind Students , 2014 .

[14]  Chang-Hsing Lee,et al.  Music genre classification using modulation spectral features and multiple prototype vectors representation , 2011, 2011 4th International Congress on Image and Signal Processing.

[15]  Sid-Ahmed Selouani,et al.  Speaker-independent ASR for Modern Standard Arabic: effect of regional accents , 2012, International Journal of Speech Technology.

[16]  David Michael Graeme Watts Speaker identification - prototype development and performance , 2006 .

[17]  Bibhas Chandra Dhara,et al.  Music Classification based on MFCC Variants and Amplitude Variation Pattern: A Hierarchical Approach , 2012 .

[18]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[19]  Saurabh Deshmukh,et al.  Analysis and application of audio features extraction and classification method to be used for North Indian Classical Music’s singer identification problem , 2014 .

[20]  Shaiful Jahari Hashim,et al.  Feature extraction using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification , 2014, 2014 IEEE Student Conference on Research and Development.

[21]  J. Harrington,et al.  Techniques in Speech Acoustics , 1999, Computational Linguistics.

[22]  Shweta Sinha,et al.  Fusion of multi-stream speech features for dialect classification , 2015, CSI Transactions on ICT.

[23]  Dang Cong Zheng,et al.  A new approach to acoustic analysis of two British regional accents—Birmingham and Liverpool accents , 2012, Int. J. Speech Technol..