Automatic text independent language identification using reduct set of feature vectors

In this paper, robust features are proposed for spoken language identification (LID) system. 12 Mel frequency cepstral coefficients (MFCCs) and five formant frequencies are extracted from each short-time windowed speech signal. These features are concatenated to form 17-dimensional feature vectors. 8-dimensional reduct set is obtained from this 17-dimensional feature vector using rough set theory. This 8-dimensional reduct set is transformed into 15 dimensional new feature vectors using the approach followed in [1]. In both the training and testing phases of LID, these 15 dimensional feature vectors are used. Due to usage of reduct set, there is a significant reduction of time in training and testing phases of the proposed LID system. The experiments are carried out on speech database of Indian languages and the results are impressive.

[1]  V. Kamakshi Prasad,et al.  Text independent language recognition system using DHMM with new features , 2012, 2012 IEEE 11th International Conference on Signal Processing.

[2]  M. Sugiyama,et al.  Automatic language recognition using acoustic features , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Andrzej Skowron,et al.  Zdzislaw Pawlak life and work (1926-2006) , 2007, Inf. Sci..

[4]  Hussein Zedan,et al.  Rough set approach to online signature identification , 2011, Digit. Signal Process..

[5]  Dominik Slezak,et al.  Ensembles of Bireducts: Towards Robust Classification and Simple Representation , 2011, FGIT.

[6]  Torgeir R. Hvidsten,et al.  A tutorial-based guide to the ROSETTA system : A Rough Set Toolkit for Analysis of Data by , 2013 .

[7]  A. Lawrence Spitz,et al.  Automatic language identification , 1997 .

[8]  Douglas A. Reynolds,et al.  Language identification using Gaussian mixture model tokenization , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  M. Zissman Automatic Language Identification of Telephone Speech , 1993 .

[10]  Richard Weber,et al.  Soft clustering - Fuzzy and rough approaches and their extensions and derivatives , 2013, Int. J. Approx. Reason..

[11]  Chris Cornelis,et al.  Attribute selection with fuzzy decision reducts , 2010, Inf. Sci..

[12]  Marc A. Zissman,et al.  Comparison of : Four Approaches to Automatic Language Identification of Telephone Speech , 2004 .

[13]  Hideyuki Suzuki,et al.  A new speech recognition method based on VQ-distortion measure and HMM , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  B. Yegnanarayana Formant extraction from linear‐prediction phase spectra , 1978 .

[15]  Jan G. Bazan,et al.  Rough set algorithms in classification problem , 2000 .

[16]  Russell B. Ives,et al.  Development of an automatic identification system of spoken languages: Phase I , 1982, ICASSP.

[17]  Ian C. Bruce,et al.  Robust Formant Tracking for Continuous Speech With Speaker Variability , 2003, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Y. M. Lazim,et al.  Clustering model of multimedia data by using rough sets theory , 2012, 2012 International Conference on Computer & Information Science (ICCIS).