Paper Information - LPC and MFCC Performance Evaluation with Artificial Neural Network for Spoken Language Identification

Automatic language identification plays an essential role in a wide range of multilingual services. Examples of applications that require it include automatic translation into a target language and routing an incoming telephone call to a human switchboard operator fluent in the corresponding language. This paper investigates the use of Linear Predictive Coding (LPC) and/or Mel Frequency Cepstral Coefficients (MFCC) with an Artificial Neural Network (ANN) for automatic language identification. Different orders of LPC and MFCC have been tested. In addition, different numbers of hidden layers, different numbers of neurons in each hidden layer, and different transfer functions have been tested in the ANN. Three languages (Arabic, English, and French) have been used to evaluate the performance of the automatic language identification systems.
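To make the LPC feature mentioned above concrete, the following is a minimal sketch of how LPC coefficients of a chosen order can be computed for one speech frame. It assumes the autocorrelation method with the Levinson-Durbin recursion, which is a common choice but is not stated in the abstract; the function names `autocorrelation` and `lpc` are hypothetical, and the sign convention A(z) = 1 + a1 z^-1 + ... + ap z^-p is likewise an assumption.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation of one frame for lags 0..max_lag."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def lpc(frame, order):
    """Estimate LPC coefficients with the Levinson-Durbin recursion.

    Returns (a, err): a[0] == 1.0 and a[1..order] are the coefficients of
    A(z) = 1 + a1 z^-1 + ... (assumed convention); err is the energy of
    the prediction residual.
    """
    r = autocorrelation(frame, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate frame: stop early
            break
        # Reflection coefficient for step i.
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        # Update the coefficients from the previous step's values.
        updated = a[:]
        for j in range(1, i):
            updated[j] = a[j] + k * a[i - j]
        updated[i] = k
        a = updated
        err *= 1.0 - k * k
    return a, err


# Illustrative frame: a decaying exponential, which a first-order
# predictor can model almost exactly.
frame = [0.9 ** n for n in range(200)]
coeffs, residual_energy = lpc(frame, order=2)
print(coeffs, residual_energy)
```

In a pipeline of the kind the abstract describes, the coefficients `a[1..order]` from each frame would form part of the feature vector passed to the neural network, and `order` is the parameter that the paper reports varying.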
