Phonological and computational perspectives of language identification (LID) system

Humans have an inherent ability to tell languages apart as part of their intelligence. Automatic LID was science fiction in the 1970s, but today it is deployed in practice. Of its two branches, text-based language recognition and spoken language recognition, the latter is comparatively more challenging and is the focus of this paper. Language recognition, in general, is the process by which a system determines the identity of the language in a given input. It is widely used in multilingual processing for translation, interpretation, and spoken information retrieval. It also has a place in artificial intelligence research and in security applications for data (information) distillation. This paper experiments with systems on two different datasets, studying prosodic and acoustic (MFCC) features separately, and then fuses the two systems to produce a combined result.
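The abstract does not say how the prosodic and acoustic systems are fused. As an illustration only, the sketch below assumes score-level fusion: each subsystem emits one log-likelihood-style score per candidate language, the scores are normalized to posteriors, and a weighted sum picks the winner. The function names, the `weight` parameter, and the language labels are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Turn per-language scores (e.g. log-likelihoods) into posteriors."""
    peak = max(scores.values())  # subtract the max for numerical stability
    exps = {lang: math.exp(s - peak) for lang, s in scores.items()}
    total = sum(exps.values())
    return {lang: e / total for lang, e in exps.items()}


def fuse_scores(acoustic, prosodic, weight=0.5):
    """Weighted sum of the two subsystems' posteriors.

    `weight` is the (assumed) trust placed in the acoustic (MFCC) system;
    the prosodic system receives 1 - weight.
    """
    if acoustic.keys() != prosodic.keys():
        raise ValueError("both systems must score the same languages")
    p_acoustic = softmax(acoustic)
    p_prosodic = softmax(prosodic)
    return {
        lang: weight * p_acoustic[lang] + (1.0 - weight) * p_prosodic[lang]
        for lang in p_acoustic
    }


def identify(acoustic, prosodic, weight=0.5):
    """Return the language with the highest fused posterior."""
    fused = fuse_scores(acoustic, prosodic, weight)
    return max(fused, key=fused.get)


# Hypothetical scores for one utterance from each subsystem.
acoustic_scores = {"en": -10.0, "hi": -12.0}
prosodic_scores = {"en": -5.0, "hi": -4.5}
decision = identify(acoustic_scores, prosodic_scores, weight=0.5)
```

In this toy case the acoustic system favours `en` more strongly than the prosodic system favours `hi`, so the equal-weight fusion follows the acoustic system. In practice the weight would be tuned on held-out data rather than fixed by hand.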
