Automatic Spoken Language Identification by Digital Signal Processing Methods. Tatar and Russian Languages

The paper studies the problem of language identification for audio files. For solving the problem, we use methods of digital signal processing only (without analysis of phonemes distinctive for language). A special attention is drawn to the form of signal in an area close to the position of a stop consonant. The evaluation is performed on a set of two languages; this includes speech records taken from TV programs. It is provided that solely one of the two languages (either Tatar or Russian) is used in each of files. Experimental evidence demonstrates the feasibility of the proposed techniques.

[1]  Lukás Burget,et al.  iVector-based discriminative adaptation for automatic speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[2]  George R. Doddington,et al.  Automatic Language Identification. , 1974 .

[3]  K. Sreenivasa Rao,et al.  Language identification using Hilbert envelope and phase information of linear prediction residual , 2013, 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE).

[4]  Marc A. Zissman Language identification using phoneme recognition and phonotactic language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Björn Schuller,et al.  Computational Paralinguistics , 2013 .

[6]  Bin Ma,et al.  Spoken Language Recognition: From Fundamentals to Practice , 2013, Proceedings of the IEEE.

[7]  Joaquín González-Rodríguez,et al.  Automatic language identification using deep neural networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  R. Latypov,et al.  Classification of speech files by waveforms , 2015, Lobachevskii Journal of Mathematics.

[9]  Nikhil V Mathew,et al.  Analyzing the Effectiveness of N-gram Technique Based Feature Set in a Naive Bayesian Spam Filter , 2016, 2016 International Conference on Emerging Technological Trends (ICETT).

[10]  S. Mallat A wavelet tour of signal processing , 1998 .