Linear Predictive Coefficients-Based Feature to Identify Top-Seven Spoken Languages

Speech recognition in a multilingual scenario is not trivial when multiple languages are used in one conversation. The language must be identified before we process speech recognition as such...