Speaker-Independent Automatic Speech Recognition System for Mobile Phone Applications in Punjabi

Speaker-independent Automatic Speech Recognition (ASR) system based mobile phone applications are gaining popularity due to technological advancements and accessibility. Speech based applications may provide mobile phone accessibility and comfort to people performing activities where hand-free phone access is desirable e.g. drivers, athletes, machine operators etc. Similarly, users with disabilities like low vision, blindness and physically challenged may use it as an assistive technology. Development of ASR system for a specific language needs accurate, reliable and efficient acoustic model having language-specific pronunciation dictionary. Punjabi language is one of the popular languages worldwide having more than 150 million speakers. Three acoustic models- continuous, semi-continuous and phonetically-tied are developed based on three pronunciation dictionaries- word, sub-word and character based. Analysis of performance results validate Punjabi language principle “One word one sound” by having better accuracy and reliability for character based pronunciation dictionary than others. Further, phonetically-tied model outperforms others in terms of accuracy, word error rate and size due to reasonable number of Gaussians.

[1]  Mike Schuster,et al.  Japanese and Korean voice search , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[3]  K. Davis,et al.  Automatic Recognition of Spoken Digits , 1952 .

[4]  Mei-Yuh Hwang,et al.  The SPHINX-II speech recognition system: an overview , 1993, Comput. Speech Lang..

[5]  Francoise Beaufays,et al.  “Your Word is my Command”: Google Search by Voice: A Case Study , 2010 .

[6]  Yogesh Kumar,et al.  An automatic speech recognition system for spontaneous Punjabi speech corpus , 2017, Int. J. Speech Technol..

[7]  Francoise Beaufays,et al.  Google Search by Voice: A Case Study , 2010 .

[8]  Jia-Lin Shen,et al.  Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data , 1997, IEEE Trans. Speech Audio Process..

[9]  Kuldeep Kumar,et al.  A Hindi speech recognition system for connected words using HTK , 2012 .

[10]  Dennis H. Klatt,et al.  Review of the ARPA speech understanding project , 1990 .

[11]  Chiori Hori,et al.  A Myanmar large vocabulary continuous speech recognition system , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[12]  Lin-Shan Lee,et al.  Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition , 1999, EUROSPEECH.

[13]  Navdeep Singh,et al.  Speech Based Command and Control System for Mobile Phones: Issues and Challenges , 2016, 2016 Second International Conference on Computational Intelligence & Communication Technology (CICT).

[14]  Fatima El Haoussi,et al.  Investigation Amazigh speech recognition using CMU tools , 2014, Int. J. Speech Technol..

[15]  Virender Kadyan,et al.  Punjabi Automatic Speech Recognition Using HTK , 2012 .

[16]  Sruti Sruba Bharali,et al.  A comparative study of different features for isolated spoken word recognition using HMM with reference to Assamese language , 2015, Int. J. Speech Technol..

[17]  Pabitra Mitra,et al.  Bengali speech corpus for continuous auutomatic speech recognition system , 2011, 2011 International Conference on Speech Database and Assessments (Oriental COCOSDA).

[18]  Hsiao-Wuen Hon,et al.  An overview of the SPHINX speech recognition system , 1990, IEEE Trans. Acoust. Speech Signal Process..

[19]  A. M. Natarajan,et al.  Syllable modeling in continuous speech recognition for Tamil language , 2009, Int. J. Speech Technol..

[20]  Alexander I. Rudnicky,et al.  Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[21]  Adel M. Alimi,et al.  On Developing an Automatic Speech Recognition System for Standard Arabic Language , 2012 .

[22]  Kumar Ravinder,et al.  Comparison of HMM and DTW for Isolated Word Recognition System of Punjabi Language , 2010, CIARP.