Development of high quality speech compression system for Quranic recitation based on modified CELP algorithm

Currently, there are more than 500 Quranic recitations available freely on the internet. There is also a growing trends on the use of smart phone compare to traditional desktop PCs for accessing the internet. On such limited device, a high quality speech compression for Quranic recitation is favorable. In this paper, we developed a high quality speech compression for Quranic recitation by modifying Code Excited Linear Prediction (CELP) algorithm. First, the characteristics of the Quranic recitation of all surahs was evaluated in which it was found that the voiced speech is more dominant compare to unvoiced speech. Next, we optimize CELP algorithm based on previous findings by modifying the original stochastic codebook. Instead of random Gaussian signal, we trained the codebook using LBG algorithm for Surah Al-Fatihah, and used the trained codebook for the whole Quran. Lastly, we evaluate the performance of the developed algorithm objectively using PESQ (Perceptual Evaluation of Speech Quality). Results showed that our proposed algorithm performs better than the traditional CELP algorithm, in terms of PESQ score and processing time.

[1]  S. Paulikas,et al.  Assessment of CELP codecs quality in multi-lingual environment , 2009, 2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[2]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  R. M. Isa,et al.  The Comparison between Listening to Al-Quran and Listening to Classical Music on the Brainwave Signal for the Alpha Band , 2012, 2012 Third International Conference on Intelligent Systems Modelling and Simulation.

[4]  Daniel P. W. Ellis,et al.  Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .

[5]  Hongbing Hu,et al.  A spectral/temporal method for robust fundamental frequency tracking. , 2008, The Journal of the Acoustical Society of America.

[6]  Alan W. Black,et al.  The CMU Arctic speech databases , 2004, SSW.

[7]  D. A. van Leeuwen,et al.  Speech and Audio Signal Processing , 2011 .

[8]  Karthikeyan Natesan Ramamurthy,et al.  MATLAB Software for the Code Excited Linear Prediction Algorithm: The Federal Standard-1016 , 2010, MATLAB Software for the Code Excited Linear Prediction Algorithm.