论文信息 - Using deep learning for automatically determining correct application of basic quranic recitation rules

Using deep learning for automatically determining correct application of basic quranic recitation rules

Quranic Recitation Rules (Ahkam Al-Tajweed) are the articulation rules that should be applied properly when reciting the Holy Quran. Most of the current automatic Quran recitation systems focus on the basic aspects of recitation, which are concerned with the correct pronunciation of words and neglect the advanced Ahkam Al-Tajweed that are related to the rhythmic and melodious way of recitation such as where to stop and how to “stretch” or “merge” certain letters. The only existing works on the latter parts are limited in terms of the rules they consider or the parts of Quran they cover. This paper comes to fill these gaps. It addresses the problem of identifying the correct usage of Ahkam Al-Tajweed in the entire Quran. Specifically, we focus on eight Ahkam Al-Tajweed faced by early learners of recitation. Popular audio processing techniques for feature extraction (such as LPC, MFCC and WPD) and classification (KNN, SVM, RF, etc.) are tested on an in-house dataset. Moreover, we study the significance of the features by performing several t-tests. Our results show the highest accuracy achieved is 94.4%, which is obtained when bagging is applied to SVM with all features except for the LPC features.

[1] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[2] Ahmad Sharieh,et al. Speaker Independent Quranic Recognizer Based on Maximum Likelihood Linear Regression , 2007 .

[3] Hsiao-Chuan Wang,et al. Language identification using pitch contour information , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[4] Björn W. Schuller,et al. Large-scale audio feature extraction and SVM for acoustic scene classification , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[5] George Tzanetakis,et al. Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[6] Zaidi Razak,et al. Quranic verse Recitation Recognition Module for Educational Programme , 2008 .

[7] George Tzanetakis,et al. Audio Analysis using the Discrete Wavelet Transform , 2001 .

[8] Ian H. Witten,et al. Weka: Practical machine learning tools and techniques with Java implementations , 1999 .

[9] Mahmoud Al-Ayyoub,et al. Spoken Arabic dialects identification: The case of Egyptian and Jordanian dialects , 2014, 2014 5th International Conference on Information and Communication Systems (ICICS).

[10] Gamini Dissanayake,et al. Driver Drowsiness Classification Using Fuzzy Wavelet-Packet-Based Feature-Extraction Algorithm , 2011, IEEE Transactions on Biomedical Engineering.

[11] Noorhaniza Wahid,et al. Improvement of Audio Feature Extraction Techniques in Traditional Indian Musical Instrument , 2014, SCDM.

[12] G. Peeters. Automatic Classification of Large Musical Instrument Databases Using Hierarchical Classifiers with Inertia Ratio Maximization , 2003 .

[13] Mohd Yamani Idna Idris,et al. Quranic Verse Recitation Recognition Module for Support in j-QAF Learning: A Review , 2008 .

[14] Sherif Abdou,et al. Enhancing usability of CAPL system for qur'an recitation learning , 2007, INTERSPEECH.

[15] Mahmoud Al-Ayyoub,et al. Multi-agent based dynamic resource provisioning and monitoring for cloud computing systems infrastructure , 2015, Cluster Computing.

[16] Mario Chica-Olmo,et al. An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[17] Sherif Abdou,et al. INTERSPEECH 2006-ICSLP 849 g System Using Speech Recognition , 2006 .

[18] A.K.M Fazlul Haque. FFT and Wavelet-Based Feature Extraction for Acoustic Audio Classification. , 2012 .

[19] Mohd Yamani Idna Idris,et al. Quranic Verse Recitation Feature Extraction using Mel-Frequency Cepstral Coefficient (MFCC) , 2008 .

[20] W. El Falou,et al. Analysis and implementation of a "Quranic" verses delimitation system in audio files using speech recognition techniques , 2006, 2006 2nd International Conference on Information & Communication Technologies.

[21] Ana María Martínez Enríquez,et al. Voice Content Matching System for Quran Readers , 2010, 2010 Ninth Mexican International Conference on Artificial Intelligence.