Detecting sentences types in the standard arabic language

The standard Arabic language, like many other languages, contains a prosodic feature, which is hidden in the speech signal. The studies related to this field are still in the preliminary stages. This fact results in restraining the performance of the communication tools. The prosodic study allows people having all the communication tools needed in their native language. Therefore, we propose, in this paper, a prosodic study between the various types of sentences in the standard Arabic language. The sentences are recognized according to three modalities as the following: declarative, interrogative and exclamatory sentences. The results of this study will be used to synthesize the different types of pronunciation that can be exploited in several domains namely the man-machine communication. To this end, we developed a specific dataset, consisting of the three types of sentences. Then, we tested two sets of features: prosodic features (Fundamental Frequency, Energy and Duration) and spectrum features (Mel-Frequency Cepstral Coefficients and Linear Predictive Coding) as well their combination. We adopted the Multi-Class Support Vector Machine (MC-SVM) as classifier. The experimental results are very encouraging.

[1]  Hocine Teffahi,et al.  Detection of questions in Berber language using prosodic features , 2014, 2014 International Conference on Multimedia Computing and Systems (ICMCS).

[2]  Feature Extraction in Speech Coding and Recognition , 2022 .

[3]  Mari Ostendorf,et al.  Question Detection in Spoken Conversations Using Textual Conversations , 2011, ACL.

[4]  Ioannis Pitas,et al.  Automatic emotional speech classification , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  I. Elamvazuthi,et al.  Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques , 2010, ArXiv.

[6]  Helena Moniz,et al.  Analysis of Interrogatives in Different Domains , 2010, COST 2102 Training School.

[7]  Thomas Hain,et al.  Multi-pass sentence-end detection of lecture speech , 2014, INTERSPEECH.

[8]  Julia Hirschberg,et al.  Detecting question-bearing turns in spoken tutorial dialogues , 2006, INTERSPEECH.

[9]  Frédéric Béchet,et al.  Speaker Role Recognition Using Question Detection and Characterization , 2011, INTERSPEECH.

[10]  Yashwant Prasad Singh,et al.  Multi-class Support Vector Machine (SVM) Classifiers -- An Application in Hypothyroid Detection and Classification , 2011, 2011 Sixth International Conference on Bio-Inspired Computing: Theories and Applications.

[11]  Chilin Shih,et al.  Comparison of declarative and interrogative intonation in Chinese , 2002, Speech Prosody 2002.

[12]  Jiahong Yuan,et al.  Detection of questions in Chinese conversational speech , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[13]  Waqas Anwar,et al.  An artificial neural network approach for sentence boundary disambiguation in urdu language text , 2015, Int. Arab J. Inf. Technol..

[14]  L. Cheded,et al.  A PRELIMINARY STUDY OF PROSODY-BASED DETECTION OF QUESTIONS IN ARABIC SPEECH MONOLOGUES , 2010 .

[15]  Andreas Stolcke,et al.  Enriching speech recognition with automatic detection of sentence boundaries and disfluencies , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Julia Hirschberg,et al.  Intonational cues to student questions in tutoring dialogs , 2006, INTERSPEECH.

[17]  Dilek Z. Hakkani-Tür,et al.  Any questions? Automatic question detection in meetings , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[18]  Klaus R. Scherer,et al.  The role of intonation in emotional expressions , 2005, Speech Commun..

[19]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[20]  Andrew Olney,et al.  Identifying Teacher Questions Using Automatic Speech Recognition in Classrooms , 2016, SIGDIAL Conference.

[21]  Shrikanth S. Narayanan,et al.  Toward detecting emotions in spoken dialogs , 2005, IEEE Transactions on Speech and Audio Processing.

[22]  Yang Liu,et al.  Automatic sentence boundary detection in conversational speech: A cross-lingual evaluation on English and Czech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[23]  Andreas Stolcke,et al.  Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.

[24]  Neeta Awasthy,et al.  Spectral Analysis of Speech: A New Technique , 2008 .

[25]  Bryan L. Pellom,et al.  The analysis and design of architecture systems for speech recognition on modern handheld-computing devices , 2003, First IEEE/ACM/IFIP International Conference on Hardware/ Software Codesign and Systems Synthesis (IEEE Cat. No.03TH8721).

[26]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[27]  D. Tribout,et al.  Multi-level information and automatic dialog act detection in human-human spoken dialogs , 2005, Speech Commun..

[28]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[29]  Shweta Shah,et al.  IMPROVE HCII BY USING GENDER AND EMOTION RECOGNITION THROUGH SPEECH SIGNAL : A SURVEY , 2016 .