Identification of Question and Non-Question Segments in Arabic Monologue Based on Prosodic Features Using Type-2 Fuzzy Logic Systems

In this work, we propose the use of type-2 fuzzy logic systems (type-2 FLS) to identify question and nonquestion segments in an Arabic monologue based on prosodic features. Prosody has been widely used in many speech-related applications including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech applications. An important application investigated is that of identifying question sentences in Arabic Monologue Lectures. Languages, other than Arabic, have received a lot of attention in this regard, hence the need for this research work concentrating on the Arabic language. Having first segmented the sentences from the continuous speech using energy and duration features, prosodic features are, then, extracted from each sentence. These features are used as input to the two proposed classifiers to classify each sentence into either Question or Non Question sentence. Results from this work have been compared with the previously used support vector machine (SVM), and the outputs indicate that the proposed type-2 FLS model outperforms SVM for all the experiments carried out, mainly due to its superior ability to handle uncertainties in the feature set.

[1]  J. Mendel,et al.  Overcoming time-varying co-channel interference using type-2 fuzzy adaptive filters , 2000 .

[2]  Shrikanth S. Narayanan,et al.  Toward detecting emotions in spoken dialogs , 2005, IEEE Transactions on Speech and Audio Processing.

[3]  Vladik Kreinovich,et al.  Interval-Valued Degrees of Belief: Applications of Interval Computations to Expert Systems and Intelligent Control , 1997, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[4]  Jerry M. Mendel,et al.  Type-2 fuzzy logic systems , 1999, IEEE Trans. Fuzzy Syst..

[5]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[6]  Rosalind W. Picard,et al.  Dialog Act Classification from Prosodic Features Using Support Vector Machines , 2002 .

[7]  Jerry M. Mendel,et al.  Equalization of nonlinear time-varying channels using type-2 fuzzy adaptive filters , 2000, IEEE Trans. Fuzzy Syst..

[8]  M. Swerts Prosodic features at discourse boundaries of different strength. , 1997, The Journal of the Acoustical Society of America.

[9]  Wasfi G. Al-Khatib,et al.  Detection of Questions in Arabic Audio Monologues Using Prosodic Features , 2007, Ninth IEEE International Symposium on Multimedia (ISM 2007).

[10]  Lotfi A. Zadeh,et al.  The Concepts of a Linguistic Variable and its Application to Approximate Reasoning , 1975 .

[11]  Jerry M. Mendel,et al.  Interval type-2 fuzzy logic systems , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[12]  E. Cox Adaptive fuzzy systems , 1993, IEEE Spectrum.

[13]  Jerry M. Mendel,et al.  Type-2 fuzzy sets made simple , 2002, IEEE Trans. Fuzzy Syst..

[14]  Andreas Stolcke,et al.  Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.

[15]  Jerry M. Mendel,et al.  Fuzzy sets for words: a new beginning , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[16]  Jerry M. Mendel,et al.  Connection admission control in ATM networks using survey-based type-2 fuzzy logic systems , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[17]  Seungsoo Lee,et al.  A Ranking Method for Type-2 Fuzzy Values , 2002 .

[18]  Jerry M. Mendel,et al.  A vector similarity measure for linguistic approximation: Interval type-2 and type-1 fuzzy sets , 2008, Inf. Sci..

[19]  J. Mendel Uncertain Rule-Based Fuzzy Logic Systems: Introduction and New Directions , 2001 .

[20]  Mohammad Hossein Fazel Zarandi,et al.  A type-2 fuzzy rule-based expert system model for stock price analysis , 2009, Expert Syst. Appl..

[21]  Silvia Pfeiffer,et al.  Pause concepts for audio segmentation at different semantic levels , 2001, MULTIMEDIA '01.