Statement of Permission to Use Robust Modeling of Dialog Acts Using Language and Speech Robust Modeling of Dialog Acts Using Language and Speech

I am submitting herewith a thesis written by Mohammad Shahed Sorower entitled " Robust Modeling of Dialog Acts using Language and Speech ". I have examined the final copy of this thesis for form and content and recommend that it be accepted in partial fulfillment of the requirements for the degree of Master of Science with a major in Electrical and Computer Engineering. In presenting this thesis as a partial fulfillment of the requirements for the master's degree at The Universiy of Memphis, I agree that the library shall make it available to borrowers under the rules of the library. Brief quotations from this thesis are allowable without special permission, provided that accurate acknowledgement of the source is made. Permission for extensive quotation from reproduction of this thesis may be granted by major professor, or in his absence, by the Head of the interlibrary services when in opinion of either, the proposed use of the material is for scholarly purposes. Any copying or use of the material in this thesis for financial gain shall not be allowed without my written permission. ii DEDICATION To all the teachers I had in my life ……… My mother, who had been my first teacher and a guide throughout my life. Also to all the teachers I had in my life, especially those I owe my achievements to beside my knowledge and enlightenment. iii ACKNOWLEDGEMENTS First, I would like to thank my supervisor Dr. Mohammed Yeasin for his support throughout this thesis, for all his patience and guidance. I would also like to thank Dr. for proving valuable guidance and suggestions for this work. I am also grateful to all my colleagues from Computer Vision, Pattern and Image Analysis (CVPIA) Lab for their helps with experiments, coding and sincere encouragements; special thanks to Mr. Gahangir Hossain for sincerely scrutinizing the complete thesis. I sincerely thank Human Communication Research Center (HCRC) at University of Edinburgh for sharing HCRC Map Task data for our use in this thesis. I also thank Dr. Max Louwerse for taking the procedure to get this data from University of Edinburgh and let us have access to that. I am also obliged to Herff College of Engineering, The University of Memphis, for offering me Herff Graduate Fellowship that was a great honor and encouragement for me. I would like to express my personal gratitude and appreciation for Mafruhatul Jannat, who always …

[1]  Arthur C. Graesser,et al.  AutoTutor: an intelligent tutoring system with mixed-initiative dialogue , 2005, IEEE Transactions on Education.

[2]  Adrian Akmajian,et al.  Linguistics: An Introduction to Language and Communication , 1979 .

[3]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  Steve Renals,et al.  Using Prosodic Features in Language Models for Meetings , 2007, MLMI.

[6]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[7]  Patrick F. Reidy An Introduction to Latent Semantic Analysis , 2009 .

[8]  Stefanie Shattuck-Hufnagel,et al.  The Use of Prosody in Syntactic Disambiguation , 1991, HLT.

[9]  Gwyneth Doherty-Sneddon,et al.  The Reliability of a Dialogue Structure Coding Scheme , 1997, CL.

[10]  Xiao Lin,et al.  Recognition of emotional state from spoken sentences , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[11]  Mohammad S. Sorower,et al.  WHAT SPEECH TELLS US ABOUT DISCOURSE: THE ROLE OF PROSODIC AND DISCOURSE FEATURES IN DIALOGUE ACT CLASSIFICATION , 2007 .

[12]  Jörg Helbig,et al.  Speech-Controlled Human Machine Interaction (Sprachgesteuerte Mensch-Maschine-Interaktion) , 2004, it Inf. Technol..

[13]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[14]  Anne H. Anderson,et al.  The Hcrc Map Task Corpus , 1991 .

[15]  Thomas G. Dietterich Machine-Learning Research Four Current Directions , 1997 .

[16]  Sebastian Zander,et al.  A preliminary performance comparison of five machine learning algorithms for practical IP traffic flow classification , 2006, CCRV.

[17]  E. Brill A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[18]  Gina-Anne Levow,et al.  Dialog act tagging with support vector machines and hidden Markov models , 2006, INTERSPEECH.

[19]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[20]  Mark Liberman,et al.  Towards an integrated understanding of speaking rate in conversation , 2006, INTERSPEECH.

[21]  P Taylor,et al.  Intonation and dialogue context as constraints for speech recognition , 1998 .

[22]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[23]  Julia Hirschberg,et al.  A Prosodic Analysis of Discourse Segments in Direction-Giving Monologues , 1996, ACL.

[24]  Arthur C. Graesser,et al.  When Are Tutorial Dialogues More Effective Than Reading? , 2007, Cogn. Sci..

[25]  Alan W. Black,et al.  Three methods of intonation modeling , 1998, SSW.

[26]  C. J. Whitaker,et al.  Ten measures of diversity in classifier ensembles: limits for two classifiers , 2001 .

[27]  A. U.S. ToBI Prosodic Analysis of a Professional Speaker of American English , 2004 .

[28]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[29]  Csr Young,et al.  How to Do Things With Words , 2009 .

[30]  Elmar Nöth,et al.  Dialog act classification with the help of prosody , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[31]  Niall M. Adams,et al.  Improving the Practice of Classifier Performance Assessment , 2000, Neural Computation.

[32]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[33]  Mohammed Yeasin,et al.  Robust Classification of Dialog Acts from the Transcription of Utterances , 2007, International Conference on Semantic Computing (ICSC 2007).

[34]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[35]  Julia Hirschberg,et al.  Using Machine Learning to Identify Intonational Segments , 1998 .

[36]  Arthur C. Graesser,et al.  Intelligent Tutoring Systems with Conversational Dialogue , 2001, AI Mag..

[37]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[38]  Rosalind W. Picard,et al.  Dialog Act Classification from Prosodic Features Using Support Vector Machines , 2002 .

[39]  Rebecca J. Passonneau,et al.  Combining Multiple Knowledge Sources for Discourse Segmentation , 1995, ACL.

[40]  Stefan Wermter,et al.  Learning dialog act processing , 1996, COLING.

[41]  Mohammed Yeasin,et al.  Robust Recognition of Emotion from Speech , 2006, IVA.

[42]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[43]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[44]  Detlef Koll,et al.  Probabilistic dialogue act extraction for concept based multilingual translation systems , 1998, ICSLP.

[45]  Max M. Louwerse,et al.  Dialog Act Classification Using N-Gram Algorithms , 2006, FLAIRS.

[46]  T. Seppänen,et al.  Creation and utilisation of the MediaTeam Emotional Speech Corpus , 2003 .

[47]  Mari Ostendorf,et al.  Prosodic and lexical indications of discourse structure in human-machine interactions , 1997, Speech Commun..

[48]  Ján Horecký,et al.  The content and form of illocutionary acts , 1996 .

[49]  J Allan,et al.  Readings in information retrieval. , 1998 .

[50]  Klaus Ries,et al.  HMM and neural network based speech act detection , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[51]  Mohammed Yeasin,et al.  Speech-gesture driven multimodal interfaces for crisis management , 2003, Proc. IEEE.