Automated Labelling of Dialogue Modes In Tutorial Dialogues

We present in this paper a study whose goal was to automatically label higher level constructs, called dialogue modes, in tutorial dialogues. Each tutorial dialogue is regarded as a sequence of utterances articulated by either the learner or the tutor. The dialogue utterances can be grouped into dialogue modes which correspond to general conversational phases such as dialogue openings, e.g. when the conversational partners greet each other, or serve specific pedagogical purposes, e.g. a scaffolding students’ problem solving process. Detecting dialogue modes is important because they can be used as an instrument to understand what good tutors do at a higher level of abstraction, thus, enabling more general conclusions about good tutoring. We propose an approach to the dialogue mode labeling problem based on Conditional Random Fields, a powerful machine learning technique for sequence labeling which has net advantages over alternatives such as Hidden Markov Models. The downside of the Condition Random Fields approach is that it requires annotated data while the Hidden Markov Models approach is unsupervised. The performance of the approach on a large data set of 1,438 tutoring sessions yielded very good results compared to human generated tags.

[1]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[2]  Vasile Rus,et al.  Building an Intelligent PAL from the Tutor.com Session Database Phase 1: Data Mining , 2014, EDM.

[3]  Norbert Reithinger,et al.  Utilizing Statistical Dialogue Act Processing in Verbrnobil , 1995, ACL.

[4]  Arthur C. Graesser,et al.  Automatic Discovery of Speech Act Categories in Educational Games , 2012, EDM.

[5]  Stellan Ohlsson,et al.  Toward a Computational Model of Expert Tutoring: A First Report , 2006, FLAIRS.

[6]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[7]  Csr Young,et al.  How to Do Things With Words , 2009 .

[8]  Takashi Yamauchi,et al.  Learning from human tutoring , 2001, Cogn. Sci..

[9]  Klaus Ries,et al.  HMM and neural network based speech act detection , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[10]  Arthur C. Graesser,et al.  Automated Speech Act Classification For Online Chat , 2011, MAICS.

[11]  Sidney K. D'Mello,et al.  Dialogue Modes in Expert Tutoring , 2008, Intelligent Tutoring Systems.

[12]  Sidney K. D'Mello,et al.  How Do They Do It? Investigating Dialogue Moves within Dialogue Modes in Expert Human Tutoring , 2012, ITS.

[13]  Arthur C. Graesser,et al.  Recent Advances in Conversational Intelligent Tutoring Systems , 2013, AI Mag..

[14]  Arthur C. Graesser,et al.  MetaTutor: Analyzing Self-Regulated Learning in a Tutoring System for Biology , 2009, AIED.

[15]  Gautam Biswas,et al.  Using Hidden Markov Models to Characterize Student Behaviors in Learning-by-Teaching Environments , 2008, Intelligent Tutoring Systems.

[16]  Srinivas Bangalore,et al.  Incremental Parsing Models for Dialog Task Structure , 2009, EACL.

[17]  Norbert Reithinger Some experiments in speech act prediction , 1994 .

[18]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[19]  J. Sadock Speech acts , 2007 .

[20]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[21]  Barbara Di Eugenio,et al.  FLSA: Extending Latent Semantic Analysis with Features for Dialogue Act Classification , 2004, ACL.

[22]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[23]  Peter Wiemer-Hastings,et al.  Classification of Speech Acts in Tutorial Dialog , 2000 .