论文信息 - How to Classify Tutorial Dialogue? Comparing Feature Vectors vs. Sequences

How to Classify Tutorial Dialogue? Comparing Feature Vectors vs. Sequences

A key issue in using machine learning to classify tutorial dialogues is how to represent time-varying data. Standard classifiers take as input a feature vector and output its predicted label. It is possible to formulate tutorial dialogue classification problems in this way. However, a feature vector representation requires mapping a dialogue onto a fixed number of features, and does not innately exploit its sequential nature. In contrast, this paper explores a recent method that classifies sequences, using a technique new to the Educational Data Mining community – Hidden Conditional Random Fields [Quattoni et al., 2007]. We illustrate its application to a data set from Project LISTEN's Reading Tutor, and compare it to three baselines using the same data, crossvalidation splits, and feature set. Our technique produces state-of-the-art classification accuracy in predicting reading task completion. We consider the contributions of this paper to be (i) introducing HCRFs to the EDM community, (ii) formulating tutorial dialogue classification as a sequence classification problem, and (iii) evaluating and comparing dialogue classification.

Jack Mostow | José P. González-Brenes | Weisi Duan | Jack Mostow | Weisi Duan

[1] L. Breiman. CONSISTENCY FOR A SIMPLE MODEL OF RANDOM FORESTS , 2004 .

[2] Kurt VanLehn,et al. Do Micro-Level Tutorial Decisions Matter: Applying Reinforcement Learning to Induce Pedagogical Tutorial Tactics , 2010, Intelligent Tutoring Systems.

[3] Sebastian Möller,et al. Evaluating spoken dialogue systems according to de-facto standards: A case study , 2007, Comput. Speech Lang..

[4] Marilyn A. Walker,et al. Reinforcement Learning for Spoken Dialogue Systems , 1999, NIPS.

[5] Joseph E. Beck,et al. Engagement tracing: using response times to model student disengagement , 2005, AIED.

[6] Kurt VanLehn,et al. Reinforcement Learning-based Feature Seleciton For Developing Pedagogically Effective Tutorial Dialogue Tactics , 2008, EDM.

[7] Sebastian Möller,et al. Predicting the quality and usability of spoken dialogue services , 2008, Speech Commun..

[8] Trevor Darrell,et al. Hidden Conditional Random Fields , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Charles M. Reigeluth,et al. Instructional Design Theories and Models : An Overview of Their Current Status , 1983 .

[10] Alan W. Black,et al. Describing Spoken Dialogue Systems Differences , 2008 .

[11] Michael I. Jordan,et al. Multi-task feature selection , 2006 .

[12] Nello Cristianini,et al. Classification using String Kernels , 2000 .

[13] Marilyn A. Walker,et al. Towards developing general models of usability with PARADISE , 2000, Natural Language Engineering.

[14] Mei-Yuh Hwang,et al. The SPHINX-II speech recognition system: an overview , 1993, Comput. Speech Lang..

[15] Adam L. Berger,et al. A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[16] Hua Ai,et al. Comparing User Simulation Models For Dialog Strategy Learning , 2007, HLT-NAACL.

[17] Joseph E. Beck,et al. High-Level Student Modeling with Machine Learning , 2000, Intelligent Tutoring Systems.

[18] H. Zou,et al. Regularization and variable selection via the elastic net , 2005 .

[19] Jack Mostow,et al. Evaluating tutors that listen: an overview of project LISTEN , 2001 .

[20] Hanna M. Wallach,et al. Topic modeling: beyond bag-of-words , 2006, ICML.

[21] Regina Barzilay,et al. Gestural Cohesion for Topic Segmentation , 2008, ACL.

[22] Jack Mostow,et al. Classifying dialogue in high-dimensional space , 2011, TSLP.

[23] Kevin P. Murphy,et al. Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[24] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[25] Trevor Darrell,et al. Hidden Conditional Random Fields for Gesture Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[26] Jérôme Louradour,et al. SVM Speaker Verification using an Incomplete Cholesky Decomposition Sequence Kernel , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.

[27] Kristy Elizabeth Boyer,et al. Characterizing the Effectiveness of Tutorial Dialogue with Hidden Markov Models , 2010, Intelligent Tutoring Systems.

[28] Melita Hajdinjak,et al. The PARADISE Evaluation Framework: Issues and Findings , 2006, Computational Linguistics.

[29] Jian Pei,et al. A brief survey on sequence classification , 2010, SKDD.

[30] Alex Acero,et al. Hidden conditional random fields for phone classification , 2005, INTERSPEECH.

[31] J M Bland,et al. Weighted comparison of means , 1998, BMJ.

[32] Jack Mostow,et al. Predicting Task Completion from Rich but Scarce Data , 2010, EDM.

[33] Andreas Stolcke,et al. Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.