Help Me Understand This Conversation: Methods of Identifying Implicit Links Between CSCL Contributions

Multi-participant chat conversations are one of the most frequently employed Computer Supported Collaborative Learning tools due to their ease of use. Moreover, chats enhance knowledge sharing, sustain creativity and aid in collaborative problem solving. Nevertheless, the manual analysis of multi-participant chats is a difficult task due to the mixture of different topics and the inter-twinning of multiple discussion threads during the same conversation. Several tools that employ Natural Language Processing techniques have been developed to automatically identify links between contributions in order to facilitate the tracking of topics and of discussion threads, as well as to highlight key contributions in terms of follow-up impact. This paper proposes a novel method for detecting implicit links based on features computed using string kernels and word embeddings, combined with neural networks. This method significantly outperforms previous results on the same dataset. Due to its smaller size, our model represents an alternative to more complex deep neural networks, especially when limited training data is available as is the case of CSCL chats in a specific domain.

[1]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[2]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[3]  J. Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM networks , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[4]  Traian Rebedea,et al.  Identifying Implicit Links in CSCL Chats Using String Kernels and Neural Networks , 2018, AIED.

[5]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[6]  Trevor Cohn,et al.  Learning Kernels over Strings using Gaussian Processes , 2017, IJCNLP.

[7]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[8]  Traian Rebedea,et al.  Unlocking the Power of Word2Vec for Identifying Implicit Links , 2017, 2017 IEEE 17th International Conference on Advanced Learning Technologies (ICALT).

[9]  Shuohang Wang,et al.  A Compare-Aggregate Model for Matching Text Sequences , 2016, ICLR.

[10]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[11]  Giuseppe Carenini,et al.  Chat Disentanglement: Identifying Semantic Reply Relationships with Random Forests and Recurrent Neural Networks , 2017, IJCNLP.

[12]  Gerry Stahl,et al.  Studying Virtual Math Teams , 2010 .

[13]  Traian Rebedea,et al.  A Polyphonic Model and System for Inter-animation Analysis in Chat Conversations with Multiple Participants , 2010, CICLing.

[14]  José Rodríguez,et al.  An Attention Mechanism for Neural Answer Selection Using a Combined Global and Local View , 2017, 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI).

[15]  Torsten Holmer,et al.  Explicit Referencing in Learning Chats: Needs and Acceptance , 2006, EC-TEL.

[16]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[17]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[18]  Traian Rebedea,et al.  Sentence selection with neural networks using string kernels , 2017, KES.

[19]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[20]  Bowen Zhou,et al.  Improved Representation Learning for Question Answer Matching , 2016, ACL.

[21]  Aoife Cahill,et al.  String Kernels for Native Language Identification: Insights from Behind the Curtains , 2016, CL.

[22]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[23]  Hang Li,et al.  Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[24]  Aoife Cahill,et al.  Can characters reveal your native language? A language-independent approach to native language identification , 2014, EMNLP.

[25]  Carlos Caldeira,et al.  Group Cognition: Computer Support for Building Collaborative Knowledge (Acting with Technology) , 2006 .

[26]  Micha Elsner,et al.  Disentangling Chat , 2010, CL.

[27]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[28]  Bowen Zhou,et al.  Attentive Pooling Networks , 2016, ArXiv.

[29]  Traian Rebedea,et al.  Time and Semantic Similarity - What is the Best Alternative to Capture Implicit Links in CSCL Conversations? , 2017, CSCL.