Identifying Implicit Links in CSCL Chats Using String Kernels and Neural Networks

Chat conversations between more than two participants are often used in Computer Supported Collaborative Learning (CSCL) scenarios because they enhance collaborative knowledge sharing and sustain creativity. However, multi-participant chats are more difficult to follow and analyze due to the complex ways in which different discussion threads and topics can interact. This paper introduces a novel method based on neural networks for detecting implicit links that uses features computed with string kernels and word embeddings. In contrast to previous experiments with an accuracy of 33%, we obtained a considerable increase to 44% for the same dataset. Our method represents an alternative to more complex deep neural networks that cannot be properly used due to overfitting on limited training data.

[1]  Trevor Cohn,et al.  Learning Kernels over Strings using Gaussian Processes , 2017, IJCNLP.

[2]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[3]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[4]  Torsten Holmer,et al.  Explicit Referencing in Learning Chats: Needs and Acceptance , 2006, EC-TEL.

[5]  Traian Rebedea,et al.  Time and Semantic Similarity - What is the Best Alternative to Capture Implicit Links in CSCL Conversations? , 2017, CSCL.

[6]  Traian Rebedea,et al.  Unlocking the Power of Word2Vec for Identifying Implicit Links , 2017, 2017 IEEE 17th International Conference on Advanced Learning Technologies (ICALT).

[7]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[8]  Micha Elsner,et al.  Disentangling Chat , 2010, CL.

[9]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[10]  Hang Li,et al.  Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[11]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[12]  Traian Rebedea,et al.  A Polyphonic Model and System for Inter-animation Analysis in Chat Conversations with Multiple Participants , 2010, CICLing.

[13]  Traian Rebedea,et al.  Sentence selection with neural networks using string kernels , 2017, KES.

[14]  Giuseppe Carenini,et al.  Chat Disentanglement: Identifying Semantic Reply Relationships with Random Forests and Recurrent Neural Networks , 2017, IJCNLP.

[15]  Aoife Cahill,et al.  Can characters reveal your native language? A language-independent approach to native language identification , 2014, EMNLP.