CAWA: An Attention-Network for Credit Attribution

Credit attribution is the task of associating individual parts in a document with their most appropriate class labels. It is an important task with applications to information retrieval and text summarization. When labeled training data is available, traditional approaches for sequence tagging can be used for credit attribution. However, generating such labeled datasets is expensive and time-consuming. In this paper, we present "Credit Attribution With Attention (CAWA)", a neural-network-based approach, that instead of using sentence-level labeled data, uses the set of class labels that are associated with an entire document as a source of distant-supervision. CAWA combines an attention mechanism with a multilabel classifier into an end-to-end learning framework to perform credit attribution. CAWA labels the individual sentences from the input document using the resultant attention-weights. CAWA improves upon the state-of-the-art credit attribution approach by not constraining a sentence to belong to just one class, but modeling each sentence as a distribution over all classes, leading to better modeling of semantically-similar classes. Experiments on the credit attribution task on a variety of datasets show that the sentence class labels generated by CAWA outperform the competing approaches. Additionally, on the multilabel text classification task, CAWA performs better than the competing credit attribution approaches.

[1]  B. Rost,et al.  A modified definition of Sov, a segment‐based measure for protein secondary structure prediction assessment , 1999, Proteins.

[2]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[3]  John D. Lafferty,et al.  Statistical Models for Text Segmentation , 1999, Machine Learning.

[4]  Johannes Fürnkranz,et al.  Large-Scale Multi-label Text Classification - Revisiting Neural Networks , 2013, ECML/PKDD.

[5]  Julia Hirschberg,et al.  Some intonational characteristics of discourse structure , 1992, ICSLP.

[6]  Susan T. Dumais,et al.  Partially labeled topic models for interpretable text mining , 2011, KDD.

[7]  B. Rost,et al.  Redefining the goals of protein secondary structure prediction. , 1994, Journal of molecular biology.

[8]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.

[9]  George Karypis,et al.  Text Segmentation on Multilabel Documents: A Distant-Supervised Approach , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[10]  Regina Barzilay,et al.  Rationalizing Neural Predictions , 2016, EMNLP.

[11]  Freddy Y. Y. Choi Advances in domain independent linear text segmentation , 2000, ANLP.

[12]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[13]  Brendan T. O'Connor,et al.  Learning Latent Personas of Film Characters , 2013, ACL.

[14]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[17]  Vasudeva Varma,et al.  Attention-Based Neural Text Segmentation , 2018, ECIR.

[18]  Arkaitz Zubiaga,et al.  Content-Based Clustering for Tag Cloud Visualization , 2009, 2009 International Conference on Advances in Social Network Analysis and Mining.

[19]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[20]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[21]  Mirella Lapata,et al.  Multiple Instance Learning Networks for Fine-Grained Sentiment Analysis , 2017, TACL.

[22]  Manabu Okumura,et al.  Text Segmentation with Multiple Surface Linguistic Cues , 1998, COLING-ACL.

[23]  Manabu Okumura,et al.  Text Segmentation with Multiple Surface Linguistic Cues , 1999, COLING.

[24]  Gökhan Tür,et al.  Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation , 2001, CL.

[25]  Chris Buckley,et al.  OHSUMED: an interactive retrieval evaluation and new large test collection for research , 1994, SIGIR '94.

[26]  Jonathan Berant,et al.  Text Segmentation as a Supervised Learning Task , 2018, NAACL.

[27]  David J. Miller,et al.  Semisupervised, Multilabel, Multi-Instance Learning for Structured Data , 2017, Neural Computation.

[28]  Goran Glavas,et al.  Unsupervised Text Segmentation Using Semantic Relatedness Graphs , 2016, *SEMEVAL.