Cross-topic Argument Mining from Heterogeneous Sources

Argument mining is a core technology for automating argument search in large document collections. Despite its usefulness for this task, most current approaches are designed for use only with specific text types and fall short when applied to heterogeneous texts. In this paper, we propose a new sentential annotation scheme that is reliably applicable by crowd workers to arbitrary Web texts. We source annotations for over 25,000 instances covering eight controversial topics. We show that integrating topic information into bidirectional long short-term memory networks outperforms vanilla BiLSTMs by more than 3 percentage points in F1 in two- and three-label cross-topic settings. We also show that these results can be further improved by leveraging additional data for topic relevance using multi-task learning.

[1]  O. Svenson Process descriptions of decision making. , 1979 .

[2]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[3]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[4]  Jürgen Schmidhuber,et al.  Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[5]  Marie-Francine Moens,et al.  Argumentation mining: the detection, classification and structure of arguments in text , 2009, ICAIL.

[6]  Keiichi Kobayashi,et al.  Comprehension of relations among controversial texts: effects of external strategy use , 2009 .

[7]  Bernard Moulin,et al.  A taxonomy of argumentation models used for knowledge representation , 2010, Artificial Intelligence Review.

[8]  Marie-Francine Moens,et al.  Approaches to Text Mining Arguments from Legal Cases , 2010, Semantic Processing of Legal Texts.

[9]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10]  Jukka Zitting,et al.  Tika in Action , 2011 .

[11]  Ursula Wingate,et al.  ‘Argument!’ helping students understand what essay writing is about , 2012 .

[12]  Dirk Hovy,et al.  Learning Whom to Trust with MACE , 2013, NAACL.

[13]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[14]  Iryna Gurevych,et al.  Argumentation Mining on the Web from Information Seeking Perspective , 2014, ArgNLP.

[15]  Noam Slonim,et al.  Context Dependent Claim Detection , 2014, COLING.

[16]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[17]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[18]  Iryna Gurevych,et al.  Identifying Argumentative Discourse Structures in Persuasive Essays , 2014, EMNLP.

[19]  Vangelis Karkaletsis,et al.  Argument Extraction from News, Blogs, and Social Media , 2014, SETN.

[20]  Mitesh M. Khapra,et al.  Show Me Your Evidence - an Automatic Method for Context Dependent Evidence Detection , 2015, EMNLP.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[23]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[24]  Phil Blunsom,et al.  Reasoning about Entailment with Neural Attention , 2015, ICLR.

[25]  Diane J. Litman,et al.  Context-aware Argumentative Relation Mining , 2016, ACL.

[26]  Shuohang Wang,et al.  Learning Natural Language Inference with LSTM , 2015, NAACL.

[27]  Larry P. Heck,et al.  Contextual LSTM (CLSTM) models for Large scale NLP tasks , 2016, ArXiv.

[28]  Matthias Hagen,et al.  Cross-Domain Mining of Argumentative Text through Distant Supervision , 2016, NAACL.

[29]  Iryna Gurevych,et al.  New Collection Announcement: Focused Retrieval Over the Web , 2016, SIGIR.

[30]  Saif Mohammad,et al.  SemEval-2016 Task 6: Detecting Stance in Tweets , 2016, *SEMEVAL.

[31]  Iryna Gurevych,et al.  What is the Essence of a Claim? Cross-Domain Claim Identification , 2017, EMNLP.

[32]  Xuanjing Huang,et al.  Adversarial Multi-Criteria Learning for Chinese Word Segmentation , 2017, ACL.

[33]  Iryna Gurevych,et al.  Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging , 2017, EMNLP.

[34]  Christian Stab,et al.  Argumentative Writing Support by means of Natural Language Processing , 2017 .

[35]  Lu Wang,et al.  Understanding and Detecting Diverse Supporting Arguments on Controversial Issues , 2017, ACL.

[36]  Iryna Gurevych,et al.  Neural End-to-End Learning for Computational Argumentation Mining , 2017, ACL.

[37]  Xuanjing Huang,et al.  Adversarial Multi-task Learning for Text Classification , 2017, ACL.

[38]  Benno Stein,et al.  Unit Segmentation of Argumentative Texts , 2017, ArgMining@EMNLP.

[39]  Iryna Gurevych,et al.  Parsing Argumentation Structures in Persuasive Essays , 2016, CL.

[40]  Benno Stein,et al.  Building an Argument Search Engine for the Web , 2017, ArgMining@EMNLP.

[41]  Iryna Gurevych,et al.  ArgumenText: Searching for Arguments in Heterogeneous Sources , 2018, NAACL.

[42]  Chris Stahlhut Searching Arguments in German with ArgumenText , 2018, DESIRES.