Corpus Wide Argument Mining - a Working Solution

One of the main tasks in argument mining is the retrieval of argumentative content pertaining to a given topic. Most previous work addressed this task by retrieving a relatively small number of relevant documents as the initial source for such content. This line of research yielded moderate success, which is of limited use in a real-world system. Furthermore, for such a system to yield a comprehensive set of relevant arguments, over a wide range of topics, it requires leveraging a large and diverse corpus in an appropriate manner. Here we present a first end-to-end high-precision, corpus-wide argument mining system. This is made possible by combining sentence-level queries over an appropriate indexing of a very large corpus of newspaper articles, with an iterative annotation scheme. This scheme addresses the inherent label bias in the data and pinpoints the regions of the sample space whose manual labeling is required to obtain high-precision among top-ranked candidates.

[1]  Iryna Gurevych,et al.  Cross-topic Argument Mining from Heterogeneous Sources , 2018, EMNLP.

[2]  Noam Slonim,et al.  Context Dependent Claim Detection , 2014, COLING.

[3]  Serena Villata,et al.  Five Years of Argument Mining: a Data-driven Analysis , 2018, IJCAI.

[4]  Benno Stein,et al.  Building an Argument Search Engine for the Web , 2017, ArgMining@EMNLP.

[5]  Karin Baier,et al.  The Uses Of Argument , 2016 .

[6]  Iryna Gurevych,et al.  Neural End-to-End Learning for Computational Argumentation Mining , 2017, ACL.

[7]  Anirban Laha,et al.  An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks , 2016, COLING.

[8]  Yi Yang,et al.  Uncertainty Sampling for Action Recognition via Maximizing Expected Average Precision , 2018, IJCAI.

[9]  Jingbo Zhu,et al.  Active Learning for Word Sense Disambiguation with Methods for Addressing the Class Imbalance Problem , 2007, EMNLP.

[10]  Noam Slonim,et al.  Towards an argumentative content search engine using weak supervision , 2018, COLING.

[11]  Iryna Gurevych,et al.  Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidirectional LSTM , 2016, ACL.

[12]  Eyal Shnarch,et al.  Are You Convinced? Choosing the More Convincing Evidence with a Siamese Network , 2019, ACL.

[13]  Eyal Shnarch,et al.  Will it Blend? Blending Weak and Strong Labeled Data in a Neural Network for Argumentation Mining , 2018, ACL.

[14]  Graeme Hirst,et al.  Classifying arguments by scheme , 2011, ACL.

[15]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[16]  Francesca Toni,et al.  Identifying attack and support argumentative relations using deep learning , 2017, EMNLP.

[17]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[18]  Marie-Francine Moens,et al.  Automatic detection of arguments in legal texts , 2007, ICAIL.

[19]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[20]  Iryna Gurevych,et al.  Parsing Argumentation Structures in Persuasive Essays , 2016, CL.

[21]  Noam Slonim,et al.  Fast End-to-End Wikification , 2019, ArXiv.

[22]  Noam Slonim,et al.  Unsupervised corpus–wide claim detection , 2017, ArgMining@EMNLP.

[23]  Claire Cardie,et al.  Identifying Appropriate Support for Propositions in Online User Comments , 2014, ArgMining@ACL.

[24]  Jan Snajder,et al.  Back up your Stance: Recognizing Arguments in Online Discussions , 2014, ArgMining@ACL.

[25]  Taghi M. Khoshgoftaar,et al.  Experimental perspectives on learning from imbalanced data , 2007, ICML '07.

[26]  Iryna Gurevych,et al.  Cross-topic Argument Mining from Heterogeneous Sources Using Attention-based Neural Networks , 2018, ArXiv.

[27]  Marie-Francine Moens,et al.  Argumentation mining: the detection, classification and structure of arguments in text , 2009, ICAIL.

[28]  Vincent Ng,et al.  End-to-End Argumentation Mining in Student Essays , 2016, NAACL.

[29]  Mitesh M. Khapra,et al.  Show Me Your Evidence - an Automatic Method for Context Dependent Evidence Detection , 2015, EMNLP.

[30]  ˇ FilipBoltu Back up your Stance: Recognizing Arguments in Online Discussions , 2014 .

[31]  José Salvador Sánchez,et al.  Theoretical Analysis of a Performance Measure for Imbalanced Data , 2010, 2010 20th International Conference on Pattern Recognition.

[32]  Chen Wang,et al.  Introducing LUIMA: an experiment in legal conceptual retrieval of vaccine injury decisions using a UIMA type system and tools , 2015, ICAIL.

[33]  Chris Reed,et al.  An Online Annotation Assistant for Argument Schemes , 2019, LAW@ACL.

[34]  Marie-Francine Moens,et al.  Approaches to Text Mining Arguments from Legal Cases , 2010, Semantic Processing of Legal Texts.

[35]  Matthias Hagen,et al.  Cross-Domain Mining of Argumentative Text through Distant Supervision , 2016, NAACL.

[36]  Roy Bar-Haim,et al.  From Surrogacy to Adoption; From Bitcoin to Cryptocurrency: Debate Topic Expansion , 2019, ACL.

[37]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[38]  K. Vijay-Shanker,et al.  Taking into Account the Differences between Actively and Passively Acquired Data: The Case of Active Learning with Support Vector Machines for Imbalanced Datasets , 2009, NAACL.

[39]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[40]  Nitesh V. Chawla,et al.  SPECIAL ISSUE ON LEARNING FROM IMBALANCED DATA SETS , 2004 .