论文信息 - Cross-topic Argument Mining from Heterogeneous Sources

Cross-topic Argument Mining from Heterogeneous Sources

Argument mining is a core technology for automating argument search in large document collections. Despite its usefulness for this task, most current approaches are designed for use only with specific text types and fall short when applied to heterogeneous texts. In this paper, we propose a new sentential annotation scheme that is reliably applicable by crowd workers to arbitrary Web texts. We source annotations for over 25,000 instances covering eight controversial topics. We show that integrating topic information into bidirectional long short-term memory networks outperforms vanilla BiLSTMs by more than 3 percentage points in F1 in two- and three-label cross-topic settings. We also show that these results can be further improved by leveraging additional data for topic relevance using multi-task learning.

[1] O. Svenson. Process descriptions of decision making. , 1979 .

[2] Jean Carletta,et al. Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[4] Jürgen Schmidhuber,et al. Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[5] Marie-Francine Moens,et al. Argumentation mining: the detection, classification and structure of arguments in text , 2009, ICAIL.

[6] Keiichi Kobayashi,et al. Comprehension of relations among controversial texts: effects of external strategy use , 2009 .

[7] Bernard Moulin,et al. A taxonomy of argumentation models used for knowledge representation , 2010, Artificial Intelligence Review.

[8] Marie-Francine Moens,et al. Approaches to Text Mining Arguments from Legal Cases , 2010, Semantic Processing of Legal Texts.

[9] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10] Jukka Zitting,et al. Tika in Action , 2011 .

[11] Ursula Wingate,et al. ‘Argument!’ helping students understand what essay writing is about , 2012 .

[12] Dirk Hovy,et al. Learning Whom to Trust with MACE , 2013, NAACL.

[13] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[14] Iryna Gurevych,et al. Argumentation Mining on the Web from Information Seeking Perspective , 2014, ArgNLP.

[15] Noam Slonim,et al. Context Dependent Claim Detection , 2014, COLING.

[16] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.