Extracting semantic relations using syntax

Most common methods for automatic text analysis in communication science ignore syntactic information, focusing on the occurrence and co-occurrence of individual words, and sometimes n-grams. This is remarkably effective for some purposes, but poses a limitation for fine-grained analyses into semantic relations such as who does what to whom and according to what source. One tested, effective method for moving beyond this bag-of-words assumption is to use a rule-based approach for labeling and extracting syntactic patterns in dependency trees. Although this method can be used for a variety of purposes, its application is hindered by the lack of dedicated and accessible tools. In this paper we introduce the rsyntax R package, which is designed to make working with dependency trees easier and more intuitive for R users, and provides a framework for combining multiple rules for reliably extracting useful semantic relations.

[1]  Wouter van Atteveldt,et al.  Clause Analysis: Using Syntactic Information to Automatically Extract Source, Subject, and Predicate from Texts with an Application to the 2008–2009 Gaza War , 2017, Political Analysis.

[2]  Meng Zhang,et al.  Neural Network Methods for Natural Language Processing , 2017, Computational Linguistics.

[3]  Chen Gui,et al.  A Rule-Based Approach to Aspect Extraction from Product Reviews , 2014, SocialNLP@COLING.

[4]  R. Koopmans,et al.  Political Claims Analysis: Integrating Protest Event and Political Discourse Approaches , 1999 .

[5]  Milan Straka,et al.  Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe , 2017, CoNLL.

[6]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[7]  Damian Trilling,et al.  Taking Stock of the Toolkit , 2016, Rethinking Research Methods in an Age of Digital Journalism.

[8]  Sampo Pyysalo,et al.  Universal Dependencies v1: A Multilingual Treebank Collection , 2016, LREC.

[9]  Luke S. Zettlemoyer,et al.  Deep Semantic Role Labeling: What Works and What’s Next , 2017, ACL.

[10]  Wei Xu,et al.  End-to-end learning of semantic role labeling using recurrent neural networks , 2015, ACL.

[11]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[12]  Preslav Nakov,et al.  SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[13]  Sampo Pyysalo,et al.  SETS: Scalable and Efficient Tree Search in Dependency Graphs , 2015, HLT-NAACL.

[14]  Noah A. Smith,et al.  Dependency Parsing , 2009, Encyclopedia of Artificial Intelligence.

[15]  Christopher D. Manning,et al.  Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks , 2016, LREC.

[16]  Jan Kleinnijenhuis,et al.  A Theory of Evaluative Discourse: Towards a Graph Theory of Journalistic Texts , 1986 .

[17]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[18]  Mark Johnson,et al.  An Improved Non-monotonic Transition System for Dependency Parsing , 2015, EMNLP.

[19]  Advaith Siddharthan,et al.  Hybrid text simplification using synchronous dependency grammars with hand-written and automatically harvested rules , 2014, EACL.

[20]  Simon Clematide,et al.  Electoral Campaigns and Relation Mining: Extracting Semantic Network Data from Newspaper Articles , 2011 .

[21]  Jonathan Nagler,et al.  Methodological Challenges in Estimating Tone: Application to News Coverage of the U.S. Economy , 2016 .

[22]  David G. Rand,et al.  Structural Topic Models for Open‐Ended Survey Responses , 2014, American Journal of Political Science.

[23]  Wouter van Atteveldt,et al.  The Combined Effects of Mass Media and Social Media on Political Perceptions and Preferences , 2019 .

[24]  Charles E. Osgood,et al.  Evaluative assertion analysis. , 1956 .

[25]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[26]  Yoav Goldberg,et al.  Syntactic Search by Example , 2020, ACL.

[27]  Ralf Zimmer,et al.  RelEx - Relation extraction using dependency parse trees , 2007, Bioinform..

[28]  Z. Harris Co-Occurrence and Transformation in Linguistic Structure , 1957 .