Sentence Simplification for Semantic Role Labelling and Information Extraction

In this paper, we report on the extrinsic evaluation of an automatic sentence simplification method with respect to two NLP tasks: semantic role labelling (SRL) and information extraction (IE). The paper begins with our observation of challenges in the intrinsic evaluation of sentence simplification systems, which motivates the use of extrinsic evaluation of these systems with respect to other NLP tasks. We describe the two NLP systems and the test data used in the extrinsic evaluation, and present arguments and evidence motivating the integration of a sentence simplification step as a means of improving the accuracy of these systems. Our evaluation reveals that their performance is improved by the simplification step: the SRL system is better able to assign semantic roles to the majority of the arguments of verbs and the IE system is better able to identify fillers for all IE template slots.

[1]  Son Bao Pham,et al.  Learning to Simplify Children Stories with Limited Data , 2014, ACIIDS.

[2]  Emiel Krahmer,et al.  Sentence Simplification by Monolingual Machine Translation , 2012, ACL.

[3]  Richard J. Evans,et al.  Comparing methods for the syntactic simplification of sentences in information extraction , 2011, Literary and Linguistic Computing.

[4]  B. Lyxell,et al.  Looking at text simplification-Using eye tracking to evaluate the readability of automatically simplified sentences Linnea , 2018 .

[5]  Raman Chandrasekar,et al.  Automatic induction of rules for text simplification , 1997, Knowl. Based Syst..

[6]  David Kauchak,et al.  Sentence Simplification as Tree Transduction , 2013, PITR@ACL.

[7]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[8]  Ani Nenkova,et al.  Syntactic Simplification for Improving Content Selection in Multi-Document Summarization , 2004, COLING.

[9]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[10]  Daphne Koller,et al.  Sentence Simplification for Semantic Role Labeling , 2008, ACL.

[11]  Sara Tonelli,et al.  MUSST: A Multilingual Syntactic Simplification Tool , 2017, IJCNLP.

[12]  Tomás Jelínek Improvements to Dependency Parsing Using Automatic Simplification of Data , 2014, LREC.

[13]  W. Kintsch,et al.  The construction-integration model: A framework for studying memory for text. , 1991 .

[14]  Chris Callison-Burch,et al.  Optimizing Statistical Machine Translation for Text Simplification , 2016, TACL.

[15]  Arthur C. Graesser,et al.  Automated Evaluation of Text and Discourse with Coh-Metrix: Introduction , 2014 .

[16]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[17]  Sigrid Klerke,et al.  Looking hard: Eye tracking for detecting grammaticality of automatically compressed sentences , 2015, NODALIDA.

[18]  Ruslan Mitkov,et al.  The Oxford handbook of computational linguistics , 2003 .

[19]  Yorick Wilks,et al.  The METER corpus : a corpus for analysing journalistic text reuse , 2001 .

[20]  Richard Evans,et al.  A Tagging Approach to Identify Complex Constituents for Text Simplification , 2013, RANLP.

[21]  Ruslan Mitkov,et al.  Intelligent Text Processing to Help Readers with Autism , 2018 .

[22]  G. Seth Psychology of Language , 1968, Nature.

[23]  David West,et al.  UNC-CH at DUC 2007: Query Expansion, Lexical Simplification and Sentence Selection Strategies for Multi-Document Summarization , 2007 .

[24]  Richard Evans,et al.  Identifying signs of syntactic complexity for rule-based sentence simplification , 2018, Natural Language Engineering.

[25]  André Freitas,et al.  A Sentence Simplification System for Improving Relation Extraction , 2016, COLING.

[26]  Yifan Peng,et al.  iSimp: A sentence simplification system for biomedicail text , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[27]  Deirdre Hogan,et al.  Coordinate Noun Phrase Disambiguation in a Generative Parsing Model , 2007, ACL.

[28]  P. Broek,et al.  Processing and memory of central versus peripheral information as a function of reading goals: evidence from eye-movements , 2015 .

[29]  Tadashi Nomoto,et al.  Lexico-syntactic text simplification and compression with typed dependencies , 2014, COLING.

[30]  Siddhartha Jonnalagadda,et al.  Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text , 2009, HLT-NAACL.

[31]  Yvonne Margaret Canning,et al.  Syntactic simplification of text , 2002 .

[32]  G. Waters,et al.  Verbal working memory and sentence comprehension , 1999, Behavioral and Brain Sciences.

[33]  David R. Dowty Thematic proto-roles and argument selection , 1991 .

[34]  Adrià de Gispert,et al.  Source sentence simplification for statistical machine translation , 2017, Comput. Speech Lang..

[35]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[36]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[37]  Goran Glavaš,et al.  Event-centered simplication of news stories , 2013 .