Learning Biological Processes with Global Constraints

Biological processes are complex phenomena involving a series of events that are related to one another through various relationships. Systems that can understand and reason over biological processes would dramatically improve the performance of semantic applications involving inference such as question answering (QA) ‐ specifically “How?” and “Why?” questions. In this paper, we present the task of process extraction, in which events within a process and the relations between the events are automatically extracted from text. We represent processes by graphs whose edges describe a set of temporal, causal and co-reference event-event relations, and characterize the structural properties of these graphs (e.g., the graphs are connected). Then, we present a method for extracting relations between the events, which exploits these structural properties by performing joint inference over the set of extracted relations. On a novel dataset containing 148 descriptions of biological processes (released with this paper), we show significant improvement comparing to baselines that disregard process structure.

[1]  Vincent Ng,et al.  Classifying Temporal Relations with Rich Linguistic Knowledge , 2013, NAACL.

[2]  Dan Roth,et al.  Joint Inference for Event Timeline Construction , 2012, EMNLP.

[3]  Ido Dagan,et al.  Learning Entailment Relations by Global Graph Structure Optimization , 2012, CL.

[4]  D. Sontag 1 Introduction to Dual Decomposition for Inference , 2010 .

[5]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[6]  Christopher D. Manning,et al.  Learning Constraints for Consistent Timeline Extraction , 2012, EMNLP.

[7]  Christopher D. Manning,et al.  A Global Joint Model for Semantic Role Labeling , 2008, CL.

[8]  Tommi S. Jaakkola,et al.  Introduction to dual composition for inference , 2011 .

[9]  Regina Barzilay,et al.  Multi-Event Extraction Guided by Global Constraints , 2012, NAACL.

[10]  Christopher D. Manning,et al.  Enforcing Transitivity in Coreference Resolution , 2008, ACL.

[11]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[12]  E. Schegloff,et al.  Opening up Closings , 1973 .

[13]  Nathanael Chambers,et al.  Jointly Combining Implicit Constraints Improves Temporal Ordering , 2008, EMNLP.

[14]  SurdeanuMihai,et al.  Learning to rank answers to non-factoid questions from web collections , 2011 .

[15]  Pascal Denis,et al.  Predicting Globally-Coherent Temporal Structures from Texts via Endpoint Inference and Graph Decomposition , 2011, IJCAI.

[16]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[17]  Wanxiang Che,et al.  Effective Bilingual Constraints for Semi-Supervised Learning of Named Entity Recognizers , 2013, AAAI.

[18]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[19]  Andrew McCallum,et al.  Fast and Robust Joint Models for Biomedical Event Extraction , 2011, EMNLP.

[20]  Nathanael Chambers,et al.  Template-Based Information Extraction without the Templates , 2011, ACL.

[21]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[22]  Ming-Wei Chang,et al.  Structured learning with constrained conditional models , 2012, Machine Learning.

[23]  Alexander M. Rush,et al.  Improved Parsing and POS Tagging Using Inter-Sentence Consistency Constraints , 2012, EMNLP-CoNLL.

[24]  A. I.,et al.  Neural Field Continuum Limits and the Structure–Function Partitioning of Cognitive–Emotional Brain Networks , 2023, Biology.

[25]  Mihai Surdeanu,et al.  Event Extraction as Dependency Parsing , 2011, ACL.

[26]  Jackie Chi Kit Cheung,et al.  Probabilistic Frame Induction , 2013, NAACL.

[27]  Jari Björne,et al.  EXTRACTING CONTEXTUALIZED COMPLEX BIOLOGICAL EVENTS WITH RICH GRAPH‐BASED FEATURE SETS , 2011, Comput. Intell..

[28]  Eric P. Xing,et al.  Concise Integer Linear Programming Formulations for Dependency Parsing , 2009, ACL.

[29]  Hoifung Poon,et al.  Joint Inference for Knowledge Extraction from Biomedical Literature , 2010, NAACL.

[30]  Jun'ichi Tsujii,et al.  Event Extraction with Complex Event Classification Using Rich Features , 2010, J. Bioinform. Comput. Biol..

[31]  Mihai Surdeanu,et al.  Learning to Rank Answers to Non-Factoid Questions from Web Collections , 2011, CL.

[32]  Yuji Matsumoto,et al.  Jointly Identifying Temporal Relations with Markov Logic , 2009, ACL.

[33]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[34]  Sampo Pyysalo,et al.  Overview of BioNLP’09 Shared Task on Event Extraction , 2009, BioNLP@HLT-NAACL.