New Resources and Perspectives for Biomedical Event Extraction

Event extraction is a major focus of recent work in biomedical information extraction. Despite substantial advances, many challenges remain for the reliable automatic extraction of events from text. We introduce a new biomedical event extraction resource consisting of analyses automatically created by systems participating in the recent BioNLP Shared Task (ST) 2011. By providing, for the first time, the outputs of a broad set of state-of-the-art event extraction systems, this resource opens many new opportunities for studying aspects of event extraction, from identifying common errors to finding effective ways of combining the strengths of different systems. We demonstrate these opportunities through a multi-system analysis of three BioNLP ST 2011 main tasks, focusing on events that none of the systems can successfully extract. We further argue for new perspectives on the performance evaluation of domain event extraction systems, considering a document-level, "off-the-page" representation and evaluation to complement the mention-level evaluations pursued in most recent work.
