Semantic Event Extraction from Biological Texts Using a Kernel-Based Method

As research into protein and gene interactions continues to produce vast amount of data, concerning to biological event, there is an increasing need to capture these results in structured formats allowing for computational analysis. Although many efforts have been focused to create databases that store this information in computer readable form, populating these sources largely requires a manual process of interpreting and extracting biological event templates from the biological research literature. Being able to efficiently and systematically automate the extraction of biological events from unstructured text, would improve the content of these databases, and provide methods to collect, maintain, interpret, curate, and discover knowledge needed for research or education. Hence, it is important to have an automated extraction system to extract events from biological texts. In this paper, we present an automated information extraction approach, to identify biological events in text. Our approach is based on, identifying event triggers and extracting event participants by using a kernel learner that operates on dependency and semantic information to calculate similarity between feature vectors.

[1]  Hsinchun Chen,et al.  Filling Preposition-Based Templates to Capture Information from Medical Abstracts , 2001, Pacific Symposium on Biocomputing.

[2]  Jari Björne,et al.  Extracting Complex Biological Events with Rich Graph-Based Feature Sets , 2009, BioNLP@HLT-NAACL.

[3]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[4]  Jun'ichi Tsujii,et al.  Event Extraction from Biomedical Papers Using a Full Parser , 2000, Pacific Symposium on Biocomputing.

[5]  Ralf Zimmer,et al.  RelEx - Relation extraction using dependency parse trees , 2007, Bioinform..

[6]  Junichi Tsujii,et al.  Event extraction for systems biology by text mining the literature. , 2010, Trends in biotechnology.

[7]  Joyce A. Mitchell,et al.  Gene Indexing: Characterization and Analysis of NLM's GeneRIFs , 2003, AMIA.

[8]  Rim Faiz,et al.  French-Written Event Extraction Based on Contextual Exploration , 2010, FLAIRS Conference.

[9]  Thorsten Brants,et al.  A Context Pattern Induction Method for Named Entity Extraction , 2006, CoNLL.

[10]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[11]  Hsinchun Chen,et al.  Extracting gene pathway relations using a hybrid grammar: the Arizona Relation Parser , 2004, Bioinform..

[12]  G Demetriou,et al.  Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[13]  Ted Briscoe,et al.  Biomedical Event Extraction without Training Data , 2009, BioNLP@HLT-NAACL.

[14]  Michael Krauthammer,et al.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles , 2001, ISMB.

[15]  Tapio Salakoski,et al.  Analysis of Link Grammar on Biomedical Dependency Corpus Targeted at Protein-Protein Interactions , 2004, NLPBA/BioNLP.

[16]  Ulf Leser,et al.  Molecular event extraction from Link Grammar parse trees , 2009, BioNLP@HLT-NAACL.

[17]  Dragomir R. Radev,et al.  Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing , 2007, EMNLP.

[18]  K. Bretonnel Cohen,et al.  High-precision biological event extraction with a concept recognizer , 2009, BioNLP@HLT-NAACL.

[19]  Rim Faiz Identifying Relevant Sentences in News Articles for Event Information Extraction , 2006, Int. J. Comput. Process. Orient. Lang..

[20]  Mariana L. Neves,et al.  Extraction of biomedical events using case-based reasoning , 2009, BioNLP@HLT-NAACL.

[21]  Udo Hahn,et al.  Event Extraction from Trimmed Dependency Graphs , 2009, BioNLP@HLT-NAACL.

[22]  Sampo Pyysalo,et al.  Overview of BioNLP’09 Shared Task on Event Extraction , 2009, BioNLP@HLT-NAACL.

[23]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[24]  Sougata Mukherjea,et al.  Enhancing a biomedical information extraction system with dictionary mining and context disambiguation , 2004, IBM J. Res. Dev..