Biomolecular event trigger detection using neighborhood hash features.

The complex interactions between biomolecules and the consequences of these interactions are known as biomolecular events. Such events particularly in proteins play a key role in several aspects of proteomics. The major source of extraction of biomolecular events is the biomedical literature. Event trigger word detection is generally the first step in computationally mining the biomedical literature for biomolecular events. In this work, we study how to efficiently map the dependency graph of a candidate sentence into semantic/syntactic features, and use these semantic/syntactic features to detect bio-event triggers from the biomedical literature. The key factor in our method was the use of the hash operation to iteratively compute the dependency graph and utilize the properties of the hash operation to map the dependency graph into neighborhood hash features. The experimental results showed that neighborhood hash features can effectively represent the semantic/syntactic information in the sentence dependency graph. Furthermore, neighborhood hash features and basic features are complementary in the detection of biomolecular triggers. This approach, based on neighborhood hash features, achieved state-of-the-art performance on BioNLP datasets with respect to comparable evaluations.

[1]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[2]  Jun'ichi Tsujii,et al.  BIO‐MOLECULAR EVENT EXTRACTION WITH MARKOV LOGIC , 2011, Comput. Intell..

[3]  Jun'ichi Tsujii,et al.  Evaluating Impact of Re-training a Lexical Disambiguation Model on Domain Adaptation of an HPSG Parser , 2007, Trends in Parsing Technology.

[4]  Jari Björne,et al.  Complex event extraction at PubMed scale , 2010, Bioinform..

[5]  Jun'ichi Tsujii,et al.  Event Extraction with Complex Event Classification Using Rich Features , 2010, J. Bioinform. Comput. Biol..

[6]  Nguyen Truong Son,et al.  A Pattern Approach for Biomedical Event Annotation , 2011, BioNLP@ACL.

[7]  Kathleen Marchal,et al.  Detection of cis-acting regulatory elements in plants: a Gibbs sampling approach , 2000 .

[8]  Jihoon Yang,et al.  Walk-weighted subsequence kernels for protein-protein interaction extraction , 2010, BMC Bioinformatics.

[9]  Eugene Charniak,et al.  Self-Training for Biomedical Parsing , 2008, ACL.

[10]  Jari Björne,et al.  Extracting Complex Biological Events with Rich Graph-Based Feature Sets , 2009, BioNLP@HLT-NAACL.

[11]  Matthew Lease,et al.  Parsing Biomedical Literature , 2005, IJCNLP.

[12]  Sung-Hyon Myaeng,et al.  Simplicity is Better: Revisiting Single Kernel PPI Extraction , 2010, COLING.

[13]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[14]  Stephen Clark,et al.  Porting a lexicalized-grammar parser to the biomedical domain , 2009, J. Biomed. Informatics.

[15]  Razvan C. Bunescu,et al.  Subsequence Kernels for Relation Extraction , 2005, NIPS.

[16]  Sampo Pyysalo,et al.  A Comparative Study of Syntactic Parsers for Event Extraction , 2010, BioNLP@ACL.

[17]  Sampo Pyysalo,et al.  Overview of BioNLP’09 Shared Task on Event Extraction , 2009, BioNLP@HLT-NAACL.

[18]  Jihoon Yang,et al.  Data and text mining Kernel approaches for genic interaction extraction , 2008 .

[19]  Timothy Baldwin,et al.  Word sense disambiguation for event trigger word detection in biomedicine , 2011, BMC Bioinformatics.

[20]  Dmitry Zelenko,et al.  Kernel methods for relation extraction , 2003 .

[21]  Hisashi Kashima,et al.  A Linear-Time Graph Kernel , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[22]  Halil Kilicoglu,et al.  EFFECTIVE BIO‐EVENT EXTRACTION USING TRIGGER WORDS AND SYNTACTIC DEPENDENCIES , 2011, Comput. Intell..

[23]  Hagit Shatkay,et al.  Mining the Biomedical Literature in the Genomic Era: An Overview , 2003, J. Comput. Biol..

[24]  Yijia Zhang,et al.  Neighborhood hash graph kernel for protein-protein interaction extraction , 2011, J. Biomed. Informatics.

[25]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[26]  Ulf Leser,et al.  MOLECULAR EVENT EXTRACTION FROM LINK GRAMMAR PARSE TREES IN THE BIONLP’09 SHARED TASK , 2011, Comput. Intell..

[27]  Yvan Saeys,et al.  HIGH‐PRECISION BIO‐MOLECULAR EVENT EXTRACTION FROM TEXT USING PARALLEL BINARY CLASSIFIERS , 2011, Comput. Intell..

[28]  M. He,et al.  PPI Finder: A Mining Tool for Human Protein-Protein Interactions , 2009, PloS one.

[29]  Junichi Tsujii,et al.  Event extraction for systems biology by text mining the literature. , 2010, Trends in biotechnology.