BioNLP 2011 Task Bacteria Biotope – The Alvis system

This paper describes the system of the INRA Bibliome research group applied to the Bacteria Biotope (BB) task of the BioNLP 2011 shared tasks. Bacteria, geographical locations and host entities were processed by a pattern-based approach and domain lexical resources. For the extraction of environment locations, we propose a framework based on semantic analysis supported by an ontology of the biotope domain. Domain-specific rules were developed for dealing with Bacteria anaphora. Official results show that our Alvis system achieves the best performance of participating systems.

[1]  Akinori Yonezawa,et al.  Overview of Genia Event Task in BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[2]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[3]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[4]  Peter M. A. Sloot,et al.  A hybrid approach to extract protein-protein interactions , 2011, Bioinform..

[5]  Trevor C. Charles,et al.  Sinorhizobium meliloti 1021 loss-of-function deletion mutation in chvI and its phenotypic characteristics. , 2010, Molecular plant-microbe interactions : MPMI.

[6]  Jari Björne,et al.  Generalizing Biomedical Event Extraction , 2011, BioNLP@ACL.

[7]  Vincent Ng,et al.  Supervised Models for Coreference Resolution , 2009, EMNLP.

[8]  Sanda M. Harabagiu,et al.  Unsupervised Event Coreference Resolution with Rich Linguistic Features , 2010, ACL.

[9]  Jin-Dong Kim,et al.  The GENIA corpus: an annotated research abstract corpus in molecular biology domain , 2002 .

[10]  Haibin Liu,et al.  Biological event extraction using subgraph matching , 2010, Semantic Mining in Biomedicine.

[11]  Jun'ichi Tsujii,et al.  Evaluating contributions of natural language parsers to protein–protein interaction extraction , 2008, Bioinform..

[12]  K. Bretonnel Cohen,et al.  U-Compare: share and compare text mining tools with UIMA , 2009, Bioinform..

[13]  Joel L Fagan,et al.  Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods , 1987 .

[14]  Yoko Eguchi,et al.  Two-component signal transduction as potential drug targets in pathogenic bacteria. , 2010, Current opinion in microbiology.

[15]  Thierry Hamon,et al.  Detection of synonymy links between terms: experiment and results , 2001 .

[16]  János Csirik,et al.  The BioScope corpus: annotation for negation, uncertainty and their scope in biomedical texts , 2008, BioNLP.

[17]  Stan Matwin,et al.  Beyond the Bag of Words: A Text Representation for Sentence Selection , 2006, Canadian Conference on AI.

[18]  Jian Su,et al.  Improving Noun Phrase Coreference Resolution by Matching Strings , 2004, IJCNLP.

[19]  Heeyoung Lee,et al.  A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[20]  Christopher D. Manning,et al.  The Stanford Typed Dependencies Representation , 2008, CF+CDPE@COLING.

[21]  R Holliday,et al.  The inheritance of epigenetic defects. , 1987, Science.

[22]  Andrew McCallum,et al.  Robust Biomedical Event Extraction with Dual Decomposition and Minimal Domain Adaptation , 2011, BioNLP@ACL.

[23]  Carlo Aliprandi,et al.  KAF: a Generic Semantic Annotation Format , 2009 .

[24]  Jian Su,et al.  Coreference Resolution Using Competition Learning Approach , 2003, ACL.

[25]  Peter Murray-Rust,et al.  High-Throughput Identification of Chemistry in Life Science Texts , 2006, CompLife.

[26]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[27]  Udo Hahn,et al.  Event Extraction from Trimmed Dependency Graphs , 2009, BioNLP@HLT-NAACL.

[28]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[29]  Sanda M. Harabagiu,et al.  RESOLUTION , 1977, Monatsschrift für Kriminologie und Strafrechtsreform.

[30]  Michael Gamon,et al.  MSR-NLP Entry in BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[31]  K. Bretonnel Cohen,et al.  The structural and content aspects of abstracts versus bodies of full text journal articles are different , 2010, BMC Bioinformatics.

[32]  Vincent Fromion,et al.  Reconstruction and analysis of the genetic and metabolic regulatory networks of the central metabolism of Bacillus subtilis , 2008, BMC Systems Biology.

[33]  Jin-Dong Kim,et al.  Overview of the protein coreference task in BioNLP Shared Task 2011 , 2011 .

[34]  R Holliday,et al.  DNA modification mechanisms and gene activity during development , 1975, Science.

[35]  Sampo Pyysalo,et al.  Overview of the Entity Relations (REL) supporting task of BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[36]  A. Bird,et al.  Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals , 2003, Nature Genetics.

[37]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[38]  Stephen Clark,et al.  Porting a lexicalized-grammar parser to the biomedical domain , 2009, J. Biomed. Informatics.

[39]  Halil Kilicoglu,et al.  Adapting a General Semantic Interpretation Approach to Biological Event Extraction , 2011, BioNLP@ACL.

[40]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[41]  Gerben Menschaert,et al.  PubMeth: a cancer methylation database combining text-mining and expert annotation , 2007, Nucleic Acids Res..

[42]  I-Min A. Chen,et al.  The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata , 2007, Nucleic Acids Res..

[43]  Sophie Rosset,et al.  Semantic annotation of the French media dialog corpus , 2005, INTERSPEECH.

[44]  Carol Friedman,et al.  Automatic extraction of gene and protein synonyms from MEDLINE and journal articles , 2002, AMIA.

[45]  Jian Su,et al.  Coreference Resolution Using Semantic Relatedness Information from Automatically Discovered Patterns , 2007, ACL.

[46]  C. Fillmore FRAME SEMANTICS AND THE NATURE OF LANGUAGE * , 1976 .

[47]  Robert Bossy,et al.  Building Large Lexicalized Ontologies from Text: A Use Case in Automatic Indexing of Biotechnology Patents , 2010, EKAW.

[48]  L. Danlos “Discourse Verbs” and Discourse Periphrastic Links , 2006 .

[49]  Pierre Lechat,et al.  GenoList: an integrated environment for comparative analysis of microbial genomes , 2007, Nucleic Acids Res..

[50]  Lorraine K. Tanabe,et al.  GENETAG: a tagged corpus for gene/protein named entity recognition , 2005, BMC Bioinformatics.

[51]  Jun'ichi Tsujii,et al.  Dependency Parsing and Domain Adaptation with LR Models and Parser Ensembles , 2007, EMNLP.

[52]  Karen Jensen PEGASUS: Deriving Argument Structures after Syntax , 1993, Natural Language Processing.

[53]  Sophia Ananiadou,et al.  Construction of an annotated corpus to support biomedical information extraction , 2009, BMC Bioinformatics.

[54]  Sampo Pyysalo,et al.  Towards Event Extraction from Full Texts on Infectious Diseases , 2010, BioNLP@ACL.

[55]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[56]  Keith Stevens,et al.  The S-Space Package: An Open Source Package for Word Space Models , 2010, ACL.

[57]  Jörg Stülke,et al.  A community-curated consensual annotation that is continuously updated: the Bacillus subtilis centred wiki SubtiWiki , 2009, Database J. Biol. Databases Curation.

[58]  K. Hengeveld Mood and modality , 2004 .

[59]  Fabio Rinaldi,et al.  UZurich in the BioNLP 2009 Shared Task , 2009, BioNLP@HLT-NAACL.

[60]  Sophia Ananiadou,et al.  Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.

[61]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[62]  Udo Hahn,et al.  Evaluating the Impact of Alternative Dependency Graph Encodings on Solving Event Extraction Tasks , 2010, EMNLP.

[63]  Mark A. Przybocki,et al.  Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction , 2008, LREC.

[64]  Yannick Versley,et al.  BART: A Modular Toolkit for Coreference Resolution , 2008, ACL.

[65]  Sophia Ananiadou,et al.  How to make the most of NE dictionaries in statistical NER , 2008, BMC Bioinformatics.

[66]  Sampo Pyysalo,et al.  Event Extraction for Post-Translational Modifications , 2010, BioNLP@ACL.

[67]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[68]  K. Cohen,et al.  Overview of BioCreative II gene normalization , 2008, Genome Biology.

[69]  Yu-Hsiang Lin,et al.  Pronominal and Sortal Anaphora Resolution for Biomedical Literature , 2004, ROCLING/IJCLCLP.

[70]  Douglas Herrmann,et al.  A Taxonomy of Part-Whole Relations , 1987, Cogn. Sci..

[71]  Jun'ichi Tsujii,et al.  Feature Forest Models for Probabilistic HPSG Parsing , 2008, CL.

[72]  Udo Hahn,et al.  High-performance gene name normalization with GENO , 2009, Bioinform..

[73]  Sampo Pyysalo,et al.  EXTRACTING BIO‐MOLECULAR EVENTS FROM LITERATURE—THE BIONLP’09 SHARED TASK , 2011, Comput. Intell..

[74]  Tat-Seng Chua,et al.  A Public Reference Implementation of the RAP Anaphora Resolution Algorithm , 2004, LREC.

[75]  Thierry Hamon,et al.  Improving Term Extraction with Terminological Resources , 2006, FinTAL.

[76]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[77]  Junichi Tsujii,et al.  Event extraction for systems biology by text mining the literature. , 2010, Trends in biotechnology.

[78]  Claire Cardie,et al.  Conundrums in Noun Phrase Coreference Resolution: Making Sense of the State-of-the-Art , 2009, ACL.

[79]  Mario Vento,et al.  A (sub)graph isomorphism algorithm for matching large graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[80]  Jari Björne,et al.  Reconstruction of Semantic Relationships from Their Projections in Biomolecular Domain , 2010, BioNLP@ACL.

[81]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[82]  Sampo Pyysalo,et al.  Static Relations: a Piece in the Biomedical Information Extraction Puzzle , 2009, BioNLP@HLT-NAACL.

[83]  Michael Schroeder,et al.  Inter-species normalization of gene mentions with GNAT , 2008, ECCB.

[84]  Gerold Schneider,et al.  Hybrid Long-Distance Functional Dependency Parsing , 2009 .

[85]  Andreas Vlachos,et al.  Two Strong Baselines for the BioNLP 2009 Event Extraction Task , 2010, BioNLP@ACL.

[86]  Jun'ichi Tsujii,et al.  Syntax Annotation for the GENIA Corpus , 2005, IJCNLP.

[87]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.

[88]  A. Valencia,et al.  Overview of the protein-protein interaction annotation extraction task of BioCreative II , 2008, Genome Biology.

[89]  Karin M. Verspoor,et al.  From Graphs to Events: A Subgraph Matching Approach for Information Extraction from Biomedical Text , 2011, BioNLP@ACL.

[90]  Claire Cardie,et al.  Coreference Resolution with Reconcile , 2010, ACL.

[91]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[92]  Eric Crestan,et al.  Web-Scale Distributional Similarity and Entity Set Expansion , 2009, EMNLP.

[93]  Halil Kilicoglu,et al.  Syntactic Dependency Based Heuristics for Biological Event Extraction , 2009, BioNLP@HLT-NAACL.

[94]  Luc De Raedt,et al.  Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[95]  Jignesh M. Patel,et al.  SAGA: a subgraph matching tool for biological graphs , 2007, Bioinform..

[96]  Nancy Chinchor,et al.  Overview of MUC-7 , 1998, MUC.

[97]  Sophia Ananiadou,et al.  Text Mining for Biology And Biomedicine , 2005 .

[98]  Sampo Pyysalo,et al.  Overview of the Epigenetics and Post-translational Modifications (EPI) task of BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[99]  Chris F. Taylor,et al.  The minimum information about a genome sequence (MIGS) specification , 2008, Nature Biotechnology.

[100]  Hoifung Poon,et al.  Joint Inference for Knowledge Extraction from Biomedical Literature , 2010, NAACL.

[101]  Jari Björne,et al.  Complex event extraction at PubMed scale , 2010, Bioinform..

[102]  James R. Curran,et al.  Parsing the WSJ Using CCG and Log-Linear Models , 2004, ACL.

[103]  K. Bretonnel Cohen,et al.  Themes in biomedical natural language processing: BioNLP08 , 2008, BMC Bioinformatics.

[104]  Vincent Ng,et al.  Supervised Noun Phrase Coreference Research: The First Fifteen Years , 2010, ACL.

[105]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[106]  Yvan Saeys,et al.  Analyzing text in search of bio-molecular events: a high-precision machine learning framework , 2009, BioNLP@HLT-NAACL.

[107]  Ulf Leser,et al.  A Comprehensive Benchmark of Kernel Methods to Extract Protein–Protein Interactions from Literature , 2010, PLoS Comput. Biol..

[108]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[109]  Jian Su,et al.  Recognition of protein/gene names from text using an ensemble of classifiers , 2005, BMC Bioinformatics.

[110]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[111]  Jun'ichi Tsujii,et al.  Event Extraction with Complex Event Classification Using Rich Features , 2010, J. Bioinform. Comput. Biol..

[112]  César de Pablo-Sánchez,et al.  Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents , 2010, BMC Bioinformatics.

[113]  Goran Nenadic,et al.  LINNAEUS: A species name identification system for biomedical literature , 2010, BMC Bioinformatics.

[114]  Jun'ichi Tsujii,et al.  Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data , 2005, HLT.

[115]  Sampo Pyysalo,et al.  Integration of Static Relations to Enhance Event Extraction from Text , 2010, BioNLP@ACL.

[116]  Ted Briscoe,et al.  Statistical Anaphora Resolution in Biomedical Texts , 2008, COLING.

[117]  Lipika Dey,et al.  Biological relation extraction and query answering from MEDLINE abstracts using ontology-based text mining , 2007, Data Knowl. Eng..

[118]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[119]  Jun'ichi Tsujii,et al.  Task-oriented Evaluation of Syntactic Parsers and Their Representations , 2008, ACL.

[120]  A. Ciechanover,et al.  The ubiquitin-proteasome proteolytic pathway: destruction for the sake of construction. , 2002, Physiological reviews.

[121]  Halil Kilicoglu,et al.  Recognizing speculative language in biomedical research articles: a linguistically motivated perspective , 2008, BMC Bioinformatics.

[122]  Roser Morante,et al.  Memory-Based Resolution of In-Sentence Scopes of Hedge Cues , 2010, CoNLL Shared Task.

[123]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[124]  F Rinaldi,et al.  OntoGene in BioCreative II.5 , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[125]  Graciela Gonzalez,et al.  BANNER: An Executable Survey of Advances in Biomedical Named Entity Recognition , 2007, Pacific Symposium on Biocomputing.

[126]  R. Kay,et al.  Eukaryotic signal transduction via histidine-aspartate phosphorelay. , 2000, Journal of cell science.

[127]  Kaleem Siddiqi,et al.  Matching Hierarchical Structures Using Association Graphs , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[128]  Jari Björne,et al.  EXTRACTING CONTEXTUALIZED COMPLEX BIOLOGICAL EVENTS WITH RICH GRAPH‐BASED FEATURE SETS , 2011, Comput. Intell..

[129]  Jari Björne,et al.  Extracting Complex Biological Events with Rich Graph-Based Feature Sets , 2009, BioNLP@HLT-NAACL.

[130]  R. Dellavalle,et al.  The Processes of Life: An Introduction to Molecular Biology , 2009 .

[131]  Graciela Gonzalez-Hernandez,et al.  Double Layered Learning for Biological Event Extraction from Text , 2011, BioNLP@ACL.

[132]  Robert Bossy,et al.  BioNLP Shared Task 2011 - Bacteria Biotope , 2011, BioNLP@ACL.

[133]  Alfonso Valencia,et al.  Overview of BioCreAtIvE: critical assessment of information extraction for biology , 2005, BMC Bioinformatics.

[134]  Kelly Domico,et al.  Complex Biological Event Extraction from Full Text using Signatures of Linguistic and Semantic Features , 2011, BioNLP@ACL.

[135]  Don Tuggener,et al.  Inkrementelle Koreferenzanalyse für das Deutsche , 2010, KONVENS.

[136]  Xavier Carreras,et al.  Simple Semi-supervised Dependency Parsing , 2008, ACL.

[137]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[138]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[139]  Eugene Charniak,et al.  Any Domain Parsing: Automatic Domain Adaptation for Natural Language Parsing , 2010 .

[140]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[141]  Massimo Poesio,et al.  A General-Purpose, Off-the-shelf Anaphora Resolution Module: Implementation and Preliminary Evaluation , 2004, LREC.

[142]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[143]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[144]  Malvina Nissim,et al.  Comparing Knowledge Sources for Nominal Anaphora Resolution , 2005, Computational Linguistics.

[145]  Dan Klein,et al.  Learning Accurate, Compact, and Interpretable Tree Annotation , 2006, ACL.

[146]  Claire Gardent,et al.  Improving Machine Learning Approaches to Coreference Resolution , 2002, ACL.

[147]  Hongfang Liu,et al.  An exploratory study of a text classification framework for Internet-based surveillance of emerging epidemics , 2011, Int. J. Medical Informatics.

[148]  Jari Björne,et al.  A Graph Kernel for Protein-Protein Interaction Extraction , 2008, BioNLP.

[149]  Hasan Davulcu,et al.  BioEve: Bio-Molecular Event Extraction from Text Using Semantic Classification and Dependency Parsing , 2009, BioNLP@HLT-NAACL.

[150]  Jin-Dong Kim,et al.  Exploring Domain Differences for the Design of a Pronoun Resolution System for Biomedical Text , 2008, COLING.

[151]  Dragomir R. Radev,et al.  Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing , 2007, EMNLP.

[152]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[153]  Jari Björne,et al.  BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[154]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[155]  Sampo Pyysalo,et al.  A Comparative Study of Syntactic Parsers for Event Extraction , 2010, BioNLP@ACL.

[156]  Burr Settles,et al.  ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text , 2005 .

[157]  Sampo Pyysalo,et al.  Overview of BioNLP’09 Shared Task on Event Extraction , 2009, BioNLP@HLT-NAACL.

[158]  Richard Johansson,et al.  Extended Constituent-to-Dependency Conversion for English , 2007, NODALIDA.

[159]  S. Dongen Graph clustering by flow simulation , 2000 .

[160]  Gérard P. Huet,et al.  A Unification Algorithm for Typed lambda-Calculus , 1975, Theor. Comput. Sci..

[161]  A. Ninfa,et al.  Protein phosphorylation and regulation of adaptive responses in bacteria. , 1989, Microbiological reviews.

[162]  Yue Wang,et al.  Investigating heterogeneous protein annotations toward cross-corpora utilization , 2009, BMC Bioinformatics.

[163]  Sampo Pyysalo,et al.  BioNLP Shared Task 2011: Supporting Resources , 2011, BioNLP@ACL.

[164]  A. Valencia,et al.  Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge , 2008, Genome Biology.

[165]  Mihai Surdeanu,et al.  Event Extraction as Dependency Parsing for BioNLP 2011 , 2011, BioNLP@ACL.

[166]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[167]  Jian Su,et al.  Coreference Resolution in Biomedical Texts: a Machine Learning Approach , 2008, Ontologies and Text Mining for Life Sciences.

[168]  Timothy Baldwin,et al.  Biomedical Event Annotation with CRFs and Precision Grammars , 2009, BioNLP@HLT-NAACL.

[169]  Matthew A. Jaro,et al.  Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida , 1989 .

[170]  Dietrich Rebholz-Schuhmann,et al.  Applying ontology design patterns to the implementation of relations in GENIA , 2010, Semantic Mining in Biomedicine.

[171]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[172]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[173]  Fernando Pereira,et al.  Identifying gene and protein mentions in text using conditional random fields , 2005, BMC Bioinformatics.

[174]  Dan Klein,et al.  Online EM for Unsupervised Models , 2009, NAACL.

[175]  C. Fraser,et al.  Status of genome projects for nonpathogenic bacteria and archaea , 2000, Nature Biotechnology.

[176]  Quang Le Minh,et al.  A Pattern Approach for Biomedical Event Annotation , 2011, Proceedings of BioNLP Shared Task 2011 Workshop.

[177]  Jun'ichi Tsujii,et al.  A Maximum Entropy Tagger with Unsupervised Hidden Markov Models , 2001, NLPRS.

[178]  Bernard De Baets,et al.  Detecting Entity Relations as a Supporting Task for Bio-Molecular Event Extraction , 2011, BioNLP@ACL.

[179]  Nigel Collier,et al.  Introduction to the Bio-entity Recognition Task at JNLPBA , 2004, NLPBA/BioNLP.

[180]  Jian Su,et al.  An NP-Cluster Based Approach to Coreference Resolution , 2004, COLING.

[181]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[182]  György Móra,et al.  Exploring ways beyond the simple supervised learning approach for biological event extraction , 2009, BioNLP@HLT-NAACL.

[183]  U. Schindler,et al.  Interferons inhibit activation of STAT6 by interleukin 4 in human monocytes by inducing SOCS-1 gene expression. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[184]  Sayan Mukherjee,et al.  Feature Selection for SVMs , 2000, NIPS.

[185]  Andrew McCallum,et al.  Model Combination for Event Extraction in BioNLP 2011 , 2011, BioNLP@ACL.

[186]  Malvina Nissim,et al.  Exploring the boundaries: gene and protein identification in biomedical text , 2005, BMC Bioinformatics.

[187]  Claire Nedellec,et al.  Sentence Filtering for Information Extraction in Genomics, a Classification Problem , 2001, PKDD.

[188]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its new supplement TREMBL , 1996, Nucleic Acids Res..

[189]  Jun'ichi Tsujii,et al.  A Markov Logic Approach to Bio-Molecular Event Extraction , 2009, BioNLP@HLT-NAACL.

[190]  Daniel M. Bikel,et al.  Intricacies of Collins’ Parsing Model , 2004, CL.

[191]  Ulf Leser,et al.  Simple tricks for improving pattern-based information extraction from the biomedical literature , 2010, J. Biomed. Semant..

[192]  Jun'ichi Tsujii,et al.  Corpus annotation for mining biomedical events from literature , 2008, BMC Bioinformatics.

[193]  Fernando Pereira,et al.  Discriminative learning and spanning tree algorithms for dependency parsing , 2006 .

[194]  J. Euzéby List of Bacterial Names with Standing in Nomenclature: a folder available on the Internet. , 1997, International journal of systematic bacteriology.

[195]  Hong Yu,et al.  Figure Text Extraction in Biomedical Literature , 2011, PloS one.

[196]  Philippe Bessières,et al.  Extraction of Genic Interactions with the Recursive Logical Theory of an Ontology , 2010, CICLing.

[197]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[198]  Manabu Torii,et al.  SORTAL ANAPHORA RESOLUTION IN MEDLINE ABSTRACTS , 2007, Comput. Intell..

[199]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[200]  Philip S. Yu,et al.  Searching Substructures with Superimposed Distance , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[201]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[202]  Philippe Bessières,et al.  Information Extraction as an Ontology Population Task and Its Application to Genic Interactions , 2008, 2008 20th IEEE International Conference on Tools with Artificial Intelligence.

[203]  Fabio Rinaldi,et al.  An environment for relation mining over richly annotated corpora: the case of GENIA , 2006, BMC Bioinformatics.

[204]  Karën Fort,et al.  BioNLP Shared Task 2011 – Bacteria Gene Interactions and Renaming , 2011, BioNLP@ACL.

[205]  Xiaoqiang Luo,et al.  A Mention-Synchronous Coreference Resolution Algorithm Based On the Bell Tree , 2004, ACL.

[206]  K. Bretonnel Cohen,et al.  HIGH‐PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA , 2011, Comput. Intell..

[207]  Dan Klein,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[208]  Jari Björne,et al.  Scaling up Biomedical Event Extraction to the Entire PubMed , 2010, BioNLP@ACL.

[209]  Sampo Pyysalo,et al.  A Re-Evaluation of Biomedical Named Entity-Term Relations , 2010, J. Bioinform. Comput. Biol..

[210]  Jun'ichi Tsujii,et al.  Challenges in Pronoun Resolution System for Biomedical Text , 2008, LREC.

[211]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[212]  Sampo Pyysalo,et al.  Named Entity Recognition for Bacterial Type IV Secretion Systems , 2011, PloS one.

[213]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[214]  Sampo Pyysalo,et al.  Overview of the Infectious Diseases (ID) task of BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[215]  Daniel Marcu,et al.  A Large-Scale Exploration of Effective Global Features for a Joint Entity Detection and Tracking Model , 2005, HLT.

[216]  Hwee Tou Ng,et al.  A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[217]  Rolf Apweiler,et al.  Genome Reviews: standardizing content and representation of information about complete genomes. , 2006, Omics : a journal of integrative biology.

[218]  Yue Wang,et al.  Incorporating GENETAG-style annotation to GENIA corpus , 2009, BioNLP@HLT-NAACL.

[219]  Jörg Stülke,et al.  Connecting parts with processes: SubtiWiki and SubtiPathways integrate gene and pathway annotation for Bacillus subtilis. , 2010, Microbiology.