Biomedical knowledge base construction from text and its applications in knowledge-based systems

While general-purpose Knowledge Bases (KBs) have gone a long way in compiling comprehensive knowledge about people, events, places, etc., domain-specific KBs, such as on health, are equally important, but are less explored. Consequently, a comprehensive and expressive health KB that spans all aspects of biomedical knowledge is still missing. The main goal of this thesis is to develop principled methods for building such a KB and enabling knowledge-centric applications. We address several challenges and make the following contributions: • To construct a health KB, we devise a largely automated and scalable pattern-based knowledge extraction method covering a spectrum of different text genres and distilling a wide variety of facts from different biomedical areas. • To consider higher-arity relations, crucial for proper knowledge representation in advanced domain such as health, we generalize the fact-pattern duality paradigm of previous methods. A key novelty is the integration of facts with missing arguments by extending our framework to partial patterns and facts by reasoning over the composability of partial facts. • To demonstrate the benefits of a health KB, we devise systems for entity-aware search and analytics and for entity-relationshiporiented exploration. Extensive experiments and use-case studies demonstrate the viability of the proposed approaches.

[1]  Gerhard Weikum,et al.  People on drugs: credibility of user statements in health communities , 2014, KDD.

[2]  Sampo Pyysalo,et al.  Event extraction across multiple levels of biological organization , 2012, Bioinform..

[3]  Anna Rumshisky,et al.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge , 2013, J. Am. Medical Informatics Assoc..

[4]  Gerhard Weikum,et al.  Combining linguistic and statistical analysis to extract relations from web documents , 2006, KDD '06.

[5]  Pedro M. Domingos,et al.  Unsupervised Ontology Induction from Text , 2010, ACL.

[6]  E. Horvitz,et al.  Toward Enhanced Pharmacovigilance Using Patient-Generated Data on the Internet , 2014, Clinical pharmacology and therapeutics.

[7]  Oren Etzioni,et al.  Open Language Learning for Information Extraction , 2012, EMNLP.

[8]  Anna Korhonen,et al.  CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment , 2014, COLING.

[9]  Dan Roth,et al.  Exploiting Background Knowledge for Relation Extraction , 2010, COLING.

[10]  Christopher Ré,et al.  Large-scale extraction of gene interactions from full-text literature using DeepDive , 2015, Bioinform..

[11]  Olivier Bodenreider,et al.  Aggregating UMLS Semantic Types for Reducing Conceptual Complexity , 2001, MedInfo.

[12]  Jennifer Chu-Carroll,et al.  Building Watson: An Overview of the DeepQA Project , 2010, AI Mag..

[13]  Adam Wright,et al.  An automated technique for identifying associations between medications, laboratory results and problems , 2010, J. Biomed. Informatics.

[14]  M. Surdeanu,et al.  Overview of the English Slot Filling Track at the TAC 2014 Knowledge Base Population Evaluation , 2014 .

[15]  Jari Björne,et al.  BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[16]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[17]  Cathy H. Wu,et al.  Text Mining of Protein Phosphorylation Information Using a Generalizable Rule-Based Approach , 2013, BCB.

[18]  Gerhard Weikum,et al.  Harvesting facts from textual web sources by constrained label propagation , 2011, CIKM '11.

[19]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[20]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[21]  Gerhard Weikum,et al.  Extraction of temporal facts and events from Wikipedia , 2012, TempWeb '12.

[22]  Denilson Barbosa,et al.  Effectiveness and Efficiency of Open Relation Extraction , 2013, EMNLP.

[23]  Alessandro Moschitti,et al.  Making Tree Kernels Practical for Natural Language Learning , 2006, EACL.

[24]  Gerhard Weikum,et al.  Scalable knowledge harvesting with high precision and high recall , 2011, WSDM '11.

[25]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[26]  Karin M. Verspoor,et al.  Approximate Subgraph Matching-Based Literature Mining for Biomedical Events and Relations , 2013, PloS one.

[27]  Yaoyun Zhang,et al.  Domain Adaptation for Semantic Role Labeling of Clinical Text , 2015, AMIA.

[28]  Hongfei Lin,et al.  A protein-protein interaction extraction approach based on deep neural network , 2016, Int. J. Data Min. Bioinform..

[29]  Jun'ichi Tsujii,et al.  Protein-protein interaction extraction by leveraging multiple kernels and parsers , 2009, Int. J. Medical Informatics.

[30]  Sophia Ananiadou,et al.  Discovering and visualizing indirect associations between biomedical concepts , 2011, Bioinform..

[31]  Son Doan,et al.  Global Health Monitor - A Web-based System for Detecting and Mapping Infectious Diseases , 2019, IJCNLP.

[32]  Halil Kilicoglu,et al.  Semantic Relations Asserting the Etiology of Genetic Diseases , 2003, AMIA.

[33]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[34]  Heng Ji,et al.  Incremental Joint Extraction of Entity Mentions and Relations , 2014, ACL.

[35]  Gerhard Weikum,et al.  YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract , 2013, IJCAI.

[36]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[37]  Miguel A. Andrade-Navarro,et al.  Automatic Extraction of Biological Information from Scientific Text: Protein-Protein Interactions , 1999, ISMB.

[38]  Alfonso Valencia,et al.  Extraction of human kinase mutations from literature, databases and genotyping studies , 2009, BMC Bioinformatics.

[39]  M. Shigematsu,et al.  Using Social Media for Actionable Disease Surveillance and Outbreak Management: A Systematic Literature Review , 2015, PloS one.

[40]  Jun'ichi Tsujii,et al.  Semantic Retrieval for the Accurate Identification of Relational Concepts in Massive Textbases , 2006, ACL.

[41]  Jari Björne,et al.  Generalizing Biomedical Event Extraction , 2011, BioNLP@ACL.

[42]  Edgar Meij,et al.  Utilizing Knowledge Graphs in Text-centric Information Retrieval , 2017, WSDM.

[43]  Jiawei Han,et al.  Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions , 2015, IEEE Transactions on Knowledge and Data Engineering.

[44]  Sheng Zhang,et al.  Universal Decompositional Semantics on Universal Dependencies , 2016, EMNLP.

[45]  Cong Liu,et al.  Semantic Relation Classification via Hierarchical Recurrent Neural Network with Attention , 2016, COLING.

[46]  Achim Rettinger,et al.  XKnowSearch!: Exploiting Knowledge Bases for Entity-based Cross-lingual Information Retrieval , 2016, CIKM.

[47]  Heng Ji,et al.  CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases , 2016, WWW.

[48]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[49]  Yifan Peng,et al.  Deep learning for extracting protein-protein interactions from biomedical literature , 2017, BioNLP.

[50]  Amy Siu Knowledge-driven entity recognition and disambiguation in biomedical text , 2017 .

[51]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[52]  Guodong Zhou,et al.  Dependency-directed Tree Kernel-based Protein-Protein Interaction Extraction from Biomedical Literature , 2011, IJCNLP.

[53]  Teruyoshi Hishiki,et al.  Extraction of Gene-Disease Relations from Medline Using Domain Dictionaries and Machine Learning , 2005, Pacific Symposium on Biocomputing.

[54]  Luke S. Zettlemoyer,et al.  Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[55]  Andrew McCallum,et al.  Fast and Robust Joint Models for Biomedical Event Extraction , 2011, EMNLP.

[56]  Mark A. Przybocki,et al.  The Automatic Content Extraction (ACE) Program – Tasks, Data, and Evaluation , 2004, LREC.

[57]  Jiawei Han,et al.  CETR: content extraction via tag ratios , 2010, WWW '10.

[58]  Sophia Ananiadou,et al.  Extracting semantically enriched events from biomedical literature , 2012, BMC Bioinformatics.

[59]  Hoifung Poon,et al.  Unsupervised Semantic Parsing , 2009, EMNLP.

[60]  Christopher De Sa,et al.  Incremental Knowledge Base Construction Using DeepDive , 2015, The VLDB Journal.

[61]  Ulf Leser,et al.  ALIBABA: PubMed as a graph , 2006, Bioinform..

[62]  Rong Xu,et al.  Large-scale extraction of accurate drug-disease treatment pairs from biomedical literature for drug repurposing , 2013, BMC Bioinformatics.

[63]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[64]  Karin M. Verspoor,et al.  Optimizing graph-based patterns to extract biomedical events from the literature , 2015, BMC Bioinformatics.

[65]  Zhi Jin,et al.  Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths , 2015, EMNLP.

[66]  Gerhard Weikum,et al.  Knowledge harvesting from text and Web sources , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[67]  Alan M. Frieze,et al.  Min-Wise Independent Permutations , 2000, J. Comput. Syst. Sci..

[68]  César de Pablo-Sánchez,et al.  Extracting drug-drug interactions from biomedical texts , 2010, BMC Bioinformatics.

[69]  Hoifung Poon,et al.  Distant Supervision for Cancer Pathway Extraction from Text , 2014, Pacific Symposium on Biocomputing.

[70]  Björn Buchhold,et al.  Semantic Search on Text and Knowledge Bases , 2016, Found. Trends Inf. Retr..

[71]  Gerhard Weikum,et al.  KnowLife: a versatile approach for constructing a large knowledge graph for biomedical sciences , 2015, BMC Bioinformatics.

[72]  Lora Aroyo,et al.  Measuring Crowd Truth for Medical Relation Extraction , 2013, AAAI Fall Symposia.

[73]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[74]  Dan Roth,et al.  Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.

[75]  Chang Wang,et al.  Medical Relation Extraction with Manifold Models , 2014, ACL.

[76]  James P. Callan,et al.  Explicit Semantic Ranking for Academic Search via Knowledge Graph Embedding , 2017, WWW.

[77]  Hector J. Levesque,et al.  Knowledge Representation and Reasoning , 2004 .

[78]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[79]  Dean Allemang,et al.  Semantic Web for the Working Ontologist - Effective Modeling in RDFS and OWL, Second Edition , 2011 .

[80]  Nanyun Peng,et al.  Cross-Sentence N-ary Relation Extraction with Graph LSTMs , 2017, TACL.

[81]  Halil Kilicoglu,et al.  Semantic MEDLINE: An advanced information management application for biomedicine , 2011, Inf. Serv. Use.

[82]  Núria Queralt-Rosinach,et al.  Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research , 2014, BMC Bioinformatics.

[83]  Marcelo Fiszman,et al.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text , 2003, J. Biomed. Informatics.

[84]  Peter J. Haas,et al.  Consistent selectivity estimation via maximum entropy , 2007, The VLDB Journal.

[85]  Jerzy W. Jaromczyk,et al.  Rapid and Reusable Text Visualization and Exploration Development with DELVE , 2017, CRI.

[86]  Nanda Kambhatla,et al.  Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations , 2004, ACL 2004.

[87]  Luciano Del Corro,et al.  MinIE: Minimizing Facts in Open Information Extraction , 2017, EMNLP.

[88]  Kenneth D. Mandl,et al.  HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports , 2008, Journal of the American Medical Informatics Association.

[89]  Dan Roth,et al.  Modeling Semantic Relations Expressed by Prepositions , 2013, TACL.

[90]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[91]  Paloma Martínez,et al.  An analysis on the entity annotations in biological corpora , 2014, F1000Research.

[92]  Nigel Collier,et al.  PASBio: predicate-argument structures for event extraction in molecular biology , 2004, BMC Bioinformatics.

[93]  Xinlei Chen,et al.  Never-Ending Learning , 2012, ECAI.

[94]  Estevam R. Hruschka,et al.  Coupled semi-supervised learning for information extraction , 2010, WSDM '10.

[95]  Daniel Marcu,et al.  Extracting Biomolecular Interactions Using Semantic Parsing of Biomedical Text , 2015, AAAI.

[96]  Ralph Grishman,et al.  Information extraction for enhanced access to disease outbreak reports , 2002, J. Biomed. Informatics.

[97]  Dan Roth,et al.  An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) , 2012, LREC.

[98]  Herman D. Tolentino,et al.  Use of Unstructured Event-Based Reports for Global Infectious Disease Surveillance , 2009, Emerging infectious diseases.

[99]  Rónán O'Beirne,et al.  The Blackwell Guide to the Philosophy of Computing and Information , 2004 .

[100]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[101]  Udo Hahn,et al.  Event Extraction from Trimmed Dependency Graphs , 2009, BioNLP@HLT-NAACL.

[102]  Gang Feng,et al.  Disease Ontology: a backbone for disease semantic integration , 2011, Nucleic Acids Res..

[103]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[104]  Jun'ichi Tsujii,et al.  Corpus annotation for mining biomedical events from literature , 2008, BMC Bioinformatics.

[105]  Rong Xu,et al.  dRiskKB: a large-scale disease-disease risk relationship knowledge base constructed from biomedical text , 2014, BMC Bioinformatics.

[106]  Rohit J. Kate,et al.  Comparative experiments on learning information extractors for proteins and their interactions , 2005, Artif. Intell. Medicine.

[107]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[108]  Reinhard Schneider,et al.  BioTextQuest+: a knowledge integration platform for literature mining and concept discovery. , 2015, Bioinformatics.

[109]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[110]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[111]  Juliette Dibie,et al.  Xart system: discovering and extracting correlated arguments of n-ary relations from text , 2016, WIMS.

[112]  Iryna Gurevych,et al.  Cross-Genre and Cross-Domain Detection of Semantic Uncertainty , 2012, CL.

[113]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.

[114]  Daniel Berleant,et al.  Mining MEDLINE: Abstracts, Sentences, or Phrases? , 2001, Pacific Symposium on Biocomputing.

[115]  Oren Etzioni,et al.  Identifying Meaningful Citations , 2015, AAAI Workshop: Scholarly Big Data.

[116]  Estevam R. Hruschka,et al.  Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.

[117]  Padmini Srinivasan,et al.  Ferret: a sentence-based literature scanning system , 2015, BMC Bioinformatics.

[118]  Ivan Titov,et al.  Semantic Role Labeling , 2010, HLT-NAACL.

[119]  F. Sanz,et al.  A Knowledge-Driven Approach to Extract Disease-Related Biomarkers from the Literature , 2014, BioMed research international.

[120]  Michael Schroeder,et al.  GoPubMed: exploring PubMed with the Gene Ontology , 2005, Nucleic Acids Res..

[121]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[122]  R. Altman,et al.  Pharmacogenomics Knowledge for Personalized Medicine , 2012, Clinical pharmacology and therapeutics.

[123]  Mark Craven,et al.  Constructing Biological Knowledge Bases by Extracting Information from Text Sources , 1999, ISMB.

[124]  Dan Roth,et al.  Gauging the internet doctor: ranking medical claims based on community knowledge , 2011, DMMH '11.

[125]  Karin M. Verspoor,et al.  Finding and Exploring Health Information with a Slider-Based User Interface , 2016, HIC.

[126]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[127]  Anna Rumshisky,et al.  Research and applications: Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods , 2014, J. Am. Medical Informatics Assoc..

[128]  David Sontag,et al.  Learning a Health Knowledge Graph from Electronic Medical Records , 2017, Scientific Reports.

[129]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[130]  Houfeng Wang,et al.  Bidirectional Recurrent Convolutional Neural Network for Relation Classification , 2016, ACL.

[131]  Rolf Niedermeier,et al.  New Upper Bounds for Maximum Satisfiability , 2000, J. Algorithms.

[132]  Karin M. Verspoor,et al.  Better Health Explorer: Designing for Health Information Seekers , 2015, OZCHI.

[133]  Dan Roth,et al.  The Importance of Syntactic Parsing and Inference in Semantic Role Labeling , 2008, CL.

[134]  Jun Zhao,et al.  Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[135]  Dietrich Rebholz-Schuhmann,et al.  MedEvi: Retrieving textual evidence of relations between biomedical concepts from Medline , 2008, Bioinform..

[136]  Erik Cambria,et al.  Sentic patterns: Dependency-based rules for concept-level sentiment analysis , 2014, Knowl. Based Syst..

[137]  Hans-Peter Kriegel,et al.  Extraction of semantic biomedical relations from text using conditional random fields , 2008, BMC Bioinformatics.

[138]  Juliane Fluck,et al.  Identification of new drug classification terms in textual resources , 2007, ISMB/ECCB.

[139]  Tom M. Mitchell,et al.  Coupled temporal scoping of relational facts , 2012, WSDM '12.

[140]  Gerhard Weikum,et al.  STICS: searching with strings, things, and cats , 2014, SIGIR.

[141]  Zhiyuan Liu,et al.  Relation Classification via Multi-Level Attention CNNs , 2016, ACL.

[142]  Benjamin M. Good,et al.  Crowdsourcing for bioinformatics , 2013, Bioinform..

[143]  Ralf Steinberger,et al.  Text Mining from the Web for Medical Intelligence , 2007, NATO ASI Mining Massive Data Sets for Security.

[144]  Hoifung Poon,et al.  Grounded Semantic Parsing for Complex Knowledge Extraction , 2015, NAACL.

[145]  Csongor Nyulas,et al.  BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications , 2011, Nucleic Acids Res..

[146]  Gerhard Weikum,et al.  DIDO: a disease-determinants ontology from web sources , 2011, WWW.

[147]  Takashi Chikayama,et al.  Wide-coverage relation extraction from MEDLINE using deep syntax , 2015, BMC Bioinformatics.

[148]  Christopher D. Manning,et al.  Leveraging Linguistic Structure For Open Domain Information Extraction , 2015, ACL.

[149]  Zhiyong Lu,et al.  OpenDMAP: An open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression , 2008, BMC Bioinformatics.

[150]  Jari Björne,et al.  Large-Scale Event Extraction from Literature with Multi-Level Gene Normalization , 2013, PloS one.

[151]  Hans-Michael Müller,et al.  Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological Literature , 2004, PLoS biology.

[152]  Anselmo Peñas,et al.  Temporally Anchored Relation Extraction , 2012, ACL.

[153]  M. Wang,et al.  An Unsupervised Text Mining Method for Relation Extraction from Biomedical Literature , 2014, PloS one.

[154]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[155]  Udo Hahn,et al.  Semedico: A Comprehensive Semantic Search Engine for the Life Sciences , 2017, ACL.

[156]  Tapio Salakoski,et al.  Exploring Biomolecular Literature with EVEX: Connecting Genes through Events, Homology, and Indirect Associations , 2012, Adv. Bioinformatics.

[157]  David S. Wishart,et al.  PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more , 2015, Nucleic Acids Res..

[158]  Dafna Shahaf,et al.  Connecting Two (or Less) Dots: Discovering Structure in News Articles , 2012, TKDD.

[159]  Jari Björne,et al.  Extracting Complex Biological Events with Rich Graph-Based Feature Sets , 2009, BioNLP@HLT-NAACL.

[160]  Daniel S. Weld,et al.  Learning 5000 Relational Extractors , 2010, ACL.

[161]  Bowen Zhou,et al.  Classifying Relations by Ranking with Convolutional Neural Networks , 2015, ACL.

[162]  Rafael Berlanga Llavori,et al.  Exploiting semantic annotations for open information extraction: an experience in the biomedical domain , 2014, Knowledge and Information Systems.

[163]  Guodong Zhou,et al.  Hierarchical learning strategy in semantic relation extraction , 2008, Inf. Process. Manag..

[164]  Hoifung Poon,et al.  Literome: PubMed-scale genomic knowledge base in the cloud , 2014, Bioinform..

[165]  Zachary F. Meisel,et al.  Crowdsourcing—Harnessing the Masses to Advance Health and Medicine, a Systematic Review , 2013, Journal of General Internal Medicine.

[166]  Mohamed Yahya,et al.  ReNoun: Fact Extraction for Nominal Attributes , 2014, EMNLP.

[167]  Hans Uszkoreit,et al.  Large-Scale Learning of Relation-Extraction Rules with Distant Supervision from the Web , 2012, International Semantic Web Conference.

[168]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[169]  Xu Han,et al.  Extending the evaluation of Genia Event task toward knowledge base construction and comparison to Gene Regulation Ontology task , 2015, BMC Bioinformatics.

[170]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[171]  Zhiyong Lu,et al.  Hybrid curation of gene–mutation relations combining automated extraction and crowdsourcing , 2014, Database J. Biol. Databases Curation.

[172]  Andreas Spitz,et al.  EVELIN: Exploration of Event and Entity Links in Implicit Networks , 2017, WWW.

[173]  Barbara Rosario,et al.  Classifying Semantic Relations in Bioscience Texts , 2004, ACL.

[174]  Antoine Doucet,et al.  Multilingual event extraction for epidemic detection , 2015, Artif. Intell. Medicine.

[175]  William W. Cohen,et al.  Bootstrapping Biomedical Ontologies for Scientific Text using NELL , 2012, BioNLP@HLT-NAACL.

[176]  Gerhard Weikum,et al.  Instant Espresso: Interactive Analysis of Relationships in Knowledge Graphs , 2016, WWW.

[177]  Gerhard Weikum,et al.  A Fresh Look on Knowledge Bases: Distilling Named Events from News , 2014, CIKM.

[178]  Andrew McCallum,et al.  Combining joint models for biomedical event extraction , 2012, BMC Bioinformatics.

[179]  Mausam,et al.  Open Information Extraction Systems and Downstream Applications , 2016, IJCAI.

[180]  Jari Björne,et al.  TEES 2.2: Biomedical Event Extraction for Diverse Corpora , 2015, BMC Bioinformatics.

[181]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.

[182]  Michael P. H. Stumpf,et al.  Which species is it? Species-driven gene name disambiguation using random walks over a mixture of adjacency matrices , 2012, Bioinform..

[183]  Jens P. Linge,et al.  MedISys: An early-warning system for the detection of (re-)emerging food- and feed-borne hazards , 2010 .

[184]  Yun Chi,et al.  Canonical forms for labelled trees and their applications in frequent subtree mining , 2005, Knowledge and Information Systems.

[185]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[186]  Fang Kong,et al.  Exploiting Constituent Dependencies for Tree Kernel-Based Semantic Relation Extraction , 2008, COLING.

[187]  Yang Jin,et al.  Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE , 2005, ACL.

[188]  Xuan Wang,et al.  Life-iNet: A Structured Network-Based Knowledge Exploration and Analytics System for Life Sciences , 2017, ACL.

[189]  Denilson Barbosa,et al.  Shallow Information Extraction for the knowledge Web , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[190]  Amy Siu,et al.  Fast entity recognition in biomedical text , 2013 .

[191]  Luciano Del Corro,et al.  ClausIE: clause-based open information extraction , 2013, WWW.

[192]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[193]  Jihye Choi,et al.  Web-based infectious disease surveillance systems and public health perspectives: a systematic review , 2016, BMC Public Health.

[194]  Zhiyong Lu,et al.  BioCreative III interactive task: an overview , 2011, BMC Bioinformatics.

[195]  Hans Uszkoreit,et al.  Sar-graphs: A language resource connecting linguistic knowledge with semantic relations from knowledge graphs , 2016, J. Web Semant..

[196]  Claire Nédellec,et al.  Learning Language in Logic - Genic Interaction Extraction Challenge , 2005 .

[197]  Andrew McCallum,et al.  Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[198]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[199]  Gerhard Weikum,et al.  SOFIE: a self-organizing framework for information extraction , 2009, WWW '09.

[200]  Gondy Leroy,et al.  Genescene: An ontology-enhanced integration of linguistic and co-occurrence based relations in biomedical texts: Research Articles , 2005 .

[201]  Razvan C. Bunescu,et al.  Subsequence Kernels for Relation Extraction , 2005, NIPS.

[202]  Hannah Bast,et al.  An index for efficient semantic full-text search , 2013, CIKM.

[203]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[204]  Paolo Ferragina,et al.  Fast and Accurate Annotation of Short Texts with Wikipedia Pages , 2010, IEEE Software.

[205]  Hans Uszkoreit,et al.  Improvement of n-ary Relation Extraction by Adding Lexical Semantics to Distant-Supervision Rule Learning , 2015, ICAART.

[206]  Martin Theobald,et al.  A Temporal-Probabilistic Database Model for Information Extraction , 2013, Proc. VLDB Endow..

[207]  Barend Mons,et al.  Open PHACTS: semantic interoperability for drug discovery. , 2012, Drug discovery today.

[208]  Ulf Leser,et al.  GeneView: a comprehensive semantic search engine for PubMed , 2012, Nucleic Acids Res..

[209]  Jun'ichi Tsujii,et al.  Event Extraction with Complex Event Classification Using Rich Features , 2010, J. Bioinform. Comput. Biol..