The BioHub Knowledge Base: Ontology and Repository for Sustainable Biosourcing

BackgroundThe motivation for the BioHub project is to create an Integrated Knowledge Management System (IKMS) that will enable chemists to source ingredients from bio-renewables, rather than from non-sustainable sources such as fossil oil and its derivatives.MethodThe BioHubKB is the data repository of the IKMS; it employs Semantic Web technologies, especially OWL, to host data about chemical transformations, bio-renewable feedstocks, co-product streams and their chemical components. Access to this knowledge base is provided to other modules within the IKMS through a set of RESTful web services, driven by SPARQL queries to a Sesame back-end. The BioHubKB re-uses several bio-ontologies and bespoke extensions, primarily for chemical feedstocks and products, to form its knowledge organisation schema.ResultsParts of plants form feedstocks, while various processes generate co-product streams that contain certain chemicals. Both chemicals and transformations are associated with certain qualities, which the BioHubKB also attempts to capture. Of immediate commercial and industrial importance is to estimate the cost of particular sets of chemical transformations (leading to candidate surfactants) performed in sequence, and these costs too are captured. Data are sourced from companies’ internal knowledge and document stores, and from the publicly available literature. Both text analytics and manual curation play their part in populating the ontology. We describe the prototype IKMS, the BioHubKB and the services that it supports for the IKMS.AvailabilityThe BioHubKB can be found via http://biohub.cs.manchester.ac.uk/ontology/biohub-kb.owl.

[1]  Lincoln Stein,et al.  The Plant Ontology Database: a community resource for plant structure and developmental stages controlled vocabulary and annotations , 2008, Nucleic Acids Res..

[2]  David Rogers,et al.  Cheminformatics analysis and learning in a data pipelining environment , 2006, Molecular Diversity.

[3]  Phillip Lord,et al.  The Semantic Web takes Wing: Programming Ontologies with Tawny-OWL , 2013, OWLED.

[4]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[5]  A. Rector,et al.  Relations in biomedical ontologies , 2005, Genome Biology.

[6]  Juan Carlos Serrano-Ruiz,et al.  Transformations of biomass-derived platform molecules: from high added-value chemicals to fuels via aqueous-phase processing. , 2011, Chemical Society reviews.

[7]  Ubbo Visser,et al.  BioAssay Ontology (BAO): a semantic description of bioassays and high-throughput screening results , 2011, BMC Bioinformatics.

[8]  Christoph Steinbeck,et al.  The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013 , 2012, Nucleic Acids Res..

[9]  S. Lewis,et al.  Uberon, an integrative multi-species anatomy ontology , 2012, Genome Biology.

[10]  Gunnar H. Nilsson,et al.  Enriching a primary health care version of ICD-10 using SNOMED CT mapping , 2010, J. Biomed. Semant..

[11]  Bob J. Wielinga,et al.  Using explicit ontologies in KBS development , 1997, Int. J. Hum. Comput. Stud..

[12]  Mark S. Fox,et al.  The Role of Competency Questions in Enterprise Engineering , 1995 .

[13]  Jessica A. Turner,et al.  Modeling biomedical experimental processes with OBI , 2010, J. Biomed. Semant..