Developing a semantically rich ontology for the biobank-administration domain

BackgroundBiobanks are a critical resource for translational science. Recently, semantic web technologies such as ontologies have been found useful in retrieving research data from biobanks. However, recent research has also shown that there is a lack of data about the administrative aspects of biobanks. These data would be helpful to answer research-relevant questions such as what is the scope of specimens collected in a biobank, what is the curation status of the specimens, and what is the contact information for curators of biobanks. Our use cases include giving researchers the ability to retrieve key administrative data (e.g. contact information, contact's affiliation, etc.) about the biobanks where specific specimens of interest are stored. Thus, our goal is to provide an ontology that represents the administrative entities in biobanking and their relations. We base our ontology development on a set of 53 data attributes called MIABIS, which were in part the result of semantic integration efforts of the European Biobanking and Biomolecular Resources Research Infrastructure (BBMRI). The previous work on MIABIS provided the domain analysis for our ontology. We report on a test of our ontology against competency questions that we derived from the initial BBMRI use cases. Future work includes additional ontology development to answer additional competency questions from these use cases.ResultsWe created an open-source ontology of biobank administration called Ontologized MIABIS (OMIABIS) coded in OWL 2.0 and developed according to the principles of the OBO Foundry. It re-uses pre-existing ontologies when possible in cooperation with developers of other ontologies in related domains, such as the Ontology of Biomedical Investigation. OMIABIS provides a formalized representation of biobanks and their administration. Using the ontology and a set of Description Logic queries derived from the competency questions that we identified, we were able to retrieve test data with perfect accuracy. In addition, we began development of a mapping from the ontology to pre-existing biobank data structures commonly used in the U.S.ConclusionsIn conclusion, we created OMIABIS, an ontology of biobank administration. We found that basing its development on pre-existing resources to meet the BBMRI use cases resulted in a biobanking ontology that is re-useable in environments other than BBMRI. Our ontology retrieved all true positives and no false positives when queried according to the competency questions we derived from the BBMRI use cases. Mapping OMIABIS to a data structure used for biospecimen collections in a medical center in Little Rock, AR showed adequate coverage of our ontology.

[1]  Andrew D. Spear Ontology for the Twenty First Century: An Introduction with Recommendations , 2006 .

[2]  Jan-Eric Litton,et al.  BIMS: An information management system for biobanking in the 21st century , 2007, IBM Syst. J..

[3]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[4]  Lori C. Phillips,et al.  Using the i2b2 hive for clinical discovery: an example. , 2007, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[5]  Elena Beisswanger,et al.  BioTop: An upper domain ontology for the life sciencesA description of its current structure, contents and interfaces to OBO ontologies , 2008, Appl. Ontology.

[6]  Jan-Eric Litton,et al.  Biobanking for Europe , 2007, Briefings Bioinform..

[7]  Mansur R. Kabuka,et al.  Model Formulation: semCDI: A Query Formulation for Semantic Data Integration in caBIG , 2008, J. Am. Medical Informatics Assoc..

[8]  Michael Krauthammer,et al.  Semantic web data warehousing for caGrid , 2009, BMC Bioinformatics.

[9]  Johann Eder,et al.  Information Systems for Federated Biobanks , 2009, Trans. Large Scale Data Knowl. Centered Syst..

[10]  Gunnar H. Nilsson,et al.  Enriching a primary health care version of ICD-10 using SNOMED CT mapping , 2010, J. Biomed. Semant..

[11]  Jessica A. Turner,et al.  Modeling biomedical experimental processes with OBI , 2010, J. Biomed. Semant..

[12]  B Smith,et al.  Putting Biomedical Ontologies to Work , 2010, Methods of Information in Medicine.

[13]  Jack W. London,et al.  Using the Semantically Interoperable Biospecimen Repository Application, caTissue: End User Deployment Lessons Learned , 2010, 2010 IEEE International Conference on BioInformatics and BioEngineering.

[14]  M. Waldenberger,et al.  Comprehensive catalog of European biobanks , 2011, Nature Biotechnology.

[15]  Alan Ruttenberg,et al.  MIREOT: The minimum information to reference an external ontology term , 2009, Appl. Ontology.

[16]  Suzette J. Bielinski,et al.  Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank , 2012, Journal of Biomedical Semantics.

[17]  David Cox,et al.  Toward a roadmap in global biobanking for health , 2012, European Journal of Human Genetics.

[18]  David Robinson,et al.  Research resources: curating the new eagle-i discovery system , 2012, Database J. Biol. Databases Curation.

[19]  Stefan Schulz,et al.  Requirements for semantic biobanks. , 2012, Studies in health technology and informatics.

[20]  G. Henderson,et al.  Characterizing biobank organizations in the U.S.: results from a national survey , 2013, Genome Medicine.

[21]  Umberto Nanni,et al.  Standard preanalytical coding for biospecimens: review and implementation of the Sample PREanalytical Code (SPREC). , 2012, Biopreservation and biobanking.

[22]  G. Botti,et al.  Tumor biobanks in translational medicine , 2012, Journal of Translational Medicine.

[23]  M. Fransson,et al.  A Minimum Data Set for Sharing Biobank Samples, Information, and Data: MIABIS. , 2012, Biopreservation and biobanking.

[24]  Jie Liu,et al.  Simplifying MIREOT; a MIREOT Protege Plugin , 2012, SEMWEB.

[25]  Maurício Barcellos Almeida,et al.  Towards a Formal Representation of Document Acts and the Resulting Legal Entities , 2013 .