The ACGT Master Ontology and its applications - Towards an ontology-driven cancer research and management system

OBJECTIVE This paper introduces the objectives, methods and results of ontology development in the EU co-funded project Advancing Clinico-genomic Trials on Cancer-Open Grid Services for Improving Medical Knowledge Discovery (ACGT). While the available data in the life sciences has recently grown both in amount and quality, the full exploitation of it is being hindered by the use of different underlying technologies, coding systems, category schemes and reporting methods on the part of different research groups. The goal of the ACGT project is to contribute to the resolution of these problems by developing an ontology-driven, semantic grid services infrastructure that will enable efficient execution of discovery-driven scientific workflows in the context of multi-centric, post-genomic clinical trials. The focus of the present paper is the ACGT Master Ontology (MO). METHODS ACGT project researchers undertook a systematic review of existing domain and upper-level ontologies, as well as of existing ontology design software, implementation methods, and end-user interfaces. This included the careful study of best practices, design principles and evaluation methods for ontology design, maintenance, implementation, and versioning, as well as for use on the part of domain experts and clinicians. RESULTS To date, the results of the ACGT project include (i) the development of a master ontology (the ACGT-MO) based on clearly defined principles of ontology development and evaluation; (ii) the development of a technical infrastructure (the ACGT Platform) that implements the ACGT-MO utilizing independent tools, components and resources that have been developed based on open architectural standards, and which includes an application updating and evolving the ontology efficiently in response to end-user needs; and (iii) the development of an Ontology-based Trial Management Application (ObTiMA) that integrates the ACGT-MO into the design process of clinical trials in order to guarantee automatic semantic integration without the need to perform a separate mapping process.

[1]  Axel Uhl,et al.  Model-Driven Architecture , 2002, OOIS Workshops.

[2]  Miguel García-Remesal,et al.  ONTOFUSION: Ontology-based integration of genomic and clinical databases , 2006, Comput. Biol. Medicine.

[3]  Mark A. Musen,et al.  PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment , 2000, AAAI/IAAI.

[4]  Luis Martín,et al.  Data Access and Management in ACGT: Tools to Solve Syntactic and Semantic Heterogeneities Between Clinical and Image Databases , 2007, ER Workshops.

[5]  Michael N. Huhns,et al.  An ontology tool for query formulation in an agent-based context , 1997, Proceedings of CoopIS 97: 2nd IFCIS Conference on Cooperative Information Systems.

[6]  Barry Smith,et al.  The Universal Medical Language System and the Gene Ontology: Some Critical Reflections , 2003, KI.

[7]  C. Sotiriou,et al.  Taking gene-expression profiling to the clinic: when will molecular signatures become relevant to patient care? , 2007, Nature Reviews Cancer.

[8]  W Ceusters,et al.  A Terminological and Ontological Analysis of the NCI Thesaurus , 2005, Methods of Information in Medicine.

[9]  Martin Doerr,et al.  The CIDOC Conceptual Reference Module: An Ontological Approach to Semantic Interoperability of Metadata , 2003, AI Mag..

[10]  Bernd Blobel,et al.  Semantic Interoperability between Health Communication Standards through Formal Ontologies , 2009, MIE.

[11]  Chunhua Weng,et al.  User-centered semantic harmonization: A case study , 2007, J. Biomed. Informatics.

[12]  C. Street,et al.  The Cancer Biomedical Informatics Grid (caBIGTM) , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[13]  Matthias Lange,et al.  SEMEDA: ontology based semantic integration of biological databases , 2003, Bioinform..

[14]  C. Wittekind,et al.  TNM Klassifikation maligner Tumoren , 1987 .

[15]  Jeffrey P. Krischer,et al.  Research Paper: Variation of SNOMED CT Coding of Clinical Research Concepts among Coding Experts , 2007, J. Am. Medical Informatics Assoc..

[16]  M. Brochhausen,et al.  Establishing and Harmonising Ontologies in an Interdisciplinary Health Care and Clinical Research Environment * , 2008 .

[17]  Arnon Rosenthal,et al.  Using semantic values to facilitate interoperability among heterogeneous information systems , 1994, TODS.

[18]  Aldo Gangemi,et al.  Modelling Ontology Evaluation and Validation , 2006, ESWC.

[19]  Tiziana Catarci,et al.  An Ontology Based Visual Tool for Query Formulation Support , 2004, OTM Workshops.

[20]  Deborah L. McGuinness,et al.  The Role of Frame-Based Representation on the Semantic Web , 2001 .

[21]  John H. Gennari,et al.  Integrating Genomic Knowledge Sources Through an Anatomy Ontology , 2005, Pacific Symposium on Biocomputing.

[22]  Manolis Tsiknakis,et al.  A Semantic Grid Infrastructure Enabling Integrated Access and Analysis of Multilevel Biomedical Data in Support of Postgenomic Clinical Trials on Cancer , 2008, IEEE Transactions on Information Technology in Biomedicine.

[23]  James J. Cimino,et al.  Reliability of SNOMED-CT Coding by Three Physicians using Two Terminology Browsers , 2006, AMIA.

[24]  Olivier Bodenreider,et al.  Investigating subsumption in SNOMED CT: An exploration into large description logic-based biomedical terminologies , 2007, Artif. Intell. Medicine.

[25]  Michael Y. Galperin The Molecular Biology Database Collection: 2008 update , 2007, Nucleic Acids Res..

[26]  Barry Smith The evaluation of ontologies: Editorial review vs. democratic ranking , 2008 .

[27]  Stuart E. Madnick,et al.  Improving data quality through effective use of data semantics , 2006, Data Knowl. Eng..

[28]  Alberto Anguita,et al.  Enabling Cross Constraint Satisfaction in RDF-Based Heterogeneous Database Integration , 2008, 2008 20th IEEE International Conference on Tools with Artificial Intelligence.

[29]  Barry Smith,et al.  SNAP and SPAN: Towards Dynamic Spatial Ontology , 2004, Spatial Cogn. Comput..

[30]  Izet Masic Medical Informatics in a United and Healthy Europe , 2009, MIE.

[31]  Gilberto Fragoso,et al.  caCORE version 3: Implementation of a model driven, service-oriented architecture for semantic interoperability , 2008, J. Biomed. Informatics.

[32]  Kerry K Kakazu,et al.  The Cancer Biomedical Informatics Grid (caBIG): pioneering an expansive network of information and tools for collaborative cancer research. , 2004, Hawaii medical journal.

[33]  Joel H. Saltz,et al.  Model Formulation: caGrid 1.0: An Enterprise Grid Infrastructure for Biomedical Research , 2008, J. Am. Medical Informatics Assoc..

[34]  Sudha Ram,et al.  Information systems interoperability: What lies beneath? , 2004, TOIS.

[35]  Pedro Rangel Henriques,et al.  Using the ontology paradigm to integrate information systems , 2004 .

[36]  Cornelius Rosse,et al.  A Reference Ontology for Bioinformatics: The Foundational Model of Anatomy , 2003 .

[37]  Werner Ceusters,et al.  Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain , 2006, KR-MED.

[38]  Michael R. Genesereth,et al.  Query planning in infomaster , 1997, SAC '97.

[39]  A. Kumara,et al.  The Unified Medical Language System and the Gene Ontology : Some Critical Reflections , 2003 .

[40]  B Blobel,et al.  eHealth: combining health telematics, telemedicine, biomedical engineering and bioinformatics to the edge. , 2008, Methods of information in medicine.

[41]  C. Bizer,et al.  D2R MAP - A Database to RDF Mapping Language , 2003, WWW.

[42]  Alberto Anguita,et al.  OntoDataClean: Ontology-Based Integration and Preprocessing of Distributed Data , 2006, ISBMDA.

[43]  Patrick Lambrix,et al.  Ontology-based integration for bioinformatics , 2005 .

[44]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[45]  M. Brochhausen,et al.  Ontology Based Data Management Systems for Post-Genomic Clinical Trials within a European Grid Infrastructure for Cancer Research , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[46]  Arnd Poetzsch-Heffter,et al.  Consistency Checking for Workflows with an Ontology-Based Data Perspective , 2009, DEXA.

[47]  Michel Klein,et al.  Combining and relating ontologies: an analysis of problems and solutions , 2001, OIS@IJCAI.

[48]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[49]  A. Rector,et al.  Relations in biomedical ontologies , 2005, Genome Biology.

[50]  Edward H. Shortliffe,et al.  Viewpoint: The Unified Medical Language System: Toward a Collaborative Approach for Solving Terminologic Problems , 1998, J. Am. Medical Informatics Assoc..