A Terminological and Ontological Analysis of the NCI Thesaurus

OBJECTIVE The National Cancer Institute Thesaurus is described by its authors as "a biomedical vocabulary that provides consistent, unambiguous codes and definitions for concepts used in cancer research" and which "exhibits ontology-like properties in its construction and use". We performed a qualitative analysis of the Thesaurus in order to assess its conformity with principles of good practice in terminology and ontology design. MATERIALS AND METHODS We used both the on-line browsable version of the Thesaurus and its OWL-representation (version 04.08b, released on August 2, 2004), measuring each in light of the requirements put forward in relevant ISO terminology standards and in light of ontological principles advanced in the recent literature. RESULTS We found many mistakes and inconsistencies with respect to the term-formation principles used, the underlying knowledge representation system, and missing or inappropriately assigned verbal and formal definitions. CONCLUSION Version 04.08b of the NCI Thesaurus suffers from the same broad range of problems that have been observed in other biomedical terminologies. For its further development, we recommend the use of a more principled approach that allows the Thesaurus to be tested not just for internal consistency but also for its degree of correspondence to that part of reality which it is designed to represent.

[1]  L. Stein,et al.  OWL Web Ontology Language - Reference , 2004 .

[2]  Jan Wielemaker,et al.  Native Preemptive Threads in SWI-Prolog , 2003, ICLP.

[3]  Gilberto Fragoso,et al.  Enterprise Vocabulary Development in Protege / OWL : Workflow and Concept History Requirements , 2004 .

[4]  Domenico M. Pisanelli,et al.  Ontologies in Medicine , 2004 .

[5]  Werner Ceusters,et al.  Ontology-Based Error Detection in SNOMED-CT® , 2004, MedInfo.

[6]  James A. Hendler,et al.  The National Cancer Institute's Thésaurus and Ontology , 2003, J. Web Semant..

[7]  Werner Ceusters,et al.  Mistakes in medical ontologies: where do they come from and how can they be detected? , 2004, Studies in health technology and informatics.

[8]  Mark S. Tuttle,et al.  NCI Thesaurus: Using Science-Based Terminology to Integrate Cancer Research Results , 2004, MedInfo.

[9]  Barry Smith,et al.  The Role of Foundational Relations in the Alignment of Biomedical Ontologies , 2004, MedInfo.

[10]  Stefan Schulz,et al.  Towards a Broad-Coverage Biomedical Ontology Based on Description Logics , 2002, Pacific Symposium on Biocomputing.

[11]  Werner Ceusters,et al.  Ontology and medical terminology: Why description logics are not enough , 2003 .

[12]  Steffen Schulze-Kremer,et al.  Revising the UMLS Semantic Network , 2004 .

[13]  Barry Smith,et al.  Beyond Concepts: Ontology as Reality Representation , 2004 .

[14]  Steffen Schulze-Kremer,et al.  The Ontology of the Gene Ontology , 2003, AMIA.

[15]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[16]  Olivier Bodenreider,et al.  The Ontology-Epistemology Divide: A Case Study in Medical Terminology. , 2004, Formal ontology in information systems : proceedings of the ... International Conference. FOIS.

[17]  James J. Cimino,et al.  Research Paper: Auditing the Unified Medical Language System with Semantic Methods , 1998, J. Am. Medical Informatics Assoc..

[18]  Barry Smith,et al.  Biodynamic ontology: applying BFO in the biomedical domain. , 2004, Studies in health technology and informatics.

[19]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[20]  Peter Spyns,et al.  Ontologies: a revamped cross-disciplinary buzzword or a truly promising interdisciplinary research topic? , 2021, Linguistica Antverpiensia, New Series – Themes in Translation Studies.

[21]  A. Kumara,et al.  The Unified Medical Language System and the Gene Ontology : Some Critical Reflections , 2003 .

[22]  Yves A. Lussier,et al.  Putting Data Integration into Practice: Using Biomedical Terminologies to Add Structure to Existing Data Sources , 2003, AMIA.

[23]  Jordan B. Peterson The Meaning of Meaning , 2007 .

[24]  Christian Nøhr,et al.  Medinfo 2004: Proceedings Of THe 11th World Congress On Medical Informatics , 2004 .

[25]  Luc Schneider,et al.  Ontological Foundations of Natural Language Communication in Multiagent Systems , 2003, KES.