Paper Information - Analysis of Error Concentrations in SNOMED

Analysis of Error Concentrations in SNOMED

Two high-level abstraction networks for the knowledge content of a terminology, known respectively as the "area taxonomy" and the "p-area taxonomy," have previously been defined. Both are derived automatically from partitions of the terminology's concepts. An important application of these networks is auditing, for which a number of systematic regimens utilizing them have been formulated. In particular, the taxonomies tend to highlight certain kinds of concept groups where errors are more likely to be found. Using results from applications of our auditing regimens to SNOMED CT, we investigate the concentration of errors among such groups. Three hypotheses pertaining to the error distributions are put forth. The results support the claim that certain groups presented by the taxonomies show higher error percentages than other groups. The bootstrap is used to assess the statistical significance of these results. This knowledge will help direct auditing efforts to increase their impact.
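
The abstract does not say how the bootstrap was applied, so the following is only a minimal sketch of one common variant: a percentile bootstrap confidence interval for the difference in error rates between two concept groups. The function name `bootstrap_diff_ci`, its parameters, and the toy 0/1 error flags are all hypothetical and are not taken from the paper or from SNOMED CT.

```python
import random


def bootstrap_diff_ci(group_a, group_b, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for (error rate of A) - (error rate of B).

    group_a, group_b: lists of 0/1 flags, 1 meaning the concept was found erroneous.
    This is an illustrative variant, not necessarily the procedure used in the paper.
    """
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_resamples):
        # Resample each group with replacement, keeping its original size.
        resample_a = rng.choices(group_a, k=len(group_a))
        resample_b = rng.choices(group_b, k=len(group_b))
        rate_a = sum(resample_a) / len(resample_a)
        rate_b = sum(resample_b) / len(resample_b)
        diffs.append(rate_a - rate_b)
    diffs.sort()
    # Take the empirical alpha/2 and 1 - alpha/2 quantiles of the resampled differences.
    lower = diffs[int((alpha / 2) * n_resamples)]
    upper = diffs[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


# Made-up toy data (NOT results from the paper): 30 erroneous concepts out of 100
# in a group the taxonomy highlights, versus 10 out of 100 in a comparison group.
highlighted = [1] * 30 + [0] * 70
comparison = [1] * 10 + [0] * 90

lower, upper = bootstrap_diff_ci(highlighted, comparison)
print(f"95% bootstrap CI for the difference in error rates: [{lower:.3f}, {upper:.3f}]")
```

Under this sketch, an interval that lies entirely above zero would suggest that the highlighted group has a higher error rate than the comparison group at the chosen level.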
