Evaluating alignment quality between iconic language and reference terminologies using similarity metrics

BackgroundVisualization of Concepts in Medicine (VCM) is a compositional iconic language that aims to ease information retrieval in Electronic Health Records (EHR), clinical guidelines or other medical documents. Using VCM language in medical applications requires alignment with medical reference terminologies. Alignment from Medical Subject Headings (MeSH) thesaurus and International Classification of Diseases – tenth revision (ICD10) to VCM are presented here. This study aim was to evaluate alignment quality between VCM and other terminologies using different measures of inter-alignment agreement before integration in EHR.MethodsFor medical literature retrieval purposes and EHR browsing, the MeSH thesaurus and the ICD10, both organized hierarchically, were aligned to VCM language. Some MeSH to VCM alignments were performed automatically but others were performed manually and validated. ICD10 to VCM alignment was entirely manually performed. Inter-alignment agreement was assessed on ICD10 codes and MeSH descriptors, sharing the same Concept Unique Identifiers in the Unified Medical Language System (UMLS). Three metrics were used to compare two VCM icons: binary comparison, crude Dice Similarity Coefficient (DSCcrude), and semantic Dice Similarity Coefficient (DSCsemantic), based on Lin similarity. An analysis of discrepancies was performed.ResultsMeSH to VCM alignment resulted in 10,783 relations: 1,830 of which were manually performed and 8,953 were automatically inherited. ICD10 to VCM alignment led to 19,852 relations. UMLS gathered 1,887 alignments between ICD10 and MeSH. Only 1,606 of them were used for this study. Inter-alignment agreement using only validated MeSH to VCM alignment was 74.2% [70.5-78.0]CI95%, DSCcrude was 0.93 [0.91-0.94]CI95%, and DSCsemantic was 0.96 [0.95-0.96]CI95%. Discrepancy analysis revealed that even if two thirds of errors came from the reviewers, UMLS was nevertheless responsible for one third.ConclusionsThis study has shown strong overall inter-alignment agreement between MeSH to VCM and ICD10 to VCM manual alignments. VCM icons have now been integrated into a guideline search engine (http://www.cismef.org) and a health terminologies portal (http://www.hetop.eu).

[1]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[2]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[3]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[4]  H. Toutenburg Fleiss, J. L.: Statistical Methods for Rates and Proportions. John Wiley & Sons, New York‐London‐Sydney‐Toronto 1973. XIII, 233 S. , 1974 .

[5]  J. Fleiss Measuring agreement between two judges on the presence or absence of a trait. , 1975, Biometrics.

[6]  Lawrence E. Leonard,et al.  Inter-Indexer Consistency and Retrieval Effectiveness: Measurement of Relationships , 1975 .

[7]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[8]  W. Grove Statistical Methods for Rates and Proportions, 2nd ed , 1981 .

[9]  D. Lindberg,et al.  Unified Medical Language System , 2020, Definitions.

[10]  D. Lindberg,et al.  The Unified Medical Language System , 1993, Methods of Information in Medicine.

[11]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[12]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[13]  Timothy E. McMahon,et al.  National library of medicine web site: (http: //www.nlm.nih.gov/.) National Institutes of Health, U.S. Department of Health and Human Services. Reviewed in July 1998 , 1999, Gov. Inf. Q..

[14]  Olivier Bodenreider,et al.  An Evaluation of Hybrid Methods for Matching Biomedical Terminologies: Mapping the Gene Ontology to the UMLS®$ , 2003, MIE.

[15]  Olivier Bodenreider,et al.  Utilizing the UMLS for Semantic Mapping between Terminologies , 2005, AMIA.

[16]  Ali Shiri,et al.  Challenges and issues in terminology mapping: a digital library perspective , 2005, Electron. Libr..

[17]  George Hripcsak,et al.  Technical Brief: Agreement, the F-Measure, and Reliability in Information Retrieval , 2005, J. Am. Medical Informatics Assoc..

[18]  Olivier Bodenreider,et al.  Besides Precision & Recall: Exploring Alternative Approaches to Evaluating an Automatic Indexing Tool for MEDLINE , 2006, AMIA.

[19]  H. C. Coumou,et al.  How do primary care physicians seek answers to clinical questions? A literature review. , 2006, Journal of the Medical Library Association : JMLA.

[20]  Olivier Bodenreider,et al.  Combining Lexical and Semantic Methods of Inter-terminology Mapping Using the UMLS , 2007, MedInfo.

[21]  Jean-Baptiste Lamy,et al.  Design of a graphical and interactive interface for facilitating access to drug contraindications, cautions for use, interactions and adverse effects , 2008, BMC Medical Informatics Decis. Mak..

[22]  Anders Grimsmo,et al.  Instant availability of patient records, but diminished availability of patient information: A multi-method study of GP's use of electronic patient records , 2008, BMC Medical Informatics Decis. Mak..

[23]  Anke J. E. de Veer,et al.  Factors influencing the implementation of clinical guidelines for health care professionals: A systematic meta-review , 2008, BMC Medical Informatics Decis. Mak..

[24]  P. Wieteck Furthering the development of standardized nursing terminology through an ENP-ICNP cross-mapping. , 2008, International nursing review.

[25]  Alain Venot,et al.  An iconic language for the graphical representation of medical concepts , 2008, BMC Medical Informatics Decis. Mak..

[26]  Stéfan Jacques Darmoni,et al.  Towards iconic language for patient records, drug monographs, guidelines and medical search engines , 2010, MedInfo.

[27]  Thomas H. Payne,et al.  Transition from Paper to Electronic Inpatient Physician Notes Case Description , 2022 .

[28]  Olivier Bodenreider,et al.  Exploiting UMLS Semantics for Checking Semantic Consistency among UMLS concepts , 2010, MedInfo.

[29]  T. Merabti,et al.  Teaching medicine with a terminology/ontology portal. , 2012, Studies in health technology and informatics.

[30]  Lina Fatima Soualmia,et al.  Validating the semantics of a medical iconic language using ontological reasoning , 2013, J. Biomed. Informatics.