Semantic Similarity and Relatedness between Clinical Terms: An Experimental Study.

Automated approaches to measuring semantic similarity and relatedness can provide necessary semantic context information for information retrieval applications and a number of fundamental natural language processing tasks including word sense disambiguation. Challenges for the development of these approaches include the limited availability of validated reference standards and the need for better understanding of the notions of semantic relatedness and similarity in medical vocabulary. We present results of a study in which eight medical residents were asked to judge 724 pairs of medical terms for semantic similarity and relatedness. The results of the study confirm the existence of a measurable mental representation of semantic relatedness between medical terms that is distinct from similarity and independent of the context in which the terms occur. This study produced a validated publicly available dataset for developing automated approaches to measuring semantic relatedness and similarity.

[1]  J. Gabrieli,et al.  Effects of Semantic and Associative Relatedness on Automatic Priming , 1998 .

[2]  Hoa A. Nguyen,et al.  A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[4]  Sharon L. Thompson-Schill,et al.  Predicting judged similarity of natural categories from their neural representations , 2009, Neuropsychologia.

[5]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[6]  A. Venot,et al.  Appraisal of the MedDRA Conceptual Structure for Describing and Grouping Adverse Drug Reactions , 2005, Drug safety.

[7]  James J. Cimino,et al.  Towards the development of a conceptual distance metric for the UMLS , 2004, J. Biomed. Informatics.

[8]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[9]  A. Tversky Features of Similarity , 1977 .

[10]  Ted Pedersen,et al.  UMLS-Interface and UMLS-Similarity : Open Source Software for Measuring Paths and Semantic Similarity , 2009, AMIA.

[11]  L. Ferrand,et al.  Quand « Amour » amorce « Soleil » (ou pourquoi l'amorçage affectif n'est pas un (simple) cas d'amorçage sémantique). , 2006 .

[12]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[13]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[14]  Ted Pedersen,et al.  Measures of semantic similarity and relatedness in the biomedical domain , 2007, J. Biomed. Informatics.

[15]  Patrice Degoulet,et al.  Using semantic distance for the efficient coding of medical concepts , 2000, AMIA.

[16]  Mark A. Musen,et al.  Comparison of Ontology-based Semantic-Similarity Measures , 2008, AMIA.