Similarity of medical concepts in question and answering of health communities

The ability to automatically categorize submitted questions based on topics and suggest similar question and answer to the users reduces the number of redundant questions. Our objective was to compare intra-topic and inter-topic similarity between question and answers by using concept-based similarity computing analysis. We gathered existing question and answers from several popular online health communities. Then, Unified Medical Language System concepts related to selected questions and experts in different topics were extracted and weighted by term frequency -inverse document frequency values. Finally, the similarity between weighted vectors of Unified Medical Language System concepts was computed. Our result showed a considerable gap between intra-topic and inter-topic similarities in such a way that the average of intra-topic similarity (0.095, 0.192, and 0.110, respectively) was higher than the average of inter-topic similarity (0.012, 0.025, and 0.018, respectively) for questions of the top 3 popular online communities including NetWellness, WebMD, and Yahoo Answers. Similarity scores between the content of questions answered by experts in the same and different topics were calculated as 0.51 and 0.11, respectively. Concept-based similarity computing methods can be used in developing intelligent question and answering retrieval systems that contain auto recommendation functionality for similar questions and experts.

[1]  Robert M. Anderson,et al.  Patient empowerment: reflections on the challenge of fostering the adoption of a new paradigm. , 2005, Patient education and counseling.

[2]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[3]  R. Croyle,et al.  Frustrated and Confused: The American Public Rates its Cancer-Related Information-Seeking Experiences , 2008, Journal of General Internal Medicine.

[4]  Robert A. Greenes,et al.  Positive attitudes and failed queries: an exploration of the conundrums of consumer health information retrieval , 2004, Int. J. Medical Informatics.

[5]  Reijo Savolainen,et al.  Time as a context of information seeking , 2006 .

[6]  D. Lewis,et al.  Consumer health informatics : informing consumers and improving health care , 2010 .

[7]  Christian Köhler,et al.  How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews , 2002, BMJ : British Medical Journal.

[8]  Peter L. Elkin,et al.  Consumer Health Informatics: Informing Consumers and Improving Health Care , 2006 .

[9]  Yan Zhang,et al.  Consumer health information searching process in real life settings , 2012, ASIST.

[10]  Stephen A. Marine,et al.  NetWellness 1995 - 2005: Ten Years of Experience and Growth as a NonProfit Consumer Health Information and Ask-an-Expert Service , 2005, AMIA.

[11]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[12]  Kam-Fai Wong,et al.  Interpreting TF-IDF term weights as making relevance decisions , 2008, TOIS.

[13]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[14]  Elaine Toms,et al.  How consumers search for health information , 2007, Health Informatics J..

[15]  Gary Hsieh,et al.  The presentation of health-related search results and its impact on negative emotional outcomes , 2013, CHI.

[16]  B M Wildemuth,et al.  Information-seeking behaviors of medical students: a classification of questions asked of librarians and physicians. , 1994, Bulletin of the Medical Library Association.

[17]  R. Lefebvre,et al.  Digital social networks and health. , 2013, Circulation.

[18]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[19]  Yan Zhang,et al.  Searching : an Analysis of Questions in a Social Q & A Community , 2010 .

[20]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[21]  Elizabeth Sillence,et al.  Trust and mistrust of online health sites , 2004, CHI.

[22]  Akiko Aizawa,et al.  An information-theoretic perspective of tf-idf measures , 2003, Inf. Process. Manag..

[23]  Guo Zhang,et al.  Scholarly conformity: Origins, framework, applications and implications , 2012, ASIST.

[24]  E. Coiera,et al.  Impact of Web Searching and Social Feedback on Consumer Decision Making: A Prospective Online Experiment , 2008, Journal of medical Internet research.