Language Complexity in On-line Health Information Retrieval

The number of people searching for on-line health information has been steadily growing over the years so it is crucial to understand their specific requirements in order to help them finding easily and quickly the specific in-formation they are looking for. Although generic search engines are typically used by health information seekers as the starting point for searching information, they have been shown to be limited and unsatisfactory because they make generic searches, often overloading the user with the provided amount of results. Moreover, they are not able to provide specific information to different types of users. At the same time, specific search engines mostly work on medical literature and provide extracts from medical journals that are mainly useful for medical researchers and experts but not for non-experts. A question then arises: Is it possible to facilitate the search of on-line health/medical information based on specific user requirements? In this pa-per, after analysing the main characteristics and requirements of on-line health seeking, we provide a first answer to this question by exploiting the Web structured data for the health domain and presenting a system that allows different types of users, i.e., non-medical experts and medical experts, to retrieve Web pages with language complexity levels suitable to their expertise. Furthermore, we apply our methodology to the results of a generic search engine, such as Google, in order to re-rank them and provide different users with the proper health/medical Web pages in terms of language complexity.

[1]  Davide Taibi,et al.  ULearn: Personalized Medical Learning on the Web for Patient Empowerment , 2019, ICWL.

[2]  Karin M. Verspoor,et al.  Better Health Explorer: Designing for Health Information Seekers , 2015, OZCHI.

[3]  Jianxing Chi,et al.  Automatic Classification of Users’ Health Information Need Context: Logistic Regression Analysis of Mouse-Click and Eye-Tracker Data , 2017, Journal of medical Internet research.

[4]  Davide Taibi,et al.  Towards integration of web data into a coherent educational data graph , 2013, WWW '13 Companion.

[5]  Markus Helfert,et al.  Provision of Tailored Health Information for Patient Empowerment: An Initial Study , 2019, CompSysTech.

[6]  Giosuè Lo Bosco,et al.  An Automatic System for Helping Health Consumers to Understand Medical Texts , 2015, HEALTHINF.

[7]  David Kauchak,et al.  Improving Perceived and Actual Text Difficulty for Health Information Consumers using Semi-Automated Methods , 2012, AMIA.

[8]  Yin Yang,et al.  A study of medical and health queries to web search engines. , 2004, Health information and libraries journal.

[9]  Alla Keselman,et al.  Towards Consumer-Friendly PHRs: Patients' Experience with Reviewing Their Health Records , 2007, AMIA.

[10]  Helen Hasan,et al.  Understanding the Diversity of User Requirements for Interactive Online Health Services , 2016 .

[11]  Mathieu d'Aquin,et al.  Analysing and Improving Embedded Markup of Learning Resources on the Web , 2017, WWW.

[12]  Giosuè Lo Bosco,et al.  A web search methodology for health consumers , 2014, CompSysTech.

[13]  Giosuè Lo Bosco,et al.  U-MedSearch: A Meta Search Engine of Medical Content for Different Users and Learning Needs , 2015 .

[14]  Hao Yang,et al.  MedSearch: a specialized search engine for medical information retrieval , 2008, CIKM '08.

[15]  Célia Boyer,et al.  D8.1.1. Requirements for the general public health search , 2011 .

[16]  S. Baker,et al.  A Population-Based Survey , 2017 .

[17]  Davide Taibi,et al.  Development and Practical Use of a Medical Vocabulary-Thesaurus-Dictionary for Patient Empowerment , 2018, CompSysTech.

[18]  Antonio Hortelano La encuesta sobre Equipamiento y Uso de Tecnologías de Información y Comunicación en los Hogares (TIC-H) del INE , 2013 .

[19]  d'Anquin Analysing and Improving Embedded Markup of Learning Resources on the Web , 2017 .

[20]  Alla Keselman,et al.  Developing informatics tools and strategies for consumer-centered health communication. , 2008, Journal of the American Medical Informatics Association : JAMIA.

[21]  Christian Bizer,et al.  The WebDataCommons Microdata, RDFa and Microformat Dataset Series , 2014, International Semantic Web Conference.

[22]  Markus Helfert,et al.  Facilitating Access to Health Web Pages with Different Language Complexity Levels , 2019, ICT4AWE.

[23]  David Kauchak,et al.  Improving Consumer Understanding of Medical Text: Development and Validation of a New SubSimplify Algorithm to Automatically Generate Term Explanations in English and Spanish , 2018, Journal of medical Internet research.

[24]  C. Chronaki,et al.  eHealth Trends in Europe 2005-2007: A Population-Based Survey , 2008, Journal of medical Internet research.

[25]  Alla Keselman,et al.  Research Paper: Consumer Health Information Seeking as Hypothesis Testing , 2008, J. Am. Medical Informatics Assoc..