论文信息 - Team IRLabDAIICT at ShARe/CLEF eHealth 2014 Task 3: User-centered Information Retrieval System for Clinical Documents

Team IRLabDAIICT at ShARe/CLEF eHealth 2014 Task 3: User-centered Information Retrieval System for Clinical Documents

In this paper we, Team IRLabDAIICT, describe our participation in the ShARe/CLEF ehealth 2014 task 3: Information Retrieval for addressing questions related to patients health based on clinical reports. We submitted a total of six runs out of the seven in this years task. In our approach we focus on examining the relevance between the documents and user generated query by conducting experiments through query analysis. Our major challenge is to bridge the conceptual gap between the user-generated queries (in-formal query) to biomedical specific terminology (formal query). We incorporate the MeSH (Medical Subject Headings) library , which is a medical thesaurus mapping layman terms to medical synonym terms in order to target the concept matching problem. We use blind relevance feedback model for relevance feedback and query-likelihood model for query expansion which performed the best in the experiments conducted by us. The retrieval system is evaluated based on various parameters as: mean average precision, precision (P@5), precision (P@10), NDCG@5 and NDCG@10, with P@10 and NDCG@10 being the primary and secondary evaluation measures. The experiments were conducted on the gigantic 43.6 GB ShARe/CLEF 2013 Task 3 dataset harvested using (a) EU-FP7 Khresmoi project and and (b) a new 2014 set of English general realistic public queries based on the discharge summary contents. We have obtained the highest result in our baseline run (run 1), with compared to our other five runs, which is 0.706 as declared by ShARe/CLEF organizing committee. We further propose to incorporate a machine learning based retrieval algorithm prediction model for further exploration.

Prasenjit Majumder | Ganesh Iyer | Harsh Thakkar | Kesha Shah

[1] Sanna Salanterä,et al. Overview of the ShARe/CLEF eHealth Evaluation Lab 2013 , 2013, CLEF.

[2] Julie Fisher,et al. User Centred Quality Health Information Provision: Benefits and Challenges , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[3] Gareth J. F. Jones,et al. ShARe/CLEF eHealth Evaluation Lab 2014, Task 3: User-centred Health Information Retrieval , 2014, CLEF.

[4] Carla Teixeira Lopes,et al. Health Information Retrieval - State of the art report , 2022, ArXiv.

[5] Hinrich Schütze,et al. Introduction to information retrieval , 2008 .

[6] Sanna Salanterä,et al. ShARe/CLEF eHealth Evaluation Lab 2013, Task 3: Information Retrieval to Address Patients' Questions when Reading Clinical Reports , 2013, CLEF.

[7] Stephen E. Robertson,et al. Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[8] Hongfang Liu,et al. Using Discharge Summaries to Improve Information Retrieval in Clinical Domain , 2013, CLEF.

[9] Sen Na,et al. Concept-based Medical Document Retrieval: THCIB at CLEF eHealth Lab 2013 Task 3 , 2013, CLEF.

[10] W. Bruce Croft,et al. A language modeling approach to information retrieval , 1998, SIGIR '98.

[11] Alan R. Aronson,et al. An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..