Result Diversification in Clinical Case Reports Retrieval based on Main Finding

Clinical case reports are the ‘eyewitness’ in biomedical literature and provide a valuable, unique, albeit noisy and underutilized type of evidence. Main finding is the reason for writing up the reports. Main finding based case reports retrieval provides way for user to conveniently access information of eyewitness evidence. However, user retrieval requirements are often ambiguous and diverse, traditional similarity based retrieval mechanism cannot meet different needs of users. Here, we conduct research of result diversification in case reports retrieval based on main finding. First, four similarity measurements for comparing main finding contents are used for initial result ranking; second, two implicit reranking algorithms and two explicit reranking algorithms are applied for result diversification. Experimental result showed that the methods we used had improved sub-topics coverage rate (CR@ X%) in re-ranking result, which proved the effectiveness of our research work for improving result diversification degree.

[1]  Z. Mao,et al.  COVID-19 Infection in a Patient with End-Stage Kidney Disease , 2020, Nephron.

[2]  Neil R. Smalheiser,et al.  A manual corpus of annotated main findings of clinical case reports , 2019, Database.

[3]  B. Koçak,et al.  Case report: A kidney transplant patient with mild COVID‐19 , 2020, Transplant infectious disease : an official journal of the Transplantation Society.

[4]  Philip S. Yu,et al.  G-Bean: An ontology-graph based web tool for biomedical literature retrieval , 2013, BIBM.

[5]  Zixi Hong,et al.  COVID-19 in Hemodialysis Patients: A Report of 5 Cases , 2020, American Journal of Kidney Diseases.

[6]  Craig MacDonald,et al.  Exploiting query reformulations for web search result diversification , 2010, WWW '10.

[7]  Yongman Lv,et al.  Coronavirus disease 2019 (COVID-19) pneumonia in a hemodialysis patient , 2020, Medicine.

[8]  S. Niwattanakul,et al.  Using of Jaccard Coefficient for Keywords Similarity , 2022 .

[9]  Craig MacDonald,et al.  Explicit Search Result Diversification through Sub-queries , 2010, ECIR.

[10]  Xiaojin Zhu,et al.  Ranking Biomedical Passages for Relevance and Diversity: University of Wisconsin, Madison at TREC Genomics 2006 , 2006, TREC.

[11]  Yifan Peng,et al.  BioSentVec: creating sentence embeddings for biomedical texts , 2018, 2019 IEEE International Conference on Healthcare Informatics (ICHI).

[12]  Marti A. Hearst,et al.  A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text , 2002, Pacific Symposium on Biocomputing.

[13]  Hrvoje Barić,et al.  Why should medical editors CARE about case reports? , 2013, Croatian medical journal.

[14]  Wei Lu,et al.  Result diversification in image retrieval based on semantic distance , 2019, Inf. Sci..

[15]  Evaggelia Pitoura,et al.  Search result diversification , 2010, SGMD.

[16]  Neil R. Smalheiser,et al.  Unsupervised low-dimensional vector representations for words, phrases and text that are transparent, scalable, and produce similarity metrics that are not redundant with neural embeddings , 2019, J. Biomed. Informatics.

[17]  W. Bruce Croft,et al.  Diversity by proportionality: an election-based approach to search result diversification , 2012, SIGIR '12.

[18]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[19]  D. Du,et al.  A familial cluster, including a kidney transplant recipient, of Coronavirus Disease 2019 (COVID‐19) in Wuhan, China , 2020, American journal of transplantation : official journal of the American Society of Transplantation and the American Society of Transplant Surgeons.

[21]  Tetsuya Sakai,et al.  Search Result Diversification Based on Hierarchical Intents , 2015, CIKM.

[22]  Divesh Srivastava,et al.  On query result diversification , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[23]  Yu Jiang,et al.  Design and implementation of Metta, a metasearch engine for biomedical literature retrieval intended for systematic reviewers , 2014, Health Inf. Sci. Syst..

[24]  Ismail Sengör Altingövde,et al.  Explicit search result diversification using score and rank aggregation methods , 2015, J. Assoc. Inf. Sci. Technol..

[25]  V. Gopikrishna A report on case reports , 2010, Journal of conservative dentistry : JCD.

[26]  Aaron M Cohen,et al.  Identifying main finding sentences in clinical case reports , 2020, Database J. Biol. Databases Curation.

[27]  Mark Sanderson,et al.  Using score differences for search result diversification , 2014, SIGIR.

[28]  Gregory V. Bard,et al.  Spelling-Error Tolerant, Order-Independent Pass-Phrases via the Damerau-Levenshtein String-Edit Distance Metric , 2007, ACSW.