Finding Clinical Knowledge from MEDLINE Abstracts by Text Summarization Technique

Today, the MEDLINE is an important repository containing more than 26 million citations and abstracts in the fields of medicine, while PubMed provides free access to MEDLINE and links to full-text articles. MEDLINE abstracts becomes a potential source of new knowledge in medical field. However, it is time-consuming and labour-intensive to find knowledge from MEDLINE abstracts, when a search returns much abstracts and each may contain a large volume of information. Therefore, this work aims to present a method of summarizing clinical knowledge from a MEDLINE abstract. The main mechanisms of the proposed method are driven on natural language processing (NLP) and text filtering techniques. The case study of this work is to summarize the clinical knowledge from a MEDLINE abstracts relating to cervical cancer in clinical trials. In the evaluation stage, the actual results obtained from a domain expert are used to compare the predicted results. After testing by recall, precision, and F-score, they return the satisfactory results, where the average of recall, precision, and F-measure are 0.84, 1.00, and 0.91 respectively.

[1]  Tharam S. Dillon,et al.  Thinking PubMed: an Innovative System for Mental Health Domain , 2008, 2008 21st IEEE International Symposium on Computer-Based Medical Systems.

[2]  Rong Xu,et al.  Comparing a knowledge-driven approach to a supervised machine learning approach in large-scale extraction of drug-side effect relationships from free-text biomedical literature , 2015, BMC Bioinformatics.

[3]  Dezon Finch,et al.  TagLine: Information Extraction for Semi-Structured Text Elements In Medical Progress Notes , 2012 .

[4]  Kazuhiko Ohe,et al.  TEXT2TABLE: Medical Text Summarization System Based on Named Entity Recognition and Modality Identification , 2009, BioNLP@HLT-NAACL.

[5]  Dezon Finch,et al.  TagLine: Information Extraction for Semi-Structured Text in Medical Progress Notes , 2014, AMIA.

[6]  Cheng-Zen Yang,et al.  Duplication Detection for Software Bug Reports Based on BM25 Term Weighting , 2012, 2012 Conference on Technologies and Applications of Artificial Intelligence.

[7]  Hua Xu,et al.  A hybrid system for temporal information extraction from clinical text , 2013, J. Am. Medical Informatics Assoc..

[8]  Malik Yousef,et al.  One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..

[9]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[10]  Jimmy J. Lin,et al.  Knowledge Extraction for Clinical Question Answering: Preliminary Results , 2005 .

[11]  Xiaodong Chen,et al.  BMExpert: Mining MEDLINE for Finding Experts in Biomedical Domains Based on Language Model , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  Vishal Gupta,et al.  Recent automatic text summarization techniques: a survey , 2016, Artificial Intelligence Review.

[13]  Sharvari Govilkar,et al.  Comparative Study of Text Summarization Methods , 2014 .

[14]  Sung-Hyon Myaeng,et al.  Procedural Knowledge Extraction on MEDLINE Abstracts , 2011, AMT.

[15]  Jimmy J. Lin,et al.  Answering Clinical Questions with Knowledge-Based and Statistical Techniques , 2007, CL.

[16]  Rong Xu,et al.  A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text , 2012, J. Biomed. Informatics.

[17]  Guilherme Del Fiol,et al.  Text summarization in the biomedical domain: A systematic review of recent research , 2014, J. Biomed. Informatics.

[18]  Snehasis Mukhopadhyay,et al.  Knowledge Extraction and Extrapolation Using Ancient and Modern Biomedical Literature , 2009, 2009 International Conference on Advanced Information Networking and Applications Workshops.

[19]  Zhiyong Lu,et al.  PubMed and beyond: a survey of web tools for searching biomedical literature , 2011, Database J. Biol. Databases Curation.

[20]  Panagiotis Stamatopoulos,et al.  Summarization from Medical Documents: A Survey , 2005, Artif. Intell. Medicine.

[21]  Guilherme Del Fiol,et al.  Automatically Extracting Sentences from Medline Citations to Support Clinicians' Information Needs , 2012, 2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology.

[22]  Gurpreet Singh Lehal,et al.  A Survey of Text Summarization Extractive Techniques , 2010 .

[23]  Kamal Sarkar,et al.  Using Domain Knowledge for Text Summarization in Medical Domain , 2009 .