Mining Electronic Health Records to Guide and Support Clinical Decision Support Systems

Clinical decision support systems depend on well-designed electronic health record (EHR) systems, and vice versa. The data stored or captured in EHRs are diverse, including demographics, billing, medications, and laboratory reports, and can be categorized as structured, semi-structured, or unstructured. Various data and text mining techniques have been used to extract these data from EHRs for use in decision support, quality improvement, and research. Mining EHRs has been used to identify cohorts, correlated phenotypes in genome-wide association studies, disease correlations and risk factors, and drug-drug interactions, and to improve health services. However, mining EHR data remains challenging, with many issues and barriers. The aim of this chapter is to discuss how data and text mining techniques may guide and support the building of improved clinical decision support systems.
