A Review of the End-to-End Methodologies for Clinical Concept Extraction

Our study provided a review of the concept extraction literature from January 2009 to June 2019. The systematic summarization of concept extraction methodologic development processes illustrated the diversity, complexity, usability, challenges and limitations of both rule-based and statistical traditional machine learning approaches for clinical concept extraction.

[1]  Hui Chen,et al.  GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition , 2019, AAAI.

[2]  David L. Buckeridge,et al.  A novel method of adverse event detection can accurately identify venous thromboembolisms (VTEs) from narrative electronic health record data , 2014, J. Am. Medical Informatics Assoc..

[3]  Christopher G Chute,et al.  An Information Extraction Framework for Cohort Identification Using Electronic Health Records , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[4]  Paul A. Harris,et al.  PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability , 2016, J. Am. Medical Informatics Assoc..

[5]  Jun'ichi Tsujii,et al.  Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries , 2012, J. Am. Medical Informatics Assoc..

[6]  Yaoyun Zhang,et al.  Semantic Role Labeling of Clinical Text: Comparing Syntactic Parsers and Features , 2016, AMIA.

[7]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[8]  Bruce E. Bray,et al.  Congestive heart failure information extraction framework for automated treatment performance measures assessment , 2017, J. Am. Medical Informatics Assoc..

[9]  P. Hinds,et al.  Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury. , 2016, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[10]  Elaine Marsh,et al.  MUC-7 Evaluation of IE Technology: Overview of Results , 1998, MUC.

[11]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[12]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[13]  Mike Conway,et al.  Extracting a stroke phenotype risk factor from Veteran Health Administration clinical reports: an information content analysis , 2016, Journal of Biomedical Semantics.

[14]  Qingxia Chen,et al.  An active learning-enabled annotation system for clinical named entity recognition , 2017, BMC Medical Informatics and Decision Making.

[15]  Zina M. Ibrahim,et al.  Improving RNN with Attention and Embedding for Adverse Drug Reactions , 2017, DH.

[16]  Ming Yang,et al.  Entity recognition from clinical texts via recurrent neural network , 2017, BMC Medical Informatics and Decision Making.

[17]  Allan Fong,et al.  Call Case Dashboard: Tracking R1 Exposure to High-Acuity Cases Using Natural Language Processing. , 2016, Journal of the American College of Radiology : JACR.

[18]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[19]  Zachary Chase Lipton The mythos of model interpretability , 2016, ACM Queue.

[20]  Feichen Shen,et al.  HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology , 2019, J. Biomed. Informatics.

[21]  Peter Szolovits,et al.  Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources , 2015, J. Am. Medical Informatics Assoc..

[22]  Sunghwan Sohn,et al.  Deep learning and alternative learning strategies for retrospective real-world clinical data , 2019, npj Digital Medicine.

[23]  Jonathan M. Garibaldi,et al.  Automatic detection of protected health information from clinic narratives , 2015, J. Biomed. Informatics.

[24]  Abeed Sarker,et al.  Portable automatic text classification for adverse drug reaction detection via multi-corpus training , 2015, J. Biomed. Informatics.

[25]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[26]  Hongfang Liu,et al.  Detection of Surgical Site Infection Utilizing Automated Feature Generation in Clinical Notes , 2018, Journal of Healthcare Informatics Research.

[27]  Herbert S. Chase,et al.  Early recognition of multiple sclerosis using natural language processing of the electronic health record , 2017, BMC Medical Informatics and Decision Making.

[28]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[29]  Min Li,et al.  High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge , 2010, J. Am. Medical Informatics Assoc..

[30]  Stéphane M. Meystre,et al.  Extraction of left ventricular ejection fraction information from various types of clinical reports , 2017, J. Biomed. Informatics.

[31]  Spencer S. Jones,et al.  Health Information Technology: An Updated Systematic Review With a Focus on Meaningful Use , 2014, Annals of Internal Medicine.

[32]  Sunghwan Sohn,et al.  Facilitating post-surgical complication detection through sublanguage analysis , 2014, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[33]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[34]  William J. Clancey,et al.  The Epistemology of a Rule-Based Expert System - A Framework for Explanation , 1981, Artif. Intell..

[35]  István Hegedüs,et al.  Research Paper: Semi-automated Construction of Decision Rules to Predict Morbidities from Clinical Texts , 2009, J. Am. Medical Informatics Assoc..

[36]  Harry A. Pierson,et al.  Deep learning in robotics: a review of recent research , 2017, Adv. Robotics.

[37]  Son Doan,et al.  Recognizing Medication related Entities in Hospital Discharge Summaries using Support Vector Machine , 2010, COLING.

[38]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[39]  Franck Dernoncourt,et al.  Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives , 2018, PloS one.

[40]  Kalpana Raja,et al.  Agile text mining for the 2014 i2b2/UTHealth Cardiac risk factors challenge , 2015, J. Biomed. Informatics.

[41]  Stéphane M. Meystre,et al.  Improving Heart Failure Information Extraction by Domain Adaptation , 2013, MedInfo.

[42]  Ying Li,et al.  Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality , 2014, J. Am. Medical Informatics Assoc..

[43]  Hongfang Liu,et al.  Detection of clinically important colorectal surgical site infection using Bayesian network. , 2017, The Journal of surgical research.

[44]  Qing Zeng-Treitler,et al.  A Suite of Natural Language Processing Tools Developed for the I2B2 Project , 2006, AMIA.

[45]  Yanshan Wang,et al.  Natural Language Processing for the Identification of Silent Brain Infarcts From Neuroimaging Reports , 2019, JMIR medical informatics.

[46]  D. Blumenthal,et al.  Achieving a Nationwide Learning Health System , 2010, Science Translational Medicine.

[47]  William Rose,et al.  Practical implementation of an existing smoking detection pipeline and reduced support vector machine training corpus requirements , 2014, J. Am. Medical Informatics Assoc..

[48]  Hua Xu,et al.  A hybrid system for temporal information extraction from clinical text , 2013, J. Am. Medical Informatics Assoc..

[49]  Jingqi Wang,et al.  Enhancing Clinical Concept Extraction with Contextual Embedding , 2019, J. Am. Medical Informatics Assoc..

[50]  Cyril Labbé,et al.  Named Entity Recognition Over Electronic Health Records Through a Combined Dictionary-based Approach , 2016, CENTERIS/ProjMAN/HCist.

[51]  Ankur Teredesai,et al.  Interpretable Machine Learning in Healthcare , 2018, BCB.

[52]  Wei-Hung Weng,et al.  Publicly Available Clinical BERT Embeddings , 2019, Proceedings of the 2nd Clinical Natural Language Processing Workshop.

[53]  Brian G. Arndt,et al.  Tethered to the EHR: Primary Care Physician Workload Assessment Using EHR Event Log Data and Time-Motion Observations , 2017, The Annals of Family Medicine.

[54]  Jonathan M. Garibaldi,et al.  A hybrid model for automatic identification of risk factors for heart disease , 2015, J. Biomed. Informatics.

[55]  Roland Vollgraf,et al.  Contextual String Embeddings for Sequence Labeling , 2018, COLING.

[56]  Sherri Rose,et al.  Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings , 2017, J. Am. Medical Informatics Assoc..

[57]  Christopher G Chute,et al.  A high throughput semantic concept frequency based approach for patient identification: a case study using type 2 diabetes mellitus clinical notes. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[58]  Clement J. McDonald,et al.  What can natural language processing do for clinical decision support? , 2009, J. Biomed. Informatics.

[59]  Xiaolong Wang,et al.  De-identification of clinical notes via recurrent neural network and conditional random field. , 2017, Journal of biomedical informatics.

[60]  Joshua C Denny,et al.  Automated extraction of clinical traits of multiple sclerosis in electronic medical records , 2013, Journal of the American Medical Informatics Association : JAMIA.

[61]  Shuying Shen,et al.  Textractor: a hybrid system for medications and reason for their prescription extraction from clinical text documents , 2010, J. Am. Medical Informatics Assoc..

[62]  Yefeng Wang,et al.  Cascading Classifiers for Named Entity Recognition in Clinical Notes , 2009, BiomedicalIE@RANLP.

[63]  Prakash M. Nadkarni,et al.  Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions , 2011, J. Am. Medical Informatics Assoc..

[64]  Sophia Ananiadou,et al.  Anatomical Entity Recognition with a Hierarchical Framework Augmented by External Resources , 2014, PloS one.

[65]  Robert J. Taylor,et al.  Implementation Brief: Description of a Rule-based System for the i2b2 Challenge in Natural Language Processing for Clinical Data , 2009, J. Am. Medical Informatics Assoc..

[66]  Kerstin Denecke,et al.  Extraction Of Adverse Events From Clinical Documents To Support Decision Making Using Semantic Preprocessing , 2015, MedInfo.

[67]  Wendy W. Chapman,et al.  Developing a natural language processing application for measuring the quality of colonoscopy procedures , 2011, J. Am. Medical Informatics Assoc..

[68]  Jay Urbain,et al.  Mining heart disease risk factors in clinical text with named entity recognition and distributional semantic models , 2015, J. Biomed. Informatics.

[69]  Nikolaos Doulamis,et al.  Deep Learning for Computer Vision: A Brief Review , 2018, Comput. Intell. Neurosci..

[70]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[71]  Ankur Agarwal,et al.  A Natural Language Processing Framework for Assessing Hospital Readmissions for Patients With COPD , 2018, IEEE Journal of Biomedical and Health Informatics.

[72]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[73]  Hua Xu,et al.  Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features , 2013, BMC Medical Informatics and Decision Making.

[74]  Hongfang Liu,et al.  A common type system for clinical natural language processing , 2013, J. Biomed. Semant..

[75]  Goran Nenadic,et al.  A text mining approach to the prediction of disease status from clinical discharge summaries. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[76]  Michael S. Lew,et al.  Deep learning for visual understanding: A review , 2016, Neurocomputing.

[77]  I. Kohane,et al.  Electronic medical records for discovery research in rheumatoid arthritis , 2010, Arthritis care & research.

[78]  Hongfang Liu,et al.  Clinical documentation variations and NLP system portability: a case study in asthma birth cohorts across institutions , 2017, J. Am. Medical Informatics Assoc..

[79]  Goran Nenadic,et al.  Deep learning meets ontologies: experiments to anchor the cardiovascular disease ontology in the biomedical literature , 2018, Journal of Biomedical Semantics.

[80]  Joshua C. Denny,et al.  The KnowledgeMap Project: Development of a Concept-Based Medical School Curriculum Database , 2003, AMIA.

[81]  Shyam Visweswaran,et al.  Automated annotation and classification of BI-RADS assessment from radiology reports , 2017, J. Biomed. Informatics.

[82]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[83]  Zhiyong Lu,et al.  NCBI at 2013 ShARe/CLEF eHealth Shared Task: Disorder Normalization in Clinical Notes with Dnorm , 2013, CLEF.

[84]  Chandra Bhagavatula,et al.  Semi-supervised sequence tagging with bidirectional language models , 2017, ACL.

[85]  Chunye Wang,et al.  A Hybrid Approach to Extracting Disorder Mentions from Clinical Notes , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[86]  Zhiyong Lu,et al.  Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets , 2019, BioNLP@ACL.

[87]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement , 2009, BMJ.

[88]  C. Zheng,et al.  Using Natural Language Processing and Machine Learning to Identify Gout Flares From Electronic Clinical Notes , 2014, Arthritis care & research.

[89]  Hongfang Liu,et al.  Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events , 2016, LREC.

[90]  Goran Nenadic,et al.  Automatic mining of symptom severity from psychiatric evaluation notes , 2017, International journal of methods in psychiatric research.

[91]  Stéphane M. Meystre,et al.  Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes , 2015, J. Biomed. Informatics.

[92]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[93]  Özlem Uzuner,et al.  Extracting medication information from clinical text , 2010, J. Am. Medical Informatics Assoc..

[94]  Peter Szolovits,et al.  Utilizing uncoded consultation notes from electronic medical records for predictive modeling of colorectal cancer , 2016, Artif. Intell. Medicine.

[95]  José García Rodríguez,et al.  A Review on Deep Learning Techniques Applied to Semantic Segmentation , 2017, ArXiv.

[96]  Shamkant B. Navathe,et al.  Identifying Patients with Depression Using Free-text Clinical Documents , 2015, MedInfo.

[97]  Wen-wai Yim,et al.  Structuring Free-text Microbiology Culture Reports For Secondary Use , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[98]  Yorick Wilks,et al.  Information Extraction: Beyond Document Retrieval , 1998, Int. J. Comput. Linguistics Chin. Lang. Process..

[99]  Hong-Jun Yoon,et al.  Automated histologic grading from free-text pathology reports using graph-of-words features and machine learning , 2017, 2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[100]  Dayne Freitag,et al.  Machine Learning for Information Extraction in Informal Domains , 2000, Machine Learning.

[101]  Tung Tran,et al.  Predicting mental conditions based on "history of present illness" in psychiatric notes with deep neural networks. , 2017, Journal of biomedical informatics.

[102]  Chao Zhao,et al.  WI-ENRE in CLEF eHealth Evaluation Lab 2015: Clinical Named Entity Recognition Based on CRF , 2015, CLEF.

[103]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[104]  Girija Chetty,et al.  A Multilevel NER Framework for Automatic Clinical Name Entity Recognition , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[105]  Hongfang Liu,et al.  Journal of Biomedical Informatics , 2022 .

[106]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[107]  Thomas C. Rindflesch,et al.  EDGAR: extraction of drugs, genes and relations from the biomedical literature. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.