Deidentification of free-text medical records using pre-trained bidirectional transformers
暂无分享,去创建一个
Alistair E. W. Johnson | Tom J. Pollard | Lucas Bulgarelli | A. Johnson | T. Pollard | Lucas Bulgarelli
[1] Tim Kraska,et al. Custodes: Auditable Hypothesis Testing , 2019, ArXiv.
[2] Peter Szolovits,et al. Automated de-identification of free-text medical records , 2008, BMC Medical Informatics Decis. Mak..
[3] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[4] Iz Beltagy,et al. SciBERT: A Pretrained Language Model for Scientific Text , 2019, EMNLP.
[5] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.
[6] Peter Szolovits,et al. MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.
[7] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[8] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.
[9] Lysandre Debut,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[10] Yiming Yang,et al. XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.
[11] Michele Filannino,et al. De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1. , 2017, Journal of biomedical informatics.
[12] Michael Mayo,et al. A survey of automatic de-identification of longitudinal clinical narratives , 2018, ArXiv.
[13] David Sontag,et al. Why Is My Classifier Discriminatory? , 2018, NeurIPS.
[14] Elizabeth Ford,et al. For the greater good? Patient and public attitudes to use of medical free text data in research , 2017, International Journal of Population Data Science.
[15] Elizabeth Ford,et al. "Giving something back": A systematic review and ethical enquiry of public opinions on the use of patient data for research in the United Kingdom and the Republic of Ireland. , 2018, Wellcome open research.
[16] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[17] Jaewoo Kang,et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..
[18] Yossi Matias,et al. Customization scenarios for de-identification of clinical notes , 2020, BMC Medical Informatics and Decision Making.
[19] Peter Szolovits,et al. Evaluating the state-of-the-art in automatic de-identification. , 2007, Journal of the American Medical Informatics Association : JAMIA.
[20] Özlem Uzuner,et al. Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1 , 2015, J. Biomed. Informatics.
[21] Alec Radford,et al. Improving Language Understanding by Generative Pre-Training , 2018 .
[22] R'emi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[23] Yaoyun Zhang,et al. A hybrid approach to automatic de-identification of psychiatric notes. , 2017, Journal of biomedical informatics.
[24] Lynette Hirschman,et al. The MITRE Identification Scrubber Toolkit: Design, training, and assessment , 2010, Int. J. Medical Informatics.
[25] Franck Dernoncourt,et al. NeuroNER: an easy-to-use program for named-entity recognition based on neural networks , 2017, EMNLP.
[26] Özlem Uzuner,et al. Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus , 2015, J. Biomed. Informatics.
[27] Peter Norvig,et al. The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.
[28] Franck Dernoncourt,et al. De-identification of patient notes with recurrent neural networks , 2016, J. Am. Medical Informatics Assoc..
[29] Lynda L. McGhie,et al. THE HEALTH INSURANCE PORTABILITY AND ACCOUNTABILITY ACT , 2004 .
[30] Kevin Gimpel,et al. ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.
[31] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[32] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[33] R. Califf,et al. Health Insurance Portability and Accountability Act (HIPAA): must there be a trade-off between privacy and quality of health care, or can we advance both? , 2003, Circulation.
[34] Wei-Hung Weng,et al. Publicly Available Clinical BERT Embeddings , 2019, Proceedings of the 2nd Clinical Natural Language Processing Workshop.
[35] Liam Peyton,et al. A unified framework for evaluating the risk of re-identification of text de-identification tools , 2016, J. Biomed. Informatics.
[36] Sameer Singh,et al. Do NLP Models Know Numbers? Probing Numeracy in Embeddings , 2019, EMNLP.
[37] Xiaolong Wang,et al. De-identification of clinical notes via recurrent neural network and conditional random field. , 2017, Journal of biomedical informatics.
[38] Xiaolong Wang,et al. Automatic de-identification of electronic medical records using token-level and character-level conditional random fields , 2015, J. Biomed. Informatics.
[39] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.