The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study

Background Negation and speculation are critical elements in natural language processing (NLP)-related tasks, such as information extraction, as these phenomena change the truth value of a proposition. In the clinical narrative that is informal, these linguistic facts are used extensively with the objective of indicating hypotheses, impressions, or negative findings. Previous state-of-the-art approaches addressed negation and speculation detection tasks using rule-based methods, but in the last few years, models based on machine learning and deep learning exploiting morphological, syntactic, and semantic features represented as spare and dense vectors have emerged. However, although such methods of named entity recognition (NER) employ a broad set of features, they are limited to existing pretrained models for a specific domain or language. Objective As a fundamental subsystem of any information extraction pipeline, a system for cross-lingual and domain-independent negation and speculation detection was introduced with special focus on the biomedical scientific literature and clinical narrative. In this work, detection of negation and speculation was considered as a sequence-labeling task where cues and the scopes of both phenomena are recognized as a sequence of nested labels recognized in a single step. Methods We proposed the following two approaches for negation and speculation detection: (1) bidirectional long short-term memory (Bi-LSTM) and conditional random field using character, word, and sense embeddings to deal with the extraction of semantic, syntactic, and contextual patterns and (2) bidirectional encoder representations for transformers (BERT) with fine tuning for NER. Results The approach was evaluated for English and Spanish languages on biomedical and review text, particularly with the BioScope corpus, IULA corpus, and SFU Spanish Review corpus, with F-measures of 86.6%, 85.0%, and 88.1%, respectively, for NeuroNER and 86.4%, 80.8%, and 91.7%, respectively, for BERT. Conclusions These results show that these architectures perform considerably better than the previous rule-based and conventional machine learning–based systems. Moreover, our analysis results show that pretrained word embedding and particularly contextualized embedding for biomedical corpora help to understand complexities inherent to biomedical text.

[1]  Horacio Rodríguez,et al.  Syntactic methods for negation detection in radiology reports in Spanish , 2016, BioNLP@ACL.

[2]  Ted Briscoe,et al.  Weakly Supervised Learning for Hedge Classification in Scientific Literature , 2007, ACL.

[3]  Xiaolong Wang,et al.  A Cascade Method for Detecting Hedges and their Scope in Natural Language Text , 2010, CoNLL Shared Task.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Bonnie L. Webber,et al.  Detecting negation scope is easy, except when it isn’t , 2017, EACL.

[6]  Prakash M. Nadkarni,et al.  Research Paper: Use of General-purpose Negation Detection to Augment Concept Indexing of Medical Documents: A Quantitative Study Using the UMLS , 2001, J. Am. Medical Informatics Assoc..

[7]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[8]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[9]  Grace I. Paterson,et al.  Systematized nomenclature of medicine clinical terms (SNOMED CT) to represent computed tomography procedures , 2011, Comput. Methods Programs Biomed..

[10]  Lluís Padró,et al.  Negation cues detection using CRF on Spanish product review texts , 2018 .

[11]  Montserrat Marimon,et al.  Annotation of negation in the IULA Spanish Clinical Record Corpus , 2017 .

[12]  Nigel Collier,et al.  The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers , 1999, EACL.

[13]  Koldo Gojenola,et al.  Learning to extract adverse drug reaction events from electronic health records in Spanish , 2016, Expert Syst. Appl..

[14]  Wang Ling,et al.  Two/Too Simple Adaptations of Word2Vec for Syntax Problems , 2015, NAACL.

[15]  Xiaoyan Li,et al.  Exploiting Multi-Features to Detect Hedges and their Scope in Biomedical Texts , 2010, CoNLL Shared Task.

[16]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[17]  Guodong Zhou,et al.  Tree Kernel-based Negation and Speculation Scope Detection with Structured Syntactic Parse Features , 2013, EMNLP.

[18]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[19]  Xuanjing Huang,et al.  Detecting Hedge Cues and their Scopes with Average Perceptron , 2010, CoNLL Shared Task.

[20]  Roser Morante,et al.  A review of Spanish corpora annotated with negation , 2018, COLING.

[21]  Lilja Øvrelid,et al.  Representing and Resolving Negation for Sentiment Analysis , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[22]  Gary D. Bader,et al.  Transfer learning for biomedical named entity recognition with neural networks , 2018, bioRxiv.

[23]  Y. Kato A natural history of negation. By LAURENCE R. HORN. Chicago: The University of Chicago Press, 1989. Pp. xxii, 637 , 1991 .

[24]  Cyril Grouin,et al.  Detecting negation of medical problems in French clinical notes , 2012, IHI '12.

[25]  Ting Liu,et al.  Negation Scope Detection with Recurrent Neural Networks Models in Review Texts , 2016, ICYCSEE.

[26]  Luis Alfonso Ureña López,et al.  SFU ReviewSP-NEG: a Spanish corpus annotated with negation for sentiment analysis. A typology of negation patterns , 2018, Lang. Resour. Evaluation.

[27]  Roser Morante,et al.  Memory-Based Resolution of In-Sentence Scopes of Hedge Cues , 2010, CoNLL Shared Task.

[28]  Xuan Wang,et al.  Exploiting Rich Features for Detecting Hedges and their Scope , 2010, CoNLL Shared Task.

[29]  Alexis Kalokerinos A natural history of negation , 1991 .

[30]  Maria Georgescul,et al.  A Hedgehop over a Max-Margin Framework Using Hedge Cues , 2010, CoNLL Shared Task.

[31]  Leonardo Campillos Llanos,et al.  A preliminary analysis of negation in a Spanish clinical records dataset∗ Análisis preliminar de la negación en un conjunto de informes cĺınicos en español , 2017 .

[32]  Franck Dernoncourt,et al.  Transfer Learning for Named-Entity Recognition with Neural Networks , 2017, LREC.

[33]  L. Donatelli Cues , Scope , and Focus : Annotating Negation in Spanish Corpora , 2018 .

[34]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[35]  Roser Morante,et al.  *SEM 2012 Shared Task: Resolving the Scope and Focus of Negation , 2012, *SEMEVAL.

[36]  Rui Yan,et al.  How Transferable are Neural Networks in NLP Applications? , 2016, EMNLP.

[37]  Stephan Oepen,et al.  Resolving Speculation: MaxEnt Cue Classification and Dependency-Based Scope Rules , 2010, CoNLL Shared Task.

[38]  Padmini Srinivasan,et al.  The Language of Bioscience: Facts, Speculations, and Statements In Between , 2004, HLT-NAACL 2004.

[39]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[40]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[41]  Dong Wen,et al.  Detecting negation and scope in Chinese clinical notes using character and word embedding , 2017, Comput. Methods Programs Biomed..

[42]  Gary D. Bader,et al.  Transfer learning for biomedical named entity recognition with neural networks , 2018 .

[43]  Wendy W. Chapman,et al.  ConText: An Algorithm for Identifying Contextual Features from Clinical Text , 2007, BioNLP@ACL.

[44]  Mariona Taulé,et al.  AnCora: Multilevel Annotated Corpora for Catalan and Spanish , 2008, LREC.

[45]  Barbara Di Eugenio,et al.  A Lucene and Maximum Entropy Model Based Hedge Detection System , 2010, CoNLL Shared Task.

[46]  Franck Dernoncourt,et al.  NeuroNER: an easy-to-use program for named-entity recognition based on neural networks , 2017, EMNLP.

[47]  Veronika Vincze CoNLL 2010 Shared Task Proposal: Learning to detect hedges and their scope in natural language texts , 2010 .

[48]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[49]  Timothy D. Imler,et al.  Clinical decision support with natural language processing facilitates determination of colonoscopy surveillance intervals. , 2014, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[50]  Juan Martínez-Romo,et al.  Extending a Deep Learning Approach for Negation Cues Detection in Spanish , 2019, IberLEF@SEPLN.

[51]  M. Goetz,et al.  Clinical performance pilot using cognitive computing for clinical trial matching at Mayo Clinic. , 2018 .

[52]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[53]  Hongfang Liu,et al.  DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx , 2015, J. Biomed. Informatics.

[54]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[55]  Ernestina Menasalvas Ruiz,et al.  An Approach to Detect Negation on Medical Documents in Spanish , 2014, Brain Informatics and Health.

[56]  Richard Socher,et al.  Learned in Translation: Contextualized Word Vectors , 2017, NIPS.

[57]  John Liu,et al.  sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings , 2015, ArXiv.

[58]  Maite Taboada,et al.  A review corpus annotated for negation, speculation and their scope , 2012, LREC.

[59]  Ralph Grishman,et al.  Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition , 1998, VLC@COLING/ACL.

[60]  Wei Luo,et al.  Speculation and Negation Scope Detection via Convolutional Neural Networks , 2016, EMNLP.

[61]  Sophia Ananiadou,et al.  Annotation and detection of drug effects in text for pharmacovigilance , 2018, Journal of Cheminformatics.

[62]  Mike Conway,et al.  Extending the NegEx Lexicon for Multiple Languages , 2013, MedInfo.

[63]  Maria Skeppstedt,et al.  Negation detection in Swedish clinical text: An adaption of NegEx to Swedish , 2011, J. Biomed. Semant..

[64]  Son Doan,et al.  Using Hedges to Enhance a Disease Outbreak Report Text Mining System , 2009, BioNLP@HLT-NAACL.

[65]  Hercules Dalianis,et al.  Clinical Text Mining , 2018, Springer International Publishing.

[66]  Nigam H Shah,et al.  Impact of Predicting Health Care Utilization Via Web Search Behavior: A Data-Driven Analysis , 2016, Journal of medical Internet research.

[67]  Tapio Salakoski,et al.  Distributional Semantics Resources for Biomedical Text Processing , 2013 .

[68]  Ted Briscoe,et al.  Combining Manual Rules and Supervised Learning for Hedge Cue and Scope Detection , 2010, CoNLL Shared Task.

[69]  János Csirik,et al.  The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text , 2010, CoNLL Shared Task.

[70]  Alicia Pérez,et al.  Word embeddings for negation detection in health records written in Spanish , 2018, Soft Comput..

[71]  Weinan Zhang,et al.  A syntactic path-based hybrid neural network for negation scope detection , 2018, Frontiers of Computer Science.

[72]  Dragomir R. Radev,et al.  Detecting Speculations and their Scopes in Scientific Text , 2009, EMNLP.

[73]  W. Bruce Croft,et al.  Research Paper: Ad Hoc Classification of Radiology Reports , 1999, J. Am. Medical Informatics Assoc..

[74]  Manuel J. Maña López,et al.  A machine-learning approach to negation and speculation detection in clinical texts , 2012, J. Assoc. Inf. Sci. Technol..

[75]  Katharina Kaiser,et al.  Syntactical Negation Detection in Clinical Practice Guidelines , 2008, MIE.

[76]  János Csirik,et al.  The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes , 2008, BMC Bioinformatics.

[77]  Thomas Fang Zheng,et al.  Transfer learning for speech and language processing , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[78]  Hong Yu,et al.  Biomedical negation scope detection with conditional random fields , 2010, J. Am. Medical Informatics Assoc..

[79]  James Paul White UWashington: Negation Resolution using Machine Learning Methods , 2012, *SEM@NAACL-HLT.

[80]  Kazuhiko Ohe,et al.  TEXT2TABLE: Medical Text Summarization System Based on Named Entity Recognition and Modality Identification , 2009, BioNLP@HLT-NAACL.

[81]  Roser Morante,et al.  Learning the Scope of Hedge Cues in Biomedical Texts , 2009, BioNLP@HLT-NAACL.