Speculation and Negation detection in French biomedical corpora

In this work, we address the detection of negation and speculation, and of their scope, in French biomedical documents. Negation and speculation have been observed to play an important role and to provide crucial clues for other NLP applications. Our methods are based on CRFs and BiLSTMs. We reach up to 97.21% and 91.30% F-measure for the detection of negation and speculation cues, respectively, using CRFs. For scope detection, we reach up to 90.81% and 86.73% F-measure on negation and speculation, respectively, using a BiLSTM-CRF fed with word embeddings.
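To make the reported F-measure figures concrete, the sketch below computes precision, recall, and F-measure over token-level cue labels. It is only an illustration under stated assumptions: the abstract does not say whether evaluation is done per token or per span, and the `CUE`/`O` label names, the `f_measure` helper, and the example sentence are hypothetical rather than taken from the paper.

```python
from typing import List, Tuple


def f_measure(gold: List[str], pred: List[str],
              positive: str = "CUE") -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) for one label, token by token.

    Assumption: token-level scoring with a single positive label; the
    paper's actual evaluation protocol may differ.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical example: the negation cue "ne ... pas" in a French sentence.
tokens = ["Le", "patient", "ne", "présente", "pas", "de", "fièvre", "."]
gold = ["O", "O", "CUE", "O", "CUE", "O", "O", "O"]
pred = ["O", "O", "CUE", "O", "O", "O", "O", "O"]  # the tagger missed "pas"

precision, recall, f1 = f_measure(gold, pred)
print(f"P={precision:.2%} R={recall:.2%} F={f1:.2%}")
```

In this toy case the tagger finds one of the two cue tokens and makes no false predictions, so precision is perfect while recall and the F-measure are lower. The same function could be applied to scope labels by passing a different `positive` value.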
