What's great and what's not: learning to classify the scope of negation for improved sentiment analysis

Automatic detection of linguistic negation in free text is a critical need for many text processing applications, including sentiment analysis. This paper presents a negation detection system based on a conditional random field modeled using features from an English dependency parser. The scope of negation detection is limited to explicit rather than implied negations within single sentences. A new negation corpus is presented that was constructed for the domain of English product reviews obtained from the open web, and the proposed negation extraction system is evaluated against the reviews corpus as well as the standard BioScope negation corpus, achieving 80.0% and 75.5% F1 scores, respectively. The impact of accurate negation detection on a state-of-the-art sentiment analysis system is also reported.

[1]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .

[2]  János Csirik,et al.  The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes , 2008, BMC Bioinformatics.

[3]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[4]  Joakim Nivre,et al.  Deterministic Dependency Parsing of English Text , 2004, COLING.

[5]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[6]  Claire Cardie,et al.  Structured Local Training and Biased Potential Functions for Conditional Random Fields with Application to Coreference Resolution , 2007, HLT-NAACL.

[7]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[8]  Karo Moilanen,et al.  Sentiment Composition , 2007 .

[9]  G. Tottie Negation in English speech and writing : a study in variation , 1993 .

[10]  Claire Cardie,et al.  Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis , 2008, EMNLP.

[11]  Cristian Danescu-Niculescu-Mizil,et al.  Without a ’doubt’? Unsupervised Discovery of Downward-Entailing Operators , 2009, NAACL.

[12]  T. Givón,et al.  English grammar : a function-based introduction , 1995 .

[13]  Roser Morante,et al.  A Metalearning Approach to Processing the Scope of Negation , 2009, CoNLL.

[14]  Tejashri Inadarchand Jain,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2010 .

[15]  Svatava Spurná,et al.  Negation in English , 2008 .

[16]  Sasha Blair-Goldensohn,et al.  The viability of web-derived polarity lexicons , 2010, NAACL.

[17]  Henry Jackman,et al.  Readings in the Philosophy of Language , 1999 .

[18]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[19]  Kentaro Inui,et al.  Dependency Tree-based Sentiment Classification using CRFs with Hidden Variables , 2010, NAACL.

[20]  Mike Wells,et al.  Structured Models for Fine-to-Coarse Sentiment Analysis , 2007, ACL.