A Semantic Approach to Negation Detection and Word Disambiguation with Natural Language Processing

This study aims to demonstrate the methods for detecting negations in a sentence by uniquely evaluating the lexical structure of the text via word-sense disambiguation. The proposed framework examines all the unique features in the various expressions within a text to resolve the contextual usage of all tokens and decipher the effect of negation on sentiment analysis. The application of popular expression detectors skips this important step, thereby neglecting the root words caught in the web of negation and making text classification difficult for machine learning and sentiment analysis. This study adopts the Natural Language Processing (NLP) approach to discover and antonimize words that were negated for better accuracy in text classification using a knowledge base provided by an NLP library called WordHoard. Early results show that our initial analysis improved on traditional sentiment analysis, which sometimes neglects negations or assigns an inverse polarity score. The SentiWordNet analyzer was improved by 35%, the Vader analyzer by 20% and the TextBlob by 6%.

[1]  Anna Rumshisky,et al.  BERT Busters: Outlier Dimensions that Disrupt Transformers , 2021, FINDINGS.

[2]  Anil Bas,et al.  Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet , 2021, ArXiv.

[3]  Suraj Sawant,et al.  NegBERT: A Transfer Learning Approach for Negation Detection and Scope Resolution , 2019, LREC.

[4]  Okpala Izunna Udebuana,et al.  Analysis of Evaluated Sentiments; a Pseudo-Linguistic Approach and Online Acceptability Index for Decision-Making with Data: Nigerian Election in View , 2019 .

[5]  Allyson Ettinger,et al.  What BERT Is Not: Lessons from a New Suite of Psycholinguistic Diagnostics for Language Models , 2019, TACL.

[6]  Karl Stratos,et al.  Label-Agnostic Sequence Labeling by Copying Nearest Neighbors , 2019, ACL.

[7]  Qing-Xing Qu,et al.  A proposal for Kansei knowledge extraction method based on natural language processing technology and online product reviews , 2019, Comput. Ind..

[8]  Nikita,et al.  Fake News Detection Using Machine Learning approaches: A systematic Review , 2019, 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI).

[9]  J. Butt,et al.  Negation , 2018, A New Reference Grammar of Modern Spanish.

[10]  Luke S. Zettlemoyer,et al.  AllenNLP: A Deep Semantic Natural Language Processing Platform , 2018, ArXiv.

[11]  Xuejie Zhang,et al.  Refining Word Embeddings Using Intensity Scores for Sentiment Analysis , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[12]  Satish R. Devane,et al.  Multilingual machine translation : An analytical study , 2017, 2017 International Conference on Intelligent Computing and Control Systems (ICICCS).

[13]  Matthew R. Hallowell,et al.  Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports , 2016 .

[14]  Yaxin Bi,et al.  Improved lexicon-based sentiment analysis for social media analytics , 2015, Security Informatics.

[15]  W. T. Coombs,et al.  The value of communication during a crisis: Insights from strategic communication research , 2015 .

[16]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[17]  Leysia Palen,et al.  Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, ICWSM.

[18]  Hong Yu,et al.  Biomedical negation scope detection with conditional random fields , 2010, J. Am. Medical Informatics Assoc..

[19]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[20]  Lior Rokach,et al.  Negation recognition in medical narrative reports , 2008, Information Retrieval.

[21]  Dominic Widdows,et al.  Orthogonal Negation in Vector Spaces for Modelling Word-Meanings and Document Retrieval , 2003, ACL.

[22]  Prakash M. Nadkarni,et al.  Research Paper: Use of General-purpose Negation Detection to Augment Concept Indexing of Medical Documents: A Quantitative Study Using the UMLS , 2001, J. Am. Medical Informatics Assoc..

[23]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[24]  Chunyu Kit,et al.  Tokenization as the Initial Phase in NLP , 1992, COLING.

[25]  Jessica Kropczynski,et al.  Perception Analysis: Pro- and Anti- Vaccine Classification with NLP and Machine Learning , 2022, HICSS.

[26]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[27]  Jianqiang Li,et al.  Lexicon-Enhanced LSTM With Attention for General Sentiment Analysis , 2018, IEEE Access.

[28]  A. M. Abirami,et al.  A survey on sentiment analysis methods and approach , 2017, 2016 Eighth International Conference on Advanced Computing (ICoAC).

[29]  Andreas Müller,et al.  Introduction to Machine Learning with Python: A Guide for Data Scientists , 2016 .

[30]  Vanesa Del Río Zamora Comparative Study of the Use of Double Negatives by Native English Speakers and Spanish Learners of English , 2015 .

[31]  Shen Jiaxua Division of negatives and noun/verb division in English and Chinese , 2010 .

[32]  D. Kandzari,et al.  Double negatives. , 2003, American heart journal.

[33]  Mirella Lapata,et al.  Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics , 2003, ACL 2003.