A Comparison Among Significance Tests and Other Feature Building Methods for Sentiment Analysis: A First Study

Words that participate in the sentiment (positive or negative) classification decision are known as significant words for sentiment classification. Identification of such significant words as features from the corpus reduces the amount of irrelevant information in the feature set under supervised sentiment classification settings. In this paper, we conceptually study and compare various types of feature building methods, viz., unigrams, TFIDF, Relief, Delta-TFIDF, \(\chi ^2\) test and Welch’s t-test for sentiment analysis task. Unigrams and TFIDF are the classic ways of feature building from the corpus. Relief, Delta-TFIDF and \(\chi ^2\) test have recently attracted much attention for their potential use as feature building methods in sentiment analysis. On the contrary, t-test is the least explored for the identification of significant words from the corpus as features.

[1]  Timothy W. Finin,et al.  Delta TFIDF: An Improved Feature Space for Sentiment Analysis , 2009, ICWSM.

[2]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[3]  Malcolm Farrow,et al.  Use of the Chi-Squared Test to Examine Vocabulary Differences in English Language Corpora Representing Seven Different Countries , 2007, Lit. Linguistic Comput..

[4]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[5]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[6]  Pushpak Bhattacharyya,et al.  Domain Sentiment Matters: A Two Stage Sentiment Analyzer , 2015, ICON.

[7]  Aoying Zhou,et al.  An information theoretic approach to sentiment polarity classification , 2012, WebQuality '12.

[8]  Abdelwadood Mesleh,et al.  Chi Square Feature Extraction Based Svms Arabic Language Text Categorization System , 2007 .

[9]  A. Kilgarriff Comparing Corpora , 2001 .

[10]  Ivor W. Tsang,et al.  Hybrid Heterogeneous Transfer Learning through Deep Learning , 2014, AAAI.

[11]  Magali Paquot,et al.  Distinctive words in academic writing: a comparison of three statistical tests for keyword extraction , 2009 .

[12]  Timothy W. Finin,et al.  Improving binary classification on text problems using differential word features , 2009, CIKM.

[13]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[14]  Luka Cehovin,et al.  Empirical evaluation of feature selection methods in classification , 2010, Intell. Data Anal..

[15]  Helene Fowkes,et al.  A method based on the chi-square test for document classification , 2001, SIGIR '01.

[16]  Pushpak Bhattacharyya,et al.  Lost in Translation: Viability of Machine Translation for Cross Language Sentiment Analysis , 2013, CICLing.

[17]  Anand Rajaraman,et al.  Mining of Massive Datasets , 2011 .

[18]  Yoshua Bengio,et al.  Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.

[19]  Abdulmohsen Al-Thubaity,et al.  Automatic Arabic Text Classification , 2008 .

[20]  Kam-Fai Wong,et al.  Interpreting TF-IDF term weights as making relevance decisions , 2008, TOIS.

[21]  Vincent Ng,et al.  Examining the Role of Linguistic Knowledge Sources in the Automatic Identification and Classification of Reviews , 2006, ACL.

[22]  Yücel Saygin,et al.  Learning Domain-Specific Polarity Lexicons , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[23]  Christopher Joseph Pal,et al.  Cross Lingual Adaptation: An Experiment on Sentiment Classifications , 2010, ACL.

[24]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[25]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[26]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[27]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[28]  Paul Rayson,et al.  Comparing Corpora using Frequency Profiling , 2000, Proceedings of the workshop on Comparing corpora -.

[29]  Alex Cheng,et al.  A System for Multilingual Sentiment Learning On Large Data Sets , 2012, COLING.

[30]  Pushpak Bhattacharyya,et al.  Detecting Domain Dedicated Polar Words , 2013, IJCNLP.

[31]  Xin Jin,et al.  Machine Learning Techniques and Chi-Square Feature Selection for Cancer Classification Using SAGE Gene Expression Profiles , 2006, BioDM.

[32]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[33]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[34]  Josef Steinberger,et al.  Sentiment Analysis in Czech Social Media Using Supervised Machine Learning , 2013, WASSA@NAACL-HLT.

[35]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[36]  Panagiotis Papapetrou,et al.  Significance testing of word frequencies in corpora , 2016, Digit. Scholarsh. Humanit..

[37]  Xiaojun Wan,et al.  Co-Training for Cross-Lingual Sentiment Classification , 2009, ACL.

[38]  Tony A. Meyer,et al.  SpamBayes: Effective open-source, Bayesian based, email classification system , 2004, CEAS.