Paper information - ConSent: Context-based sentiment analysis

Abstract: We present ConSent, a novel context-based approach to sentiment analysis. Our approach builds on information retrieval techniques to identify key terms that indicate the presence of sentiment. We model these terms and the contexts in which they appear, and use them to generate features for supervised learning. The two major strengths of the proposed model are its robustness to noise and the ease with which features from multiple sources can be added to the feature set. Empirical evaluation over multiple real-world domains demonstrates the merit of our approach compared to state-of-the-art methods on both noiseless and noisy text.
