Coarse-and Fine-Grained Sentiment Analysis of Social Media Text

INTRODUCTION People make judgments about the world around them. They harbor positive and negative attitudes about people, organizations, places, events, and ideas. We regard these types of attitudes as sentiments. Sentiments are private states,1 cognitive phenomena that are not directly observable by others. However, expressions of sentiment can be manifested in actions, including written and spoken language. Sentiment analysis is the study of automated techniques for extracting sentiment from written language. This has been a very active area entiment analysis—the automated extraction of expressions of positive or negative attitudes from text—has received considerable attention from researchers during the past 10 years. During the same period, the widespread growth of social media has resulted in an explosion of publicly available, user-generated text on the World Wide Web. These data can potentially be utilized to provide real-time insights into the aggregated sentiments of people. The tools provided by statistical natural language processing and machine learning, along with exciting new scalable approaches to working with large volumes of text, make it possible to begin extracting sentiments from the web. We discuss some of the challenges of sentiment extraction and some of the approaches employed to address these challenges. In particular, we describe work we have done to annotate sentiment in blogs at the levels of sentences and subsentences (clauses); to classify subjectivity at the level of sentences; and to identify the targets, or topics, of sentiment at the level of clauses.

[1]  Vikas Sindhwani,et al.  Data Quality from Crowdsourcing: A Study of Annotation Selection Criteria , 2009, HLT-NAACL 2009.

[2]  Nicolas Nicolov,et al.  Targeting Sentiment Expressions through Supervised Ranking of Linguistic Configurations , 2009, ICWSM.

[3]  Eduard Hovy,et al.  Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text , 2006 .

[4]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[5]  Claire Cardie,et al.  Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns , 2005, HLT.

[6]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[7]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[8]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[9]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[10]  Ellen Alers Answer me this , 2010 .

[11]  Theresa Wilson Fine-grained subjectivity and sentiment analysis: recognizing the intensity, polarity, and attitudes of private states , 2008 .

[12]  Vangelis Metsis,et al.  Spam Filtering with Naive Bayes - Which Naive Bayes? , 2006, CEAS.

[13]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[14]  Soo-Min Kim,et al.  Automatic Detection of Opinion Bearing Words and Sentences , 2005, IJCNLP.

[15]  Carolyn Wei Formation of Norms in a Blog Community , 2004 .

[16]  Christopher D. Manning,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS 2002.

[17]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[18]  R. Shah,et al.  Least Squares Support Vector Machines , 2022 .