Mining Fine-grained Opinion Expressions with Shallow Parsing

Opinion analysis deals with public opinions and trends, but subjective language is highly ambiguous. In this paper, we follow a simple data-driven technique to learn fine-grained opinions. We select an intersection set of Wall Street Journal documents that is included both in the Penn Discourse Tree Bank (PDTB) and in the Multi-Perspective Question Answering (MPQA) corpus. This is done in order to explore the usefulness of discourselevel structure to facilitate the extraction of fine-grained opinion expressions. Here we perform shallow parsing of MPQA expressions with connective based discourse structure, and then also with Named Entities (NE) and some syntax features using conditional random fields; the latter feature set is basically a collection of NEs and a bundle of features that is proved to be useful in a shallow discourse parsing task. We found that both of the feature-sets are useful to improve our baseline at different levels of this fine-grained opinion expression mining task.

[1]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[2]  Claire Cardie,et al.  Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns , 2005, HLT.

[3]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[4]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[5]  Swapna Somasundaran,et al.  Recognizing Stances in Online Debates , 2009, ACL.

[6]  John A. Carroll,et al.  Applied morphological processing of English , 2001, Natural Language Engineering.

[7]  Heiner Stuckenschmidt,et al.  Fine-Grained Sentiment Analysis with Structural Features , 2011, IJCNLP.

[8]  Claire Cardie,et al.  Topic Identification for Fine-Grained Opinion Analysis , 2008, COLING.

[9]  Nicholas Asher,et al.  Appraisal of Opinion Expressions in Discourse , 2009 .

[10]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[11]  Annie Zaenen,et al.  Contextual Valence Shifters , 2006, Computing Attitude and Affect in Text.

[12]  Erik F. Tjong Kim Sang,et al.  Representing Text Chunks , 1999, EACL.

[13]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[14]  Oscar Täckström,et al.  Semi-supervised latent variable models for sentence-level sentiment analysis , 2011, ACL.

[15]  Claire Cardie,et al.  Identifying Expressions of Opinion in Context , 2007, IJCAI.

[16]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[17]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[18]  Uzay Kaymak,et al.  Polarity analysis of texts using discourse structure , 2011, CIKM '11.

[19]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[20]  Richard Johansson,et al.  Shallow Discourse Parsing with Conditional Random Fields , 2011, IJCNLP.

[21]  Claire Cardie,et al.  Hierarchical Sequential Learning for Extracting Opinions and Their Attributes , 2010, ACL.

[22]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[23]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[24]  Rich Caruana,et al.  Greedy Attribute Selection , 1994, ICML.

[25]  Sucheta Ghosh End-to-End Discourse Parsing with Cascaded Structured Prediction , 2012 .

[26]  Pat Langley,et al.  Editorial: On Machine Learning , 1986, Machine Learning.

[27]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[28]  Soo-Min Kim,et al.  Automatic Identification of Pro and Con Reasons in Online Reviews , 2006, ACL.

[29]  Richard Johansson,et al.  Relational Features in Fine-Grained Opinion Analysis , 2013, CL.