Joint and Pipeline Probabilistic Models for Fine-Grained Sentiment Analysis: Extracting Aspects, Subjective Phrases and their Relations

Sentiment analysis and opinion mining are often addressed as a text classification or entity recognition problem, involving the detection or classification of aspects and subjective phrases. Many approaches do not model the relation between aspects and subjective phrases explicitly, implicitly assuming that a subjective phrase refers to a certain aspect if they co-occur together in the same sentence, thus potentially sacrificing accuracy. Instead, in the approach presented in this paper, we model the relation between aspects and subjective phrases explicitly, exploiting a flexible model based on imperatively defined factor graphs (IDF). The extraction of subjective phrases, aspects and the relation between them is modeled as a joint inference problem and compared to a pure pipeline architecture. Our goal is to analyse and quantify to what extent a joint model outperforms a pipeline model in terms of extraction of aspects, subjective phrases and the relation between them. Our results show that, while we have a substantial improvement on predicting targets using a joint inference model, the performance on subjective phrase detection and relation extraction actually decreases only slightly.

[1]  Philipp Cimiano,et al.  Bi-directional Inter-dependencies of Subjective Expressions and Targets and their Value for a Joint Model , 2013, ACL.

[2]  Iryna Gurevych,et al.  Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields , 2010, EMNLP.

[3]  Oscar Täckström,et al.  Semi-supervised latent variable models for sentence-level sentiment analysis , 2011, ACL.

[4]  Iryna Gurevych,et al.  Using Anaphora Resolution to Improve Opinion Target Identification in Movie Reviews , 2010, ACL.

[5]  Andrew McCallum,et al.  SampleRank: Training Factor Graphs with Atomic Gradients , 2011, ICML.

[6]  A. Valencia,et al.  Overview of the protein-protein interaction annotation extraction task of BioCreative II , 2008, Genome Biology.

[7]  Claire Cardie,et al.  Extracting Opinion Expressions with semi-Markov Conditional Random Fields , 2012, EMNLP.

[8]  Sung-Hyon Myaeng,et al.  Detecting Opinions and their Opinion Targets in NTCIR-8 , 2010, NTCIR.

[9]  Andrew McCallum,et al.  Tractable Learning and Inference with High-Order Representations , 2006 .

[10]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[11]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[12]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[13]  Andrew McCallum,et al.  FACTORIE: Probabilistic Programming via Imperatively Defined Factor Graphs , 2009, NIPS.

[14]  Richard Johansson,et al.  Extracting Opinion Expressions and Their Polarities - Exploration of Pipelines and Joint Models , 2011, ACL.

[15]  Andrew McCallum,et al.  FACTORIE: Efficient Probabilistic Programming for Relational Factor Graphs via Imperative Declarations of Structure, Inference and Learning , 2008 .

[16]  Claire Cardie,et al.  Joint Inference for Fine-grained Opinion Extraction , 2013, ACL.

[17]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[18]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[19]  William W. Cohen,et al.  Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.

[20]  Stuart J. Russell,et al.  BLOG: Relational Modeling with Unknown Objects , 2004 .

[21]  Xuanjing Huang,et al.  Opinion Mining with Sentiment Graph , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[22]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[23]  M. de Rijke,et al.  A Corpus for Entity Profiling in Microblog Posts , 2012 .

[24]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[25]  Jordan L. Boyd-Graber,et al.  Grammatical structures for word-level sentiment detection , 2012, NAACL.

[26]  X. Jin Factor graphs and the Sum-Product Algorithm , 2002 .

[27]  Dan Klein,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[28]  Xiaoyan Zhu,et al.  Sentiment Analysis with Global Topics and Local Dependency , 2010, AAAI.

[29]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[30]  S. Chib,et al.  Understanding the Metropolis-Hastings Algorithm , 1995 .