Detection of Product Comparisons - How Far Does an Out-of-the-Box Semantic Role Labeling System Take You?

This short paper presents a pilot study investigating the training of a standard Semantic Role Labeling (SRL) system on product reviews for the new task of detecting comparisons. An (opinionated) comparison consists of a comparative “predicate” and up to three “arguments”: the entity evaluated positively, the entity evaluated negatively, and the aspect under which the comparison is made. In user-generated product reviews, the “predicate” and “arguments” are expressed in highly heterogeneous ways; but since the elements are textually annotated in existing datasets, SRL is technically applicable. We address the interesting question how well training an outof-the-box SRL model works for English data. We observe that even without any feature engineering or other major adaptions to our task, the system outperforms a reasonable heuristic baseline in all steps (predicate identification, argument identification and argument classification) and in three different datasets.

[1]  Pierre Nugues,et al.  Multilingual Semantic Role Labeling , 2009, CoNLL Shared Task.

[2]  Bernd Bohnet,et al.  Very high accuracy and fast dependency parsing is not a contradiction , 2010, COLING 2010.

[3]  Youngjoong Ko,et al.  Finding relevant features for Korean comparative sentence extraction , 2011, Pattern Recognit. Lett..

[4]  Bing Liu,et al.  Mining Opinions in Comparative Sentences , 2008, COLING.

[5]  Bing Liu,et al.  Identifying comparative sentences in text documents , 2006, SIGIR.

[6]  Hinrich Schütze,et al.  Sentiment Relevance , 2013, ACL.

[7]  Janyce Wiebe,et al.  Annotating Opinions in the World Press , 2003, SIGDIAL Workshop.

[8]  Bing Liu,et al.  Mining Comparative Sentences and Relations , 2006, AAAI.

[9]  Friederike Moltmann,et al.  Coordination and comparatives , 1992 .

[10]  Petra Hendriks,et al.  Review of C. Kennedy, Projecting the adjective. The syntax and semantics of gradability and comparison , 1999 .

[11]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[12]  Youngjoong Ko,et al.  Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques , 2009, ACL.

[13]  K. J. Evans,et al.  Computer Intensive Methods for Testing Hypotheses: An Introduction , 1990 .

[14]  Scott M. Smith,et al.  Computer Intensive Methods for Testing Hypotheses: An Introduction , 1989 .

[15]  Youngjoong Ko,et al.  Extracting Comparative Entities and Predicates from Texts Using Comparative Type Classification , 2011, ACL.

[16]  Guo-Hui Li,et al.  Mining Chinese comparative sentences by semantic role labeling , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[17]  Stephen Shaoyi Liao,et al.  Mining comparative opinions from customer reviews for Competitive Intelligence , 2011, Decis. Support Syst..

[18]  Miriam Eckert,et al.  The ICWSM 2010 JDPA Sentiment Corpus for the Automotive Domain , 2010 .