Open Domain Suggestion Mining: Problem Definition and Datasets

We propose a formal definition for the task of suggestion mining in the context of a wide range of open domain applications. Human perception of the term \emph{suggestion} is subjective and this effects the preparation of hand labeled datasets for the task of suggestion mining. Existing work either lacks a formal problem definition and annotation procedure, or provides domain and application specific definitions. Moreover, many previously used manually labeled datasets remain proprietary. We first present an annotation study, and based on our observations propose a formal task definition and annotation procedure for creating benchmark datasets for suggestion mining. With this study, we also provide publicly available labeled datasets for suggestion mining in multiple domains.

[1]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[2]  Sung-Hyon Myaeng,et al.  Automatic extraction of advice-revealing sentences foradvice mining from online forums , 2013, K-CAP.

[3]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[4]  Paul Buitelaar,et al.  Towards the Extraction of Customer-to-Customer Suggestions from Reviews , 2015, EMNLP.

[5]  Sung-Hyon Myaeng,et al.  Mining advices from weblogs , 2012, CIKM.

[6]  Niranjan Pedanekar,et al.  Wishful Thinking - Finding suggestions and ’buy’ wishes from product reviews , 2010, HLT-NAACL 2010.

[7]  Alicia Martínez Flor,et al.  A theoretical review of the speech act of suggesting: towards a taxonomy for its use in FLT , 2005 .

[8]  J. Searle A classification of illocutionary acts , 1976, Language in Society.

[9]  Caroline Brun,et al.  Suggestion Mining: Detecting Suggestions for Improvement in Users' Comments , 2013, Res. Comput. Sci..

[10]  Samaneh Moghaddam,et al.  Beyond Sentiment Analysis: Mining Defects and Improvements from Customer Feedback , 2015, ECIR.

[11]  David Crystal,et al.  A dictionary of linguistics and phonetics , 1997 .

[12]  Yajuan Duan,et al.  The Automated Acquisition of Suggestions from Tweets , 2013, AAAI.

[13]  Lokendra Shastri,et al.  Suggestion Mining from Customer Reviews , 2011, AMCIS.

[14]  Nicholas Asher,et al.  Appraisal of Opinion Expressions in Discourse , 2009 .

[15]  Valentin Jijkoun,et al.  Mining User Experiences from Online Forums: An Exploration , 2010, HLT-NAACL 2010.

[16]  Benno Stein,et al.  A Review Corpus for Argumentation Analysis , 2014, CICLing.

[17]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[18]  V. H. Dudman Indicative and subjunctive , 1988 .

[19]  Xiaowei Guan A Study on the Formalization of English Subjunctive Mood , 2012 .

[20]  Sung-Hyon Myaeng,et al.  Toward advice mining: conditional random fields for extracting advice-revealing text units , 2013, CIKM.

[21]  Julio Gonzalo,et al.  Overview of RepLab 2014: Author Profiling and Reputation Dimensions for Online Reputation Management , 2014, CLEF.

[22]  Alicia Martínez-Flor A Theoretical Review of the Speech Act of Suggesting: Towards a Taxonomy for its Use in FLT1 , 2005 .

[23]  Roser Morante,et al.  Modality and Negation: An Introduction to the Special Issue , 2012, CL.