Extracting opinionated (sub)features from a stream of product reviews using accumulated novelty and internal re-organization

Opinion stream mining extends conventional opinion mining by monitoring a stream of reviews and detecting changes in the attitude of people toward products. However, next to the opinions of people on concrete products, product features-on which people also bestow their opinions-are equally important: such features appear on all products of a given brand and can deliver clues to product vendors on what improvements should be done in the next version of a product. In this study, we propose an opinion stream mining framework that discovers implicit product features and assesses their polarity, while it also monitors features and their polarity as the stream evolves. An earlier version of this framework has been presented in Zimmermann et?al. (2013). The extended framework encompasses an additional mechanism that merges clusters representing similar product features. We report on extensive experiments for both the original framework and the extended one, using two opinionated streams.

[1]  Geoff Holmes,et al.  MOA-TweetReader: Real-Time Analysis in Twitter Streaming Data , 2011, Discovery Science.

[2]  Freddy Y. Y. Choi Advances in domain independent linear text segmentation , 2000, ANLP.

[3]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[4]  Sasha Blair-Goldensohn,et al.  Building a Sentiment Summarizer for Local Service Reviews , 2008 .

[5]  Hans-Peter Kriegel,et al.  Density-based Projected Clustering over High Dimensional Data Streams , 2012, SDM.

[6]  Martin Ester,et al.  Opinion digger: an unsupervised opinion miner from unstructured product reviews , 2010, CIKM.

[7]  Myra Spiliopoulou,et al.  Extracting Opinionated (Sub)Features from a Stream of Product Reviews , 2013, Discovery Science.

[8]  Myra Spiliopoulou,et al.  Adaptive semi supervised opinion classifier with forgetting mechanism , 2014, SAC.

[9]  Wagner Meira,et al.  Effective sentiment stream analysis with self-augmenting training and demand-driven projection , 2011, SIGIR.

[10]  Vijay B. Raut,et al.  Opinion Mining and Summarization of Hotel Reviews , 2014, 2014 International Conference on Computational Intelligence and Communication Networks.

[11]  Changqin Quan,et al.  Unsupervised product feature extraction for feature-oriented opinion determination , 2014, Inf. Sci..

[12]  Hitoshi Isahara,et al.  A Statistical Model for Domain-Independent Text Segmentation , 2001, ACL.

[13]  Aoying Zhou,et al.  Density-Based Clustering over an Evolving Data Stream with Noise , 2006, SDM.

[14]  Albert Bifet,et al.  Sentiment Knowledge Discovery in Twitter Streaming Data , 2010, Discovery Science.

[15]  Myra Spiliopoulou,et al.  Discovering and monitoring product features and the opinions on them with OPINSTREAM , 2015, Neurocomputing.

[16]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[17]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[18]  Philip S. Yu,et al.  A Framework for Clustering Massive Text and Categorical Data Streams , 2006, SDM.

[19]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[20]  Jingbo Zhu,et al.  Multi-aspect opinion polling from textual reviews , 2009, CIKM.

[21]  Meng Wang,et al.  Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews , 2011, EMNLP.

[22]  Chong Long,et al.  A Review Selection Approach for Accurate Feature Rating Estimation , 2010, COLING.

[23]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[24]  Pushpak Bhattacharyya,et al.  Feature Specific Sentiment Analysis for Product Reviews , 2012, CICLing.

[25]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[26]  Brigitte Bigi,et al.  Using Kullback-Leibler Distance for Text Categorization , 2003, ECIR.

[27]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[28]  Daniel A. Keim,et al.  Visual sentiment analysis on twitter data streams , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[29]  Hans-Peter Kriegel,et al.  Discovering global and local bursts in a stream of news , 2012, SAC '12.