CAPRA: a comprehensive approach to product ranking using customer reviews

Online shopping generates billions of dollars in revenues, including both the physical goods and online services. Product images and associated descriptions are the two main sources of information used by the shoppers to gain knowledge about a product. However, these two pieces of information may not always present the true picture of the product. Images could be deceiving, and descriptions could be overwhelming or cryptic. Moreover, the relative rank of these products among the peers may lead to inconsistencies. Hence, a useful and widely used piece of information is “user reviews”. A number of vendors like Amazon have created whole ecosystems around user reviews, thereby boosting their revenues. However, extracting the relevant and useful information out of the plethora of reviews is not straight forward, and is a very tedious job. In this paper we propose a product ranking system that facilitates the online shopping experience by analyzing the reviews for sentiments, evaluating their usefulness, extracting and weighing different product features and aspects, ranking it among similar comparable products, and finally creating a unified rank for each product. Experiment results show the usefulness of our proposed approach in providing an effective and reliable online shopping experience in comparison with similar approaches.

[1]  Masrah Azrifah Azmi Murad,et al.  Sentiment classification of customer reviews based on fuzzy logic , 2010, 2010 International Symposium on Information Technology.

[2]  A. McCallum,et al.  Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[3]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[4]  Regina Barzilay,et al.  Content Models with Attitude , 2011, ACL.

[5]  Paul A. Pavlou,et al.  Overcoming the J-shaped distribution of product reviews , 2009, CACM.

[6]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[7]  Soo-Min Kim,et al.  Automatically Assessing Review Helpfulness , 2006, EMNLP.

[8]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[9]  A. Choudhary,et al.  Mining millions of reviews: a technique to rank products based on importance of reviews , 2011, ICEC '11.

[10]  Alok N. Choudhary,et al.  Sentiment Analysis of Conditional Sentences , 2009, EMNLP.

[11]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[12]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[13]  Kevin Lane Keller,et al.  Consumer Evaluations of Brand Extensions , 1990 .

[14]  Annie Zaenen,et al.  Contextual Valence Shifters , 2006, Computing Attitude and Affect in Text.

[15]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[16]  Ming Liu,et al.  Research of Product Ranking Technology Based on Opinion Mining , 2009, 2009 Second International Conference on Intelligent Computation Technology and Automation.

[17]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[18]  P. Parker,et al.  Marketing universals: Consumers' use of brand name, price, physical appearance, and retailer , 1994 .

[19]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[20]  Alok N. Choudhary,et al.  Voice of the Customers: Mining Online Customer Reviews for Product Feature-based Ranking , 2010, WOSN.

[21]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[22]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[23]  Wei-keng Liao,et al.  SES: Sentiment Elicitation System for Social Media Data , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[24]  Ming Zhou,et al.  Low-Quality Product Review Detection in Opinion Summarization , 2007, EMNLP.

[25]  Rou Song,et al.  Automated Error Detection of Vocabulary Usage in College English Writing , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[26]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[27]  Panagiotis G. Ipeirotis,et al.  Designing Ranking Systems for Consumer Reviews : The Impact of Review Subjectivity on Product Sales and Review Quality , 2006 .

[28]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner , 2007 .

[29]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[30]  Edgar A. Pessemier,et al.  A New Way to Determine Buying Decisions , 1959 .

[31]  Pengzhu Zhang,et al.  Health-Related Hot Topic Detection in Online Communities Using Text Clustering , 2013, PloS one.

[32]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[33]  Andreas S. Weigend,et al.  A neural network approach to topic spotting , 1995 .

[34]  Claire Cardie,et al.  Topic Identification for Fine-Grained Opinion Analysis , 2008, COLING.

[35]  Kai Hwang,et al.  Rainbow Product Ranking for Upgrading E-Commerce , 2009, IEEE Internet Computing.

[36]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[37]  S. Leela,et al.  Review on Sentence - Level Clustering with Various Fuzzy Clustering Techniques , 2013 .