Prediction of star ratings from online reviews

Huge abundance e-commerce websites and online reviews have become crucial these days. These reviews help customers in making decisions but one must go through huge pile of reviews in many sites. We have summarized the reviews into STARS on a scale of 1-5 which are easy to perceive. So, for a given customer review, we predict the star rating of the review. Proposed approach in this research work first pre-process review data and then train different classifiers like Multinomial Naïve Bayes, Bigram Multinomial Naïve Bayes, Trigram Multinomial Naïve Bayes, Bigram-Trigram Multinomial Naïve Bayes, Random Forest. Finally, trained models predict star rating of a review. Comparing the performance of these classifiers, it is observed that Random Forest is better than the other classifiers in terms of accuracy. However, Bigram-Trigram Multinomial Naïve Bayes is on par with the results of classifier like Random Forest as well as has far less computational time.

[1]  Sonali Agarwal,et al.  High speed streaming data analysis of web generated log streams , 2015, 2015 IEEE 10th International Conference on Industrial and Information Systems (ICIIS).

[2]  Sonali Agarwal,et al.  Comparative Study of Big Data Computing and Storage Tools: A Review , 2016 .

[3]  Sasank Channapragada,et al.  Prediction of rating based on review text of Yelp reviews , 2015 .

[4]  Asif Ekbal,et al.  Feature Extraction and Opinion Mining in Online Product Reviews , 2014, 2014 International Conference on Information Technology.

[5]  Sonali Agarwal,et al.  Critical parameter analysis of Vertical Hoeffding Tree for optimized performance using SAMOA , 2017, Int. J. Mach. Learn. Cybern..

[6]  Hiroshi Inamura,et al.  Multi-label categorizing local event information from micro-blogs , 2016, 2016 Ninth International Conference on Mobile Computing and Ubiquitous Networking (ICMU).

[7]  Sonali Agarwal,et al.  Stream Data Mining: Platforms, Algorithms, Performance Evaluators and Research Trends , 2016 .

[8]  Sanjeev Ahuja,et al.  Sentiment analysis of movie reviews: A study on feature selection & classification algorithms , 2016, 2016 International Conference on Microelectronics, Computing and Communications (MicroCom).

[9]  Geetika Gautam,et al.  Sentiment analysis of twitter data using machine learning approaches and semantic analysis , 2014, 2014 Seventh International Conference on Contemporary Computing (IC3).

[10]  Dagmar Monett,et al.  Predicting star ratings based on annotated reviews of mobile apps , 2016, 2016 Federated Conference on Computer Science and Information Systems (FedCSIS).

[11]  Rakesh Chada Data Mining Yelp Data-Predicting rating stars from review text , 2014 .

[12]  Santanu Kumar Rath,et al.  Classification of Sentimental Reviews Using Machine Learning Techniques , 2015 .

[13]  Ajay Rana,et al.  User reviews data analysis using opinion mining on web , 2015, 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE).

[14]  Justin Zhijun Zhan,et al.  Sentiment analysis using product review data , 2015, Journal of Big Data.

[15]  Callen Rain,et al.  Analysis in Amazon Reviews Using Probabilistic Machine Learning , 2012 .