A Filter Based Improved Decision Tree Sentiment Classification Model for RealTime Amazon Product Review Data

E-Commerce product features and reviews are considered to be the essential factors in real-time e-commerce sites for product recommendation systems. Due to inaccuracy decision patterns, in most cases e-commerce user fails to predict the products based on the user ratings and review comments. Traditional sentiment classification models are independent of data filtering, transformation and sentiment score computing techniques which require high computing memory, time and mostly leading to false-positive rate. To overcome these issues, a novel sentiment score-based product recommendation model is proposed on the real-time product data. In this model, a new product ranking score, filtering, and hybrid decision tree classifiers are implemented. Initially, real-time amazon product review data is captured using Document Object Model (DOM) parser. The features from the review comments are extracted using lexicon Feature Dictionary (FD) and AFINN, Normalized Product Review Score (NPRS) are generated to compute the class label for product review sentiment prediction. Ranked Principal Component Analysis (RPCA) is used as a feature selection measure to overcome the problem of data sparsity. Random Tree, Hoeffding Tree, Adaboost + Random Tree, the three variants of decision tree classifiers are used for product sentiment classification. The proposed filter-based improved decision tree sentiment classification model for real-time amazon product review data recommends the product based on the user query by prediction using a new novel normalized product review sentiment score and ranked feature selection measure. The proposed product recommendation, the decision-making system maximizes sentiment classification accuracy. Experimental results are compared against the traditional decisionmaking classification models in terms of correctly classified instances, error rate, and PRC, F-measure, kappa statistics. The proposed model experimental results show high efficiency.

[1]  Feng Mai,et al.  The effect of online reviews on product sales: A joint sentiment-topic analysis , 2019, Inf. Manag..

[2]  Xiao Liu,et al.  BPSO-Adaboost-KNN ensemble learning algorithm for multi-class imbalanced data classification , 2016, Eng. Appl. Artif. Intell..

[3]  Xianchao Zhang,et al.  PwAdaBoost: Possible world based AdaBoost algorithm for classifying uncertain data , 2019, Knowl. Based Syst..

[4]  B. Palese,et al.  The relative importance of service quality dimensions in E-commerce experiences , 2018, Int. J. Inf. Manag..

[5]  Bo Song,et al.  Cross-border e-commerce commodity risk assessment using text mining and fuzzy rule-based reasoning , 2019, Adv. Eng. Informatics.

[6]  Manoj Kumar Tiwari,et al.  Predicting the consumer's purchase intention of durable goods: An attribute-level analysis , 2017, Journal of Business Research.

[7]  Li Chen,et al.  User perception of sentiment-integrated critiquing in recommender systems , 2017, Int. J. Hum. Comput. Stud..

[8]  Lu Xu,et al.  Bagging classification tree-based robust variable selection for radial basis function network modeling in metabonomics data analysis , 2018 .

[9]  Hung-Yi Lin,et al.  Efficient classifiers for multi-class classification problems , 2012, Decis. Support Syst..

[10]  Yaochu Jin,et al.  Stacking-based ensemble learning of decision trees for interpretable prostate cancer detection , 2019, Appl. Soft Comput..

[11]  P. Deepa Shenoy,et al.  A novel sentiment aware dictionary for multi-domain sentiment classification , 2017, Comput. Electr. Eng..

[12]  Ang Liu,et al.  Application of data analytics for product design: Sentiment analysis of online product reviews , 2018, CIRP Journal of Manufacturing Science and Technology.

[13]  Giovanni Acampora,et al.  A computational intelligence approach to efficiently predicting review ratings in e-commerce , 2016, Appl. Soft Comput..

[14]  Reinald Kim Amplayo,et al.  Incorporating product description to sentiment topic models for improved aspect-based sentiment analysis , 2018, Inf. Sci..

[15]  Erik Cambria,et al.  Aspect extraction for opinion mining with a deep convolutional neural network , 2016, Knowl. Based Syst..

[16]  Yogesh Kumar Dwivedi,et al.  Exploring reviews and review sequences on e-commerce platform: A study of helpful reviews on Amazon.in , 2018, Journal of Retailing and Consumer Services.

[17]  Yang Liu,et al.  Ranking products through online reviews: A method based on sentiment analysis technique and intuitionistic fuzzy set theory , 2017, Inf. Fusion.

[18]  Hong-Jie Xing,et al.  Robust AdaBoost based ensemble of one-class support vector machines , 2020, Inf. Fusion.

[19]  Carlos Angel Iglesias,et al.  A semantic similarity-based perspective of affect lexicons for sentiment analysis , 2019, Knowl. Based Syst..

[20]  Qi Zhang,et al.  An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews , 2018, Expert Syst. Appl..