A Voting-Based Sentiment Classification Model

Sentiment analysis identifies the sentiments expressed in text such as news, reviews, and articles, and classifies them as positive or negative. It has gained significant attention as more individuals use social media platforms to express sentiments about organizations, products, and administrations, and many methods are being devised to improve the efficacy of automated sentiment classification. This study proposes a voting-based ensemble model, Majority Voting (MV), that uses five supervised machine learning classifiers as base classifiers, namely Logistic Regression (LR), Support Vector Machine (SVM), Artificial Neural Network (ANN), Decision Tree (DT), and Random Forest (RF), and applies a majority voting rule to their outputs to obtain the final prediction. The performance of the proposed method is assessed using the minimum, maximum, mean, and median values of precision, recall, F-score, and accuracy. The 900 classification accuracy values (3 datasets \(\times\) 6 classification methods \(\times\) 10 data subsets from k-fold cross-validation with \(k=10\) \(\times\) 5 runs) indicate that the proposed approach outperforms the individual classifiers in the majority of cases.
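The majority voting rule described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `majority_vote` and `ensemble_predict` and the example predictions are hypothetical, and the labels are assumed to be encoded as 1 (positive) and 0 (negative). With five base classifiers and two classes, a tie cannot occur.

```python
from collections import Counter


def majority_vote(votes):
    """Return the label chosen by the largest number of base classifiers."""
    label, _count = Counter(votes).most_common(1)[0]
    return label


def ensemble_predict(per_classifier_predictions):
    """Combine per-classifier prediction lists into one list of final labels.

    per_classifier_predictions holds one list per base classifier,
    all of the same length (one entry per sample).
    """
    # zip(*...) regroups the predictions so each tuple holds the five
    # votes cast for a single sample.
    return [majority_vote(votes) for votes in zip(*per_classifier_predictions)]


# Hypothetical predictions for four samples (1 = positive, 0 = negative).
base_outputs = {
    "LR":  [1, 0, 1, 1],
    "SVM": [1, 0, 0, 1],
    "ANN": [0, 0, 1, 1],
    "DT":  [1, 1, 0, 0],
    "RF":  [1, 0, 1, 0],
}

final_labels = ensemble_predict(list(base_outputs.values()))
print(final_labels)  # [1, 0, 1, 1]
```

Each sample receives the label predicted by at least three of the five base classifiers, so a single classifier's error is outvoted whenever the others agree.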
