Multi-level particle swarm optimisation and its parallel version for parameter optimisation of ensemble models: a case of sentiment polarity prediction

Ensemble learning is increasingly used in sentiment analysis. Determining the parameter settings of ensemble models, however, is not easy. Besides its own parameters, an ensemble model has base-predictors that have their individual parameters. Some ensemble models use a specific base-predictor and could be optimised using standard metaheuristics such as the Particle Swarm Optimisation (PSO) approach. Optimising ensemble models with multiple base-predictor candidates is more complicated and challenging, as there are multiple options to choose from. We therefore propose Multi-Level PSO (ML-PSO) and Parallel ML-PSO (PML-PSO) to optimise the parameters of ensemble models, especially those with multiple base-predictors, for sentiment analysis. The idea is to utilise multiple PSOs as particles of the main PSO. The main PSO optimises ensemble-model parameters and determines the best base-predictor, whereas PSOs within it optimise the corresponding base-predictor’s parameters. Experimental results using Bagging Predictors as the underlying ensemble model show that ML-PSO can improve prediction accuracy, while PML-PSO is able to speed up the processing time and further improve the accuracy.

[1]  Yidi Wang,et al.  A new general nearest neighbor classification based on the mutual neighborhood information , 2017, Knowl. Based Syst..

[2]  Ramanjot Kaur,et al.  Twitter Sentiment Analysis using Machine Learning and Optimization Techniques , 2018 .

[3]  Bambang Setiahadi,et al.  Automatic Classification of Sunspot Groups for Space Weather Analysis , 2013 .

[4]  Muhammad Nazir,et al.  PSO-GA Based Optimized Feature Selection Using Facial and Clothing Information for Gender Classification , 2014 .

[5]  Jianping Zeng,et al.  Emotion space model for classifying opinions in stock message board , 2016, Expert Syst. Appl..

[6]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[7]  Harish Sharma,et al.  A Survey on Parallel Particle Swarm Optimization Algorithms , 2019, Arabian Journal for Science and Engineering.

[8]  Raymond Chiong,et al.  A sentiment analysis-based machine learning approach for financial market prediction via news disclosures , 2018, GECCO.

[9]  V. Sugumaran,et al.  Misfire detection in an IC engine using vibration signal and decision tree algorithms , 2014 .

[10]  Keith Dobney,et al.  Earliest “Domestic” Cats in China Identified as Leopard Cat (Prionailurus bengalensis) , 2016, PloS one.

[11]  Franciska de Jong,et al.  Care more about customers: Unsupervised domain-independent aspect detection for sentiment analysis of customer reviews , 2013, Knowl. Based Syst..

[12]  Dongjin Yu,et al.  Rating prediction using review texts with underlying sentiments , 2017, Inf. Process. Lett..

[13]  Kehe Wu,et al.  Algorithm and Implementation of Distributed ESN Using Spark Framework and Parallel PSO , 2017 .

[14]  L. Shah,et al.  Reliability and reproducibility of individual differences in functional connectivity acquired during task and resting state , 2016, Brain and behavior.

[15]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Sui-xi Kong,et al.  Sentiment classification and computing for online reviews by a hybrid SVM and LSA based approach , 2018, Cluster Computing.

[18]  Dumitru Dumitrescu,et al.  Evolutionary swarm cooperative optimization in dynamic environments , 2009, Natural Computing.

[19]  Roliana Ibrahim,et al.  Ordinal-based and frequency-based integration of feature selection methods for sentiment analysis , 2017, Expert Syst. Appl..

[20]  Surendra Kumar,et al.  RAPID PSO BASED FEATURES SELECTION FOR CLASSIFICATION , 2017 .

[21]  Athena Vakali,et al.  Sentiment analysis leveraging emotions and word embeddings , 2017 .

[22]  C. Sunitha,et al.  Sentiment Analysis: A Comparative Study on Different Approaches☆ , 2016 .

[23]  R. Tibshirani,et al.  Generalized Additive Models , 1986 .

[24]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[25]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[26]  Omid Bozorg-Haddad,et al.  Advanced Optimization by Nature-Inspired Algorithms , 2018 .

[27]  Xiaohui Hu,et al.  Sentiment analysis of Chinese online reviews using ensemble learning framework , 2018, Cluster Computing.

[28]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[29]  Raymond Chiong,et al.  Dynamic Function Optimization: The Moving Peaks Benchmark , 2013, Metaheuristics for Dynamic Optimization.

[30]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[31]  Ming-Yuan Cho,et al.  Feature Selection and Parameters Optimization of SVM Using Particle Swarm Optimization for Fault Classification in Power Distribution Systems , 2017, Comput. Intell. Neurosci..

[32]  Bartosz Wojciechowski,et al.  Differential diagnosis of eating disorders with the use of classification trees (decision algorithm) , 2016 .

[33]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[34]  Hongyan Cui,et al.  Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce , 2016, PloS one.

[35]  Aytug Onan,et al.  A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification , 2016, Expert Syst. Appl..

[36]  David Cornforth,et al.  Using Support Vector Machine Ensembles for Target Audience Classification on Twitter , 2015, PloS one.

[37]  Raymond Chiong,et al.  Multi-PSO based Classifier Selection and Parameter Optimisation for Sentiment Polarity Prediction , 2018, 2018 IEEE Conference on Big Data and Analytics (ICBDA).

[38]  Zhongyi Hu,et al.  Predicting rating polarity through automatic classification of review texts , 2017, 2017 IEEE Conference on Big Data and Analytics (ICBDA).

[39]  Raymond Chiong,et al.  A hybrid particle swarm optimisation approach for energy-efficient single machine scheduling with cumulative deterioration and multiple maintenances , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[40]  Sangjae Lee,et al.  Predicting the helpfulness of online reviews using multilayer perceptron neural networks , 2014, Expert Syst. Appl..

[41]  Jie Sun,et al.  Dynamic financial distress prediction with concept drift based on time weighting combined with Adaboost support vector machine ensemble , 2017, Knowl. Based Syst..

[42]  Kristina Machova,et al.  COMBINED APPROACH FOR SENTIMENT ANALYSIS IN SLOVAK USING A DICTIONARY ANNOTATED BY PARTICLE SWARM OPTIMIZATION , 2018 .

[43]  Gregorius Satia Budhi,et al.  Java Characters Recognition using Evolutionary Neural Network and Combination of Chi2 and Backpropagation Neural Network , 2014 .

[44]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[45]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[46]  Gregorius Satia Budhi,et al.  Handwritten Javanese Character Recognition Using Several Artificial Neural Network Methods , 2014 .

[47]  Adriano L. I. Oliveira,et al.  Swarm optimization clustering methods for opinion mining , 2018, Natural Computing.

[48]  Ying Tan,et al.  Magnifier Particle Swarm Optimization , 2009, Nature-Inspired Algorithms for Optimisation.

[49]  Mochamad Wahyudi,et al.  SENTIMENT ANALYSIS OF SMARTPHONE PRODUCT REVIEW USING SUPPORT VECTOR MACHINE ALGORITHM-BASED PARTICLE SWARM OPTIMIZATION , 2016 .

[50]  Nicholas G. Martin,et al.  The variance shared across forms of childhood trauma is strongly associated with liability for psychiatric and substance use disorders , 2016, Brain and behavior.

[51]  Kavitha Chinniyan,et al.  Semantic similarity based web document classification using support vector machine , 2017, Int. Arab J. Inf. Technol..

[52]  A. Zolfaghari,et al.  A new hybrid method for multi-objective fuel management optimization using parallel PSO-SA , 2014 .

[53]  Mc Borja,et al.  An Introduction to Generalized Linear Models, 3rd edition , 2009 .

[54]  Sungzoon Cho,et al.  Box-office forecasting based on sentiments of movie reviews and Independent subspace method , 2016, Inf. Sci..

[55]  Eric R. Ziegel,et al.  An Introduction to Generalized Linear Models , 2002, Technometrics.

[56]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[57]  Elisabetta Fersini,et al.  Expressive signals in social media languages to improve polarity detection , 2016, Inf. Process. Manag..

[58]  Yunxia Wang,et al.  Text Classifier Based on an Improved SVM Decision Tree , 2012 .

[59]  Panos Panagiotopoulos,et al.  Beyond positive or negative: Qualitative sentiment analysis of social media reactions to unexpected stressful events , 2016, Comput. Hum. Behav..

[60]  S. Menard Logistic Regression: From Introductory to Advanced Concepts and Applications , 2009 .

[61]  David Cornforth,et al.  Identifying the High-Value Social Audience from Twitter through Text-Mining Methods , 2015 .

[62]  Raymond Chiong,et al.  Malicious Web Domain Identification using Online Credibility and Performance Data by Considering the Class Imbalance Issue , 2018, Ind. Manag. Data Syst..

[63]  P. N. Suganthan,et al.  Ensemble particle swarm optimizer , 2017, Appl. Soft Comput..

[64]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[65]  Satinder P. Singh,et al.  Introduction , 2002, British Journal of Ophthalmology.

[66]  Xing Liu,et al.  Particle swarm optimization-based feature selection in sentiment classification , 2016, Soft Comput..

[67]  Hesamoddin Jahanian,et al.  Support vector machine classification of arterial volume‐weighted arterial spin tagging images , 2016, Brain and behavior.

[68]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[69]  Andreas Holzinger,et al.  Data Mining with Decision Trees: Theory and Applications , 2015, Online Inf. Rev..

[70]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[71]  Piotr Szwed,et al.  OpenCL Implementation of PSO Algorithm for the Quadratic Assignment Problem , 2015, ICAISC.

[72]  Raymond Chiong,et al.  Identifying malicious web domains using machine learning techniques with online credibility and performance data , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[73]  Piotr Szwed,et al.  Multi-swarm PSO algorithm for the Quadratic Assignment Problem: a massive parallel implementation on the OpenCL platform , 2015, ArXiv.

[74]  Erfu Yang,et al.  A Novel Active Semisupervised Convolutional Neural Network Algorithm for SAR Image Recognition , 2017, Comput. Intell. Neurosci..

[75]  Burairah Hussin,et al.  Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization , 2013 .

[76]  Simon Fong,et al.  Accelerated Particle Swarm Optimization and Support Vector Machine for Business Optimization and Applications , 2011, NDT.

[77]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[78]  Colin Campbell,et al.  Learning with Support Vector Machines , 2011, Learning with Support Vector Machines.

[79]  David Barber,et al.  Nearest neighbour classification , 2011 .

[80]  Raymond Chiong,et al.  Why Is Optimization Difficult? , 2009, Nature-Inspired Algorithms for Optimisation.

[81]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[82]  Xinmiao Li,et al.  A Global Optimization Approach to Multi-Polarity Sentiment Analysis , 2015, PloS one.