A comparative analysis of machine learning models for quality pillar assessment of SaaS services by multi-class text classification of users' reviews

Abstract Software as a Service (SaaS) has emerged as the most widely used of all the current software delivery models. With the growth of edge computing, as SaaS services increasingly become distributed, selecting the best SaaS provider from those available is challenging but it is of critical importance. In the recent past, well-known cloud service providers such as Amazon Web Services and Microsoft have developed frameworks and service quality pillars for cloud applications. However, there are currently no mechanisms for product users to know if and to what extent a service satisfies the defined service pillar. Having such information would enable users to form trustworthy associations in edge computing. In this paper, we address this drawback by adopting a systematic approach of analysing customer reviews related to SaaS products and ascertain to which service quality pillar they refer. We use eleven traditional machine learning classification approaches and a weighted voting ensemble of these classifiers to achieve this task and test the performance of each of them. Since the dataset is unbalanced in terms of sample distribution per class, we use 10-fold cross-validation on the training dataset to determine the best parameters for each machine learning algorithm to achieve optimal performance. Friedman test and Nemenyi’s post hoc test is applied to identify the significant difference among the classifiers performance during cross-validation. Based on the experimental results, a comparative analysis is conducted to identify the best performing machine learning classification model on the SaaS reviews. The results show that the performance of the logistic regression model has a higher performance among the individual classifiers and the weighted voting ensemble shows minimal improvement in overall performance.

[1]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[2]  Teruo Higashino,et al.  Edge-centric Computing: Vision and Challenges , 2015, CCRV.

[3]  Sophia Ananiadou,et al.  Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.

[4]  Elizabeth Chang,et al.  An MCDM method for cloud service selection using a Markov chain and the best-worst method , 2018, Knowl. Based Syst..

[5]  Y. Singer,et al.  Ultraconservative online algorithms for multiclass problems , 2003 .

[6]  Gustavo Rau de Almeida Callou,et al.  Decision making in cloud environments: an approach based on multiple-criteria decision analysis and stochastic models , 2018, Journal of Cloud Computing.

[7]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[8]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[9]  Valentina Casola,et al.  Per-service security SLAs for cloud security management: model and implementation , 2018, Int. J. Grid Util. Comput..

[10]  Jan Gorodkin,et al.  Comparing two K-category assignments by a K-category correlation coefficient , 2004, Comput. Biol. Chem..

[11]  Charu C. Aggarwal,et al.  A Survey of Text Classification Algorithms , 2012, Mining Text Data.

[12]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[13]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[14]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[15]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[16]  Juan Jose García Adeva,et al.  Automatic text classification to support systematic reviews in medicine , 2014, Expert Syst. Appl..

[17]  Sebastian Raschka,et al.  Python Machine Learning , 2015 .

[18]  Sebastian Raschka,et al.  Naive Bayes and Text Classification I - Introduction and Theory , 2014, ArXiv.

[19]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[20]  Bruno Trstenjak,et al.  on Intelligent Manufacturing and Automation , 2013 KNN with TF-IDF Based Framework for Text Categorization , 2014 .

[21]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[22]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[23]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[24]  Andreas Müller,et al.  Introduction to Machine Learning with Python: A Guide for Data Scientists , 2016 .

[25]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[26]  Tai-Yue Wang,et al.  Fuzzy support vector machine for multi-class text categorization , 2007, Inf. Process. Manag..

[27]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[28]  Jason D. M. Rennie,et al.  Improving Multiclass Text Classification with the Support Vector Machine , 2001 .

[29]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[30]  Rajkumar Buyya,et al.  A framework for ranking of cloud computing services , 2013, Future Gener. Comput. Syst..

[31]  Levent Özgür,et al.  Text Categorization with Class-Based and Corpus-Based Keyword Selection , 2005, ISCIS.

[32]  Sangwon Lee,et al.  A Hybrid Multi-Criteria Decision-Making Model for a Cloud Service Selection Problem Using BSC, Fuzzy Delphi Method and Fuzzy AHP , 2016, Wirel. Pers. Commun..

[33]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[34]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[35]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[36]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[37]  Stephen M. Omohundro,et al.  Five Balltree Construction Algorithms , 2009 .

[38]  Yiming Yang,et al.  An Evaluation of Statistical Approaches to Text Categorization , 1999, Information Retrieval.

[39]  Tong Zhang,et al.  Text Categorization Based on Regularized Linear Classification Methods , 2001, Information Retrieval.

[40]  Arun Kumar Sangaiah,et al.  Integrated quality of user experience and quality of service approach to service selection in internet of services , 2017, Int. J. Grid Util. Comput..

[41]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[42]  Yang Liu,et al.  Multi-class sentiment classification: The experimental comparisons of feature selection and machine learning algorithms , 2017, Expert Syst. Appl..

[43]  Sabri Boughorbel,et al.  Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric , 2017, PloS one.

[44]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[45]  Amparo Alonso-Betanzos,et al.  Stream change detection via passive-aggressive classification and Bernoulli CUSUM , 2015, Inf. Sci..

[46]  Chih-Fong Tsai,et al.  The distance function effect on k-nearest neighbor classification for medical datasets , 2016, SpringerPlus.

[47]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[48]  Farookh Khadeer Hussain,et al.  Parallel Cloud Service Selection and Ranking Based on QoS History , 2014, International Journal of Parallel Programming.

[49]  Luis Pedro Coelho,et al.  Building Machine Learning Systems with Python , 2013 .

[50]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[51]  Jason D. M. Rennie Improving multi-class text classification with Naive Bayes , 2001 .

[52]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[53]  Cesare Furlanello,et al.  A Comparison of MCC and CEN Error Measures in Multi-Class Prediction , 2010, PloS one.

[54]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[55]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[56]  Arie Ben-David,et al.  Comparison of classification accuracy using Cohen's Weighted Kappa , 2008, Expert Syst. Appl..

[57]  Chiranjeev Kumar,et al.  A Novel Framework for Cloud Service Evaluation and Selection Using Hybrid MCDM Methods , 2018 .

[58]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[59]  Chiranjeev Kumar,et al.  Prioritizing the solution of cloud service selection using integrated MCDM methods under Fuzzy environment , 2017, The Journal of Supercomputing.

[60]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[61]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[62]  Eric Kuada,et al.  Trust modelling for opportunistic cloud services , 2018, Int. J. Grid Util. Comput..

[63]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .