Interpretability of Computational Models for Sentiment Analysis

Sentiment analysis, which is also known as opinion mining, has been an increasingly popular research area focusing on sentiment classification/regression. In many studies, computational models have been considered as effective and efficient tools for sentiment analysis . Computational models could be built by using expert knowledge or learning from data. From this viewpoint, the design of computational models could be categorized into expert based design and data based design. Due to the vast and rapid increase in data, the latter approach of design has become increasingly more popular for building computational models. A data based design typically follows machine learning approaches, each of which involves a particular strategy of learning. Therefore, the resulting computational models are usually represented in different forms. For example, neural network learning results in models in the form of multi-layer perceptron network whereas decision tree learning results in a rule set in the form of decision tree. On the basis of above description, interpretability has become a main problem that arises with computational models. This chapter explores the significance of interpretability for computational models as well as analyzes the factors that impact on interpretability. This chapter also introduces several ways to evaluate and improve the interpretability for computational models which are used as sentiment analysis systems. In particular, rule based systems , a special type of computational models, are used as an example for illustration with respects to evaluation and improvements through the use of computational intelligence methodologies.

[1]  Yi Hu,et al.  Document sentiment classification by exploring description model of topical terms , 2011, Comput. Speech Lang..

[2]  Jadzia Cendrowska,et al.  PRISM: An Algorithm for Inducing Modular Rules , 1987, Int. J. Man Mach. Stud..

[3]  Mihaela Cocea,et al.  Sentiment Analysis: Towards a Tool for Analysing Real-Time Students Feedback , 2014, 2014 IEEE 26th International Conference on Tools with Artificial Intelligence.

[5]  David D. Lewis,et al.  A comparison of two learning algorithms for text categorization , 1994 .

[6]  T. Ross Fuzzy Logic with Engineering Applications , 1994 .

[7]  Mihaela Cocea,et al.  Learning Sentiment from Students' Feedback for Real-Time Interventions in Classrooms , 2014, ICAIS.

[8]  W E Deming,et al.  On probability as a basis for action. , 1975, Methods of information in medicine. Supplement.

[9]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[10]  Han Liu,et al.  Network based rule representation for knowledge discovery and predictive modelling , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[11]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[12]  Shourya Roy,et al.  Fast and accurate text classification via multiple linear discriminant projections , 2003, The VLDB Journal.

[13]  Masrah Azrifah Azmi Murad,et al.  Sentiment classification of customer reviews based on fuzzy logic , 2010, 2010 International Symposium on Information Technology.

[14]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[15]  Hua Yu,et al.  A direct LDA algorithm for high-dimensional data - with application to face recognition , 2001, Pattern Recognit..

[16]  A. M. Uttley,et al.  The Design of Conditional Probability Computers , 1959, Inf. Control..

[17]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[18]  Alexander E. Gegov,et al.  Categorization and Construction of Rule Based Systems , 2014, EANN.

[19]  J M Bland,et al.  The intracluster correlation coefficient in cluster randomisation , 1998, BMJ.

[20]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[21]  Han Liu,et al.  Rule Based Systems for Big Data , 2015 .

[22]  Mikolás Janota,et al.  Digital Object Identifier (DOI): , 2000 .

[23]  Carolyn Pillers Dobler,et al.  The Practice of Statistics , 2001, Technometrics.

[24]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[25]  Tapio Elomaa,et al.  An Analysis of Reduced Error Pruning , 2001, J. Artif. Intell. Res..

[26]  Igor Kononenko,et al.  Bayesian neural networks , 1989, Biological Cybernetics.

[27]  Kenneth J. Schlager,et al.  Systems engineering-key to modern development , 1956, IRE Transactions on Engineering Management.

[28]  Han Liu,et al.  J-measure based hybrid pruning for complexity reduction in classification rules , 2013 .

[29]  Jr. Charles Marion Higgins,et al.  Classification and approximation with rule-based networks , 1993 .

[30]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[31]  A. D. Hall,et al.  A Methodology for Systems Engineering , 1962 .

[32]  JOHANNES FÜRNKRANZ,et al.  Separate-and-Conquer Rule Learning , 1999, Artificial Intelligence Review.

[33]  Ivan Jordanov,et al.  An overview of the use of neural networks for data mining tasks , 2012, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[34]  Jacques Savoy,et al.  Feature Selection in Sentiment Analysis , 2012, CORIA.

[35]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[36]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[37]  Chuanjun Zhao,et al.  Fuzzy Sentiment Membership Determining for Sentiment Classification , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[38]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[39]  Randy Kerber,et al.  ChiMerge: Discretization of Numeric Attributes , 1992, AAAI.

[40]  Francesco Colace,et al.  A Probabilistic Approach to Tweets' Sentiment Classification , 2013, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction.

[41]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[42]  Han Liu,et al.  Unified framework for construction of rule based classification systems , 2015 .