Granular computing-based approach of rule learning for binary classification

Rule learning is one of the most popular types of machine-learning approaches, which typically follow two main strategies: ‘divide and conquer’ and ‘separate and conquer’. The former strategy is aimed at induction of rules in the form of a decision tree, whereas the latter one is aimed at direct induction of if–then rules. Due to the case that the divide and conquer strategy could result in the replicated sub-tree problem, which not only leads to overfitting but also increases the computational complexity in classifying unseen instances, researchers have thus been motivated to develop rule learning approaches through the separate and conquer strategy. In this paper, we focus on investigation of the Prism algorithm, since it is a representative one that follows the separate and conquer strategy, and is aimed at learning a set of rules for each class in the setting of granular computing, where each class (referred to as target class) is viewed as a granule. The Prism algorithm shows highly comparable performance to the most popular algorithms, such as ID3 and C4.5, which follow the divide and conquer strategy. However, due to the need to learn a rule set for each class, Prism usually produces very complex rule-based classifiers. In real applications, there are many problems that involve one target class only, so it is not necessary to learn a rule set for each class, i.e., only a set of rules for the target class needs to be learned and a default rule is used to indicate the case of non-target classes. To address the above issues of Prism, we propose a new version of the algorithm referred to as PrismSTC, where ‘STC’ stands for ‘single target class’. Our experimental results show that PrismSTC leads to production of simpler rule-based classifiers without loss of accuracy in comparison with Prism. PrismSTC also demonstrates sufficiently good performance comparing with C4.5.

[1]  Hong Zhao,et al.  A cost sensitive decision tree algorithm with two adaptive mechanisms , 2015, Knowl. Based Syst..

[2]  Tapio Elomaa,et al.  An Analysis of Reduced Error Pruning , 2001, J. Artif. Intell. Res..

[3]  Shyi-Ming Chen,et al.  Fuzzy multiple attributes group decision-making based on the extension of TOPSIS method and interval type-2 fuzzy sets , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[4]  Han Liu,et al.  Induction of classification rules by Gini-index based rule generation , 2018, Inf. Sci..

[5]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Max Kuhn,et al.  Applied Predictive Modeling , 2013 .

[8]  Shyi-Ming Chen,et al.  Group decision making using incomplete fuzzy preference relations based on the additive consistency and the order consistency , 2014, Inf. Sci..

[9]  W. Pedrycz,et al.  Granular computing and intelligent systems : design with information granules of higher order and higher type , 2011 .

[10]  Han Liu,et al.  Granular computing based machine learning: a big data processing approach , 2018 .

[11]  Yu Christine Chen,et al.  Transient stability assessment via decision trees and multivariate adaptive regression splines , 2017 .

[12]  Hong Zhao,et al.  A cost sensitive decision tree algorithm based on weighted class distribution with batch deleting attribute mechanism , 2017, Inf. Sci..

[13]  Hans-Peter Piepho,et al.  A comparison of random forests, boosting and support vector machines for genomic selection , 2011, BMC proceedings.

[14]  Shyi-Ming Chen,et al.  A fuzzy reasoning approach for rule-based systems based on fuzzy logics , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[15]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[16]  JOHANNES FÜRNKRANZ,et al.  Separate-and-Conquer Rule Learning , 1999, Artificial Intelligence Review.

[17]  Witold Pedrycz,et al.  Sentiment Analysis and Ontology Engineering - An Environment of Computational Intelligence , 2016, Sentiment Analysis and Ontology Engineering.

[18]  Donato Malerba,et al.  Simplifying Decision Trees by Pruning and Grafting: New Results (Extended Abstract) , 1995, ECML.

[19]  Han Liu,et al.  Induction of Modular Classification Rules by Information Entropy Based Rule Generation , 2016 .

[20]  Han Liu,et al.  Ensemble Learning Approaches , 2016 .

[21]  Han Liu,et al.  Interpretability of Computational Models for Sentiment Analysis , 2016, Sentiment Analysis and Ontology Engineering.

[22]  W. Pedrycz,et al.  Information granules and their use in schemes of knowledge management , 2011, Sci. Iran..

[23]  Shyi-Ming Chen,et al.  Granular Computing and Intelligent Systems , 2011 .

[24]  Jerry M. Mendel,et al.  Interval Type-2 Fuzzy Logic Systems Made Simple , 2006, IEEE Transactions on Fuzzy Systems.

[25]  William Zhu,et al.  A Competition Strategy to Cost-Sensitive Decision Trees , 2012, RSKT.

[26]  Witold Pedrycz,et al.  The development of granular rule-based systems: a study in structural model compression , 2017, GRC 2017.

[27]  Kelly Reynolds,et al.  Using Machine Learning to Detect Cyberbullying , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[28]  Shyi-Ming Chen,et al.  Parallel Cat Swarm Optimization , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[29]  Han Liu,et al.  Network based rule representation for knowledge discovery and predictive modelling , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[30]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[31]  Robert Ivor John,et al.  Interval-valued fuzzy decision trees with optimal neighbourhood perimeter , 2014, Appl. Soft Comput..

[32]  Pei-wei Tsai,et al.  Enhanced parallel cat swarm optimization based on the Taguchi method , 2012, Expert Syst. Appl..

[33]  Shyi-Ming Chen,et al.  Fuzzy Multiple Criteria Hierarchical Group Decision-Making Based on Interval Type-2 Fuzzy Sets , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[34]  Catalin Pol Fuzzy decision trees , 2012 .

[35]  Yiyu Yao,et al.  Perspectives of granular computing , 2005, 2005 IEEE International Conference on Granular Computing.

[36]  Han Liu,et al.  Rule-based systems: a granular computing perspective , 2016, Granular Computing.

[37]  Han Liu,et al.  Transformation of discriminative single-task classification into generative multi-task classification in machine learning context , 2017, 2017 Ninth International Conference on Advanced Computational Intelligence (ICACI).

[38]  Maryam Tayefi,et al.  The application of a decision tree to establish the parameters associated with hypertension , 2017, Comput. Methods Programs Biomed..

[39]  Ayca Altay,et al.  Fuzzy Decision Trees , 2016 .

[40]  Rui Zhao,et al.  Automatic detection of cyberbullying on social networks based on bullying features , 2016, ICDCN.

[41]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[42]  Ali Alkhalifah,et al.  Urdu text classification using decision trees , 2015, 2015 12th International Conference on High-capacity Optical Networks and Enabling/Emerging Technologies (HONET).

[43]  J. Ross Quinlan,et al.  Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[44]  Keeley A. Crockett,et al.  On predicting learning styles in conversational intelligent tutoring systems using fuzzy decision trees , 2017, Int. J. Hum. Comput. Stud..

[45]  Witold Pedrycz,et al.  Granular Computing and Decision-Making: Interactive and Iterative Approaches , 2015 .

[46]  Han Liu,et al.  Fuzzy information granulation towards interpretable sentiment analysis , 2017, GRC 2017.

[47]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[48]  Robert Ivor John,et al.  Interval-valued fuzzy decision trees , 2010, International Conference on Fuzzy Systems.

[49]  Han Liu,et al.  Generation of Classification Rules , 2016 .

[50]  Jadzia Cendrowska,et al.  PRISM: An Algorithm for Inducing Modular Rules , 1987, Int. J. Man Mach. Stud..

[51]  Shyi-Ming Chen,et al.  Forecasting enrollments using high‐order fuzzy time series and genetic algorithms , 2006, Int. J. Intell. Syst..

[52]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[53]  Jingtao Yao,et al.  Information granulation and granular relationships , 2005, 2005 IEEE International Conference on Granular Computing.

[54]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[55]  Witold Pedrycz,et al.  Information granularity, big data, and computational intelligence , 2015 .