Attribute importance measurement method based on data coordination degree

Abstract The increasing scale of data information cause the great amount of irrelevant attributes, which becomes a challenging issue for machine learning. Therefore, removing the redundancy through sorting the attributes with appropriate significance has attracted wide attention in academic and application. Taking the knowledge hidden in the data system as the carrier and the inclusion relationship between sets as the basis, this paper proposes the concept of decision coordination degree. Then a composite attribute importance measurement based on core data is established (BCD-AICM). Further the basic properties and features of BCD-AICM are discussed. Finally, the similarities and differences between BCD-AICM and the existing attribute importance measurement methods are discussed using eight UCI data sets. The theoretical analysis and experiments results show that the BCD-AICM has good interpretability and structural characteristics. This method enriches the existing related theories and has broad application prospects in the fields of fuzzy decision-making, knowledge acquisition, resource management, and artificial intelligence etc.

[1]  Xizhao Wang,et al.  Tolerance rough fuzzy decision tree , 2018, Inf. Sci..

[2]  Xiaoyue Liu,et al.  Some new dual hesitant fuzzy aggregation operators based on Choquet integral and their applications to multiple attribute decision making , 2014, J. Intell. Fuzzy Syst..

[3]  Concepción Garrido,et al.  Extraction of attribute importance from satisfaction surveys with data mining techniques: a comparison between neural networks and decision trees , 2017 .

[4]  Qimai Chen,et al.  The Method for Data Reduction Based on Evaluation of Attribute Significance , 2010, 2010 2nd International Workshop on Intelligent Systems and Applications.

[5]  Wang Guo,et al.  Decision Table Reduction based on Conditional Information Entropy , 2002 .

[6]  Jianhua Dai,et al.  An Uncertainty Measure for Incomplete Decision Tables and Its Applications , 2013, IEEE Transactions on Cybernetics.

[7]  Hong Shen,et al.  Incremental feature selection based on rough set in dynamic incomplete data , 2014, Pattern Recognit..

[8]  Bao Qing Hu,et al.  Attribute reduction in ordered decision tables via evidence theory , 2016, Inf. Sci..

[9]  Wei-Zhi Wu,et al.  Attribute reduction based on evidence theory in incomplete decision systems , 2008, Inf. Sci..

[10]  Chenxia Jin,et al.  Feature selection with partition differentiation entropy for large-scale data sets , 2016, Inf. Sci..

[11]  Chenxia Jin,et al.  Knowledge change rate-based attribute importance measure and its performance analysis , 2017, Knowl. Based Syst..

[12]  Chenxia Jin,et al.  Constructing importance measure of attributes in covering decision table , 2015, Knowl. Based Syst..

[13]  Witold Pedrycz,et al.  A new model for intuitionistic fuzzy multi-attributes decision making , 2016, Eur. J. Oper. Res..

[14]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[15]  Guoyin Wang,et al.  Monotonic uncertainty measures for attribute reduction in probabilistic rough set model , 2015, Int. J. Approx. Reason..

[16]  Hu Xu,et al.  Uncertain multi-attributes decision making method based on interval number with probability distribution weighted operators and stochastic dominance degree , 2016, Knowl. Based Syst..

[17]  Qiang Zhang,et al.  Approaches to multiple-criteria group decision making based on interval-valued intuitionistic fuzzy Choquet integral with respect to the generalized λ-Shapley index , 2013, Knowl. Based Syst..

[18]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[19]  Shu-Ping Wan,et al.  Aggregating decision information into Atanassov's intuitionistic fuzzy numbers for heterogeneous multi-attribute group decision making , 2016, Appl. Soft Comput..

[20]  Jun Lu,et al.  Research of Improved Attribute Reduction Algorithm Based on Data Mining of Rough Set , 2014 .

[21]  Priyanka Sharma,et al.  Discovery of Classification Rules Using Distributed Genetic Algorithm , 2015 .

[22]  Hong Zhao,et al.  A cost sensitive decision tree algorithm based on weighted class distribution with batch deleting attribute mechanism , 2017, Inf. Sci..

[23]  Xiaojun Xie,et al.  A novel incremental attribute reduction approach for dynamic incomplete decision systems , 2018, Int. J. Approx. Reason..

[24]  Jianxin Wu,et al.  Linear Regression-Based Efficient SVM Learning for Large-Scale Classification , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Jiye Liang,et al.  A new method for measuring uncertainty and fuzziness in rough set theory , 2002, Int. J. Gen. Syst..

[26]  Kyoungok Kim,et al.  A hybrid classification algorithm by subspace partitioning through semi-supervised decision tree , 2016, Pattern Recognit..

[27]  Yong Wang,et al.  Online active learning of decision trees with evidential data , 2016, Pattern Recognit..

[28]  Qinghua Hu,et al.  Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation , 2007, Pattern Recognit..

[29]  Guoyin Wang,et al.  An incremental approach for attribute reduction based on knowledge granularity , 2016, Knowl. Based Syst..