A Novel Approach for Reducing Attributes and Its Application to Small Enterprise Financing Ability Evaluation

Attribute reduction is a preprocessing step for reducing high dimensionality when mining data from complex systems. Many researchers have proposed approaches to reduce attributes or select key features in multicriteria decision-making evaluation. In practice, existing attribute reduction approaches focus on improving classification accuracy or lowering computational cost, without considering how the reduction results affect the original data set. To help address this gap, we develop a novel attribute reduction approach that combines Pearson correlation analysis with significance-test discrimination to screen and identify the key characteristics of the original data set. The proposed model is verified using financing ability evaluation data for 713 small enterprises from a city commercial bank in China. The experimental results show that the proposed reduction model is efficient and effective. Moreover, our experimental findings help to identify qualified partners and to alleviate the difficulties that enterprises face when applying for loans.
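The abstract does not spell out the procedure, so the following is only a minimal sketch of how a correlation-plus-significance screen of this kind might look, not the authors' method. It assumes that "significance-test discrimination" can be approximated by a one-way ANOVA F statistic of each attribute against a binary default label, and that within any pair of attributes whose absolute Pearson correlation exceeds a threshold, the attribute with the smaller F statistic is dropped. The function names, the 0.8 threshold, the attribute names, and the toy data are all hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def f_statistic(values, labels):
    """One-way ANOVA F statistic of an attribute across label groups.

    Assumption: used here as a stand-in for the paper's significance test.
    """
    groups = {}
    for v, g in zip(values, labels):
        groups.setdefault(g, []).append(v)
    n, k = len(values), len(groups)
    grand = sum(values) / n
    means = {g: sum(vs) / len(vs) for g, vs in groups.items()}
    ss_between = sum(len(vs) * (means[g] - grand) ** 2 for g, vs in groups.items())
    ss_within = sum((v - means[g]) ** 2 for g, vs in groups.items() for v in vs)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

def reduce_attributes(data, labels, r_threshold=0.8):
    """Drop the less discriminating attribute of each highly correlated pair.

    data: dict mapping attribute name -> list of values (one per enterprise).
    r_threshold: hypothetical cut-off on |r|, not taken from the paper.
    """
    scores = {name: f_statistic(col, labels) for name, col in data.items()}
    names = list(data)
    kept = list(names)
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if a in kept and b in kept and abs(pearson(data[a], data[b])) > r_threshold:
                kept.remove(a if scores[a] < scores[b] else b)
    return kept

# Toy, made-up data for six enterprises (0 = non-default, 1 = default).
labels = [0, 0, 0, 1, 1, 1]
data = {
    "current_ratio": [1.0, 1.2, 1.1, 2.0, 2.2, 2.1],
    "quick_ratio": [0.9, 1.3, 1.0, 1.8, 2.3, 1.9],
    "firm_age": [5, 12, 8, 11, 6, 9],
}
kept = reduce_attributes(data, labels)
print(kept)
```

On this toy input, `quick_ratio` is removed because it is strongly correlated with `current_ratio` and separates the two label groups less sharply, while `firm_age` is kept because it duplicates neither. A real application would need a justified threshold and a significance level for the test.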
