It is well known that data mining is a process of discovering unknown, hidden information from a large amount of data, extracting valuable information, and using the information to make important business decisions. And data mining has been developed into a new information technology, including regression, decision tree, neural network, fuzzy set, rough set, support vector machine and so on. This paper puts forward a rough set-based multiple criteria linear programming (RS-MCLP) approach for solving classification problems in data mining. Firstly, we describe the basic theory and models of rough set and multiple criteria linear programming (MCLP) and analyse their characteristics and advantages in practical applications. Secondly, detailed analysis about their deficiencies are provided respectively. However, because of the existing mutual complementarities between them, we put forward and build the RS-MCLP methods and models which sufficiently integrate their virtues and overcome the adverse factors simultaneously. In addition, we also develop and implement these algorithm and models in SAS and Windows platform. Finally, many experiments show that RS-MCLP approach is prior to single MCLP model and other traditional classification methods in data mining.
[1]
Yong Shi,et al.
Data Mining in Credit Card Portfolio Management: A Multiple Criteria Decision Making Approach
,
2001
.
[2]
Andrzej Skowron,et al.
Rough set methods in feature selection and recognition
,
2003,
Pattern Recognit. Lett..
[3]
Lior Rokach,et al.
Data Mining And Knowledge Discovery Handbook
,
2005
.
[4]
Jerzy W. Grzymala-Busse,et al.
Rough Sets
,
1995,
Commun. ACM.
[5]
Yi Peng,et al.
Multiple criteria linear programming approach to data mining: Models, algorithm designs and software development
,
2003,
Optim. Methods Softw..
[6]
Hirotaka Nakayama,et al.
Theory of Multiobjective Optimization
,
1985
.
[7]
F. Glover,et al.
Simple but powerful goal programming models for discriminant problems
,
1981
.