Theory of Rough Sets provides good foundations for the attribute reduction processes in data mining. For numeric attributes, it is enriched with appropriately designed discretization methods. However, not much has been done for symbolic attributes with large numbers of values. The paper presents a framework for the symbolic value partition problem, which is more general than the attribute reduction, and more complicated than the discretization problems.We demonstrate that such problem can be converted into a series of the attribute reduction phases. We propose an algorithm searching for a (sub)optimal attribute reduct coupled with attribute value domains partitions. Experimental results show that the algorithm can help in computing smaller rule sets with better coverage, comparing to the standard attribute reduction approaches.
[1]
Fan Min,et al.
The M-Relative Reduct Problem
,
2006,
RSKT.
[2]
Zdzislaw Pawlak,et al.
Some Issues on Rough Sets
,
2004,
Trans. Rough Sets.
[3]
Catherine Blake,et al.
UCI Repository of machine learning databases
,
1998
.
[4]
Jerzy W. Grzymala-Busse,et al.
Transactions on Rough Sets I
,
2004,
Lecture Notes in Computer Science.
[5]
Bernhard Ganter,et al.
Formal Concept Analysis: Mathematical Foundations
,
1998
.
[6]
Sinh Hoa Nguyen,et al.
Regularity analysis and its applications in data mining
,
2000
.