Rule Discovery in Databases with Missing Values Based on Rough Set Model
暂无分享,去创建一个
One of the most important problems on rule induction methods is that measures used for rule search will be influenced by missing values. In this paper, a new approach to missing values is introduced, called rough estimation of conditional probabilities. This technique uses three estimation strategies, ground mean, lower and upper methods. Attributes which have missing values will be estimated by these methods and will be checked by constraints for probabilistic rules. The proposed method was evaluated on medical databases, the experimental results of which show that induced rules correctly represented experts'knowledge.
[1] Leo Breiman,et al. Classification and Regression Trees , 1984 .
[2] Tomasz Imielinski,et al. Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.
[3] Nada Lavrac,et al. The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.
[4] Jerzy W. Grzymala-Busse,et al. Rough Sets , 1995, Commun. ACM.