Scalable classifiers with dynamic pruning
暂无分享,去创建一个
The paper presents an algorithm to solve the problem of classification for data mining applications. This is a decision tree classifier which uses modified gini index as the partitioning criteria. A pre-sorting technique is used to overcome the problem of sorting at each node of the tree. This technique is integrated with a breadth first tree growth strategy which enables us to calculate the best partition for each of the leaf nodes in a single scan of a database. We have implemented this algorithm using depth first tree growth strategy also. The algorithm uses a dynamic pruning approach which reduces the number of scans of the database and does away with a separate tree pruning phase. The proof of correctness, analysis and performance study are also presented.
[1] Gomer Thomas,et al. Practitioner problems in need of database research , 1991, SGMD.
[2] Tomasz Imielinski,et al. Research Directions in Knowledge Discovery , 1991, SIGMOD Rec..
[3] Philip S. Yu,et al. Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..
[4] Sholom M. Weiss,et al. Computer Systems That Learn , 1990 .
[5] Tomasz Imielinski,et al. Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..