Database Mining: A Performance Perspective

The authors' perspective of database mining as the confluence of machine learning techniques and the performance emphasis of database technology is presented. Three classes of database mining problems involving classification, associations, and sequences are described. It is argued that these problems can be uniformly viewed as requiring discovery of rules embedded in massive amounts of data. A model and some basic operations for the process of rule discovery are described. It is shown how the database mining problems considered map to this model, and how they can be solved by using the basic operations proposed. An example is given of an algorithm for classification obtained by combining the basic rule discovery operations. This algorithm is efficient in discovering classification rules and has accuracy comparable to ID3, one of the best current classifiers. >

[1]  G. R. Dattatreya,et al.  Adaptive Pattern Recognition with Random Costs and Its Application to Decision Trees , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[3]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[4]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[5]  Jeffrey D. Ullman,et al.  Principles of database and knowledge-base systems, Vol. I , 1988 .

[6]  R. Gray,et al.  Applications of information theory to pattern recognition and the design of decision trees and trellises , 1988 .

[7]  Integration of Text Search with ORION , 1990, IEEE Data Eng. Bull..

[8]  Shalom Tsur,et al.  Data Dredging , 1990, IEEE Data Eng. Bull..

[9]  R. Brice,et al.  Finding interesting things in lots of data , 1990, Twenty-Third Annual Hawaii International Conference on System Sciences.

[10]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[11]  Wray Buntine,et al.  Collected Notes on the Workshop for Pattern Discovery in Large Databases , 1991 .

[12]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[13]  Gomer Thomas,et al.  Practitioner problems in need of database research , 1991, SGMD.

[14]  Tomasz Imielinski,et al.  An Interval Classifier for Database Mining Applications , 1992, VLDB.

[15]  Jiawei Han,et al.  Knowledge Discovery in Databases: An Attribute-Oriented Approach , 1992, VLDB.

[16]  Shamkant B. Navathe,et al.  Knowledge mining by imprecise querying: a classification-based approach , 1992, [1992] Eighth International Conference on Data Engineering.