Data Mining and Knowledge Discovery in Business Databases

The rapid and constant growth of databases in business, government, and science has far outpaced our ability to interpret and make sense of this data avalanche, creating a need for a new generation of tools and techniques for intelligent and automated database analysis. These tools and techniques are the subject of the rapidly emerging field of data mining and knowledge discovery in databases (KDD). This paper surveys the state of the art in this field, with a particular focus on the issues and challenges in applying KDD to business databases.

[1]  Daniel Kahneman,et al.  Probabilistic reasoning , 1993 .

[2]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[3]  Gregory Piatetsky-Shapiro,et al.  The interestingness of deviations , 1994 .

[4]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[5]  John A. Major,et al.  EFD: A hybrid knowledge/statistical‐based system for the detection of fraud , 1992, Int. J. Intell. Syst..

[6]  David Heckerman,et al.  Bayesian Networks for Knowledge Discovery , 1996, Advances in Knowledge Discovery and Data Mining.

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  P. Langley,et al.  Computational Models of Scientific Discovery and Theory Formation , 1990 .

[9]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[10]  C. J. V. Rijsbergen,et al.  Rough Sets, Fuzzy Sets and Knowledge Discovery , 1994, Workshops in Computing.

[11]  Gregory Piatetsky-Shapiro,et al.  KDD-93: Progress and Challenges in Knowledge Discovery in Databases , 1994, AI Mag..

[12]  Daniel E. O'Leary Some Privacy Issues in Knowledge Discovery: The OECD Personal Privacy Guidelines , 1995, IEEE Expert.

[13]  Marc Rotenberg,et al.  Protecting privacy , 1992, CACM.

[14]  J. Friedman Multivariate adaptive regression splines , 1990 .

[15]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[16]  Donald J. Berndt,et al.  Finding Patterns in Time Series: A Dynamic Programming Approach , 1996, Advances in Knowledge Discovery and Data Mining.

[17]  Ron Kohavi,et al.  Wrappers for performance enhancement and oblivious decision graphs , 1995 .

[18]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[19]  JoBea Way,et al.  The evolution of synthetic aperture radar systems and their progression to the EOS SAR , 1991, IEEE Trans. Geosci. Remote. Sens..

[20]  Philip K. Chan,et al.  Systems for Knowledge Discovery in Databases , 1993, IEEE Trans. Knowl. Data Eng..

[21]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[22]  T. Anand,et al.  SPOTLIGHT: a data explanation system , 1992, Proceedings Eighth Conference on Artificial Intelligence for Applications.

[23]  Jiawei Han,et al.  Knowledge Discovery in Databases: An Attribute-Oriented Approach , 1992, VLDB.

[24]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[25]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[26]  Gregory Piatetsky-Shapiro,et al.  Knowledge Discovery in Personal Data vs. Privacy: A mini-symposium , 1995, IEEE Expert.