Knowledge Discovery in Databases and Decision Support

Two types of decision support systems (DSS) are discussed in Chapter 2: model-oriented and data-oriented. The assumption underlying model-oriented support is that accurate models of the decision problem are available or can be constructed prior to decision making. This approach includes optimization, simulation, decision analytic and statistical models. Various statistical methodologies can be used both to construct models and to estimate their parameters. Traditionally, these methods have been ap-plied to relatively small data sets; the resulting models were used to verify given hy-potheses, identify relationships among variables, and formulate predictions and sce-narios. Statistical methods require significant expertise in their application and interpreta-tion of results. These methods are difficult to use when there is little explicit knowl-edge about the issues and when the problems are ill-defined but described by large or very large amount of data. Therefore, these methods are normally used by analysts and researchers rather then directly by decision makers. While one may suspect that valuable knowledge is buried in the data, it was not until recently, that tools for knowledge extraction and presentation in an easily readable form have became avail-able. Three types of developments made data-oriented DSSs viable and effective. Artifi-cial intelligence (AI) adopted many statistical methods and embedded them in systems that efficiently search for patterns, derive rules, classify data according to some higher level concepts, and construct models from very large databases (Briscoe and Caelli 1996; Kohavi 1998). AI researchers have also developed methods to deal with such

[1]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[2]  Shinichi Morishita,et al.  On Classification and Regression , 1998, Discovery Science.

[3]  Padhraic Smyth,et al.  Statistical Themes and Lessons for Data Mining , 2004, Data Mining and Knowledge Discovery.

[4]  Raymond J. Mooney,et al.  Combining symbolic and connectionist learning methods to refine certainty-factor rule-bases , 1996 .

[5]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[6]  U. M. Feyyad Data mining and knowledge discovery: making sense out of data , 1996 .

[7]  Gordon Miller,et al.  Decision Making: Descriptive, Normative, and Prescriptive Interactions , 1990 .

[8]  George H. John Enhancements to the data mining process , 1997 .

[9]  S. Nilsson,et al.  Identification of Biodiversity and Other Forest Attributes for Sustainable Forest Management: Siberian Forest Case Study , 1998 .

[10]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[11]  Sholom M. Weiss,et al.  Predictive data mining - a practical guide , 1997 .

[12]  P R Limb,et al.  Data Mining — Tools and Techniques , 1996 .

[13]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[14]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[15]  Bob Francis,et al.  Silicon Graphics Inc. , 1993 .

[16]  Ramasamy Uthurusamy,et al.  From Data Mining to Knowledge Discovery: Current Challenges and Future Directions , 1996, Advances in Knowledge Discovery and Data Mining.

[17]  K. P. Barr,et al.  Use of a Decision Tree To Select the Mud System for the Oso Field, Nigeria , 1995 .

[18]  Chris J. Skinner,et al.  Analysis of complex surveys , 1991 .

[19]  Ronald J. Brachman,et al.  The Process of Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[20]  Ron Kohavi,et al.  Improving simple Bayes , 1997 .

[21]  Sholom M. Weiss,et al.  Data Mining and Forecasting in Large-Scale Telecommunication Networks , 1996, IEEE Expert.

[22]  Thomas E. McKone,et al.  Decision Tree Method for the Classification of Chemical Pollutants: Incorporation of Across-Chemical Variability and Within-Chemical Uncertainty , 1998 .

[23]  Ryszard S. Michalski,et al.  Knowledge acquisition by encoding expert rules versus computer induction from examples: a case study involving soybean pathology , 1999, Int. J. Hum. Comput. Stud..

[24]  Ron Kohavi,et al.  Visualizing the Simple Bayesian Classi er , 1997 .

[25]  Christopher M. Bishop,et al.  Classification and regression , 1997 .

[26]  Usama M. Fayyad,et al.  Data Mining and Knowledge Discovery: Making Sense Out of Data , 1996, IEEE Expert.

[27]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[28]  Evangelos Simoudis,et al.  Reality Check for Data Mining , 1996, IEEE Expert.

[29]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[30]  A. Agresti An introduction to categorical data analysis , 1997 .

[31]  Stephen R. Gardner Building the data warehouse , 1998, CACM.

[32]  Joseph P. Bigus,et al.  Data mining with neural networks , 1996 .