Knowledge discovery techniques for predicting country investment risk

This paper presents insights gained from applying knowledge discovery in databases (KDD) processes to develop intelligent models that classify a country's investing risk based on a variety of factors. Inferential data mining techniques, such as C5.0, and intelligent learning techniques, such as neural networks, were applied to a dataset of 27 variables (economic, stock market performance/risk, and regulatory efficiencies) on 52 countries, whose investing risk categories were assessed in a Wall Street Journal survey of international experts. The results are promising: the resulting models assigned most countries to the same risk category as the experts did. Implementation details, results, and future plans are also presented.
