Our experiments with capital markets data suggest that the domain can be e ectively modeled by classi cation rules induced from available historical data for the purpose of making gainful predictions for equity investments. New classi cation techniques developed at IBM Research, including minimal rule generation (R-MINI) and contextual feature analysis, seem robust enough for consistently extracting useful information from noisy domains such as nancial markets. We will brie y introduce the rationale for our minimal rule generation technique, and the motivation for the use of contextual information in analyzing features. We will then describe our experience from several experiments with the S&P 500 data, illustrating the general methodology, and the results of correlations and simulated managed investment based on classi cation rules generated by R-MINI. We will sketch how the rules for classi cations can be e ectively used for numerical prediction, and eventually to an investment policy. Both the development of robust \minimal" classi cation rule generation, as well as its application to the nancial markets, are part of an
[1]
Nada Lavrac,et al.
The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains
,
1986,
AAAI.
[2]
Giulia Pagallo,et al.
Learning DNF by Decision Trees
,
1989,
IJCAI.
[3]
David Haussler,et al.
Learnability and the Vapnik-Chervonenkis dimension
,
1989,
JACM.
[4]
Sholom M. Weiss,et al.
Computer Systems That Learn
,
1990
.
[5]
J. Ross Quinlan,et al.
C4.5: Programs for Machine Learning
,
1992
.
[6]
Sholom M. Weiss,et al.
Rule-Based Regression
,
1993,
IJCAI.
[7]
Sholom M. Weiss,et al.
Optimized rule induction
,
1993,
IEEE Expert.
[8]
Alok Aggarwal,et al.
Finding a minimum weight K-link path in graphs with Monge property and applications
,
1993,
SCG '93.
[9]
Se June Hong,et al.
Use of Contextaul Information for Feature Ranking and Discretization
,
1997,
IEEE Trans. Knowl. Data Eng..
[10]
Jorma Rissanen,et al.
Stochastic Complexity in Statistical Inquiry
,
1989,
World Scientific Series in Computer Science.