Confirmation rule induction and its applications to coronary heart disease diagnosis and risk group discovery

The confirmation rule set concept, presented in this paper, provides a framework for reliable decision making. This framework enables flexible formation of a confirmation rule set, by incorporating rules induced by machine learning algorithms as well as human encoded expert rules. The only conditions for including a rule into the confirmation rule set are its high predictive accuracy and relative independence from other rules in the rule set. This paper introduces the concept of confirmation rule sets, together with an algorithm for selecting relatively independent rules from a set of acceptable confirmation rules. It presents also two approaches to confirmation rule induction: first, an exhaustive confirmation rule construction algorithm that was used to discover diagnostic rules in the coronary heart disease diagnosis problem, and second, a heuristic confirmation rule construction algorithm that was used for subgroup discovery in the coronary heart disease risk group detection problem.

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  M. Pazzani,et al.  Error Reduction through Learning Multiple Descriptions , 1996, Machine Learning.

[3]  Peter A. Flach,et al.  Predictive Performance of Weghted Relative Accuracy , 2000, PKDD.

[4]  Willi Klösgen,et al.  Explora: A Multipattern and Multistrategy Discovery Assistant , 1996, Advances in Knowledge Discovery and Data Mining.

[5]  Nada Lavrac,et al.  A Relevancy Filter for Constructive Induction , 1998, IEEE Intell. Syst..

[6]  P. Boland Majority Systems and the Condorcet Jury Theorem , 1989 .

[7]  Igor Kononenko,et al.  Inductive and Bayesian learning in medical diagnosis , 1993, Appl. Artif. Intell..

[8]  Peter Clark,et al.  The CN2 Induction Algorithm , 1989, Machine Learning.

[9]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[10]  Michael J. Pazzani,et al.  Trading off coverage for accuracy in forecasts: Applications to clinical data analysis , 1994 .

[11]  William B. Langdon,et al.  Genetic programming for combining classifiers , 2001 .

[12]  Stefan Wrobel,et al.  An Algorithm for Multi-relational Discovery of Subgroups , 1997, PKDD.

[13]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Kagan Tumer,et al.  Error Correlation and Error Reduction in Ensemble Classifiers , 1996, Connect. Sci..

[15]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[16]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[17]  Igor Kononenko,et al.  Machine learning in prognosis of the femoral neck fracture recovery , 1996, Artif. Intell. Medicine.

[18]  Nada Lavrac,et al.  Experiments with Noise Filtering in a Medical Domain , 1999, ICML.

[19]  Nada Lavrac,et al.  Confirmation Rule Sets , 2000, PKDD.

[20]  Igor Kononenko,et al.  Analysing and improving the diagnosis of ischaemic heart disease with machine learning , 1999, Artif. Intell. Medicine.

[21]  Ronald L. Rivest,et al.  Learning complicated concepts reliably and usefully , 1988, Annual Conference Computational Learning Theory.

[22]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[23]  M A Hlatky,et al.  Task force 6: cost effectiveness of assessment and management of risk factors , 1996 .

[24]  Nada Lavrac,et al.  Diagnostic Rules of Increased Reliability for Critical Medical Applications , 1999, AIMDM.

[25]  Nada Lavrac,et al.  Descriptive Induction through Subgroup Discovery: A Case Study in a Medical Domain , 2002, International Conference on Machine Learning.