New synthesis of bayesian network classifiers and cardiac spect image interpretation

A new family of Bayesian network classifiers is introduced and demonstrated to outperform existing classifiers. Of particular interest is use of these classifiers for interpretation of cardiac SPECT images. High classification performance on databases from a variety of other domains is also demonstrated. Cardiac SPECT (Single Photon Emission Computed Tomography) is a diagnostic technique used by physicians for assessing the perfusion of the heart's left ventricle. A physician reaches the diagnosis by comparing SPECT images taken from a patient at rest and at maximum stress. Interpretation of images by strictly visual techniques is burdened with error and inconsistency. Thus, assistance in quantifying and automating the diagnosis is sought. An important issue in automating the diagnosis is classification of left ventricle perfusion into a number of predetermined categories. The goal of this dissertation is to investigate the use of Bayesian methods for construction of classifiers that would assist in interpretation of cardiac SPECT images. These images and their descriptions are characterized by a significant amount of uncertainty. Bayesian methods build models by approximating the probability distribution of the variables in the problem domain; they are naturally well suited to deal with uncertainty. This research consisted of three main parts. (1) Data warehousing—assembling cardiac SPECT images and patient records into an easily accessible database and creating software manipulation of SPECT images. (2) Three-dimensional image processing—implementation of custom algorithms for extraction of features from SPECT images. (3) Learning Bayesian network classifiers—research of novel machine learning algorithms that use Bayesian techniques for creation of robust classifiers. The main contribution of this work is creation of a new family of Bayesian network classifier—their high performance classifying left ventricular perfusion is demonstrated. Additionally, it is shown that they outperform existing Bayesian network classifiers and machine learning algorithm C4.5 using data from University of California at Irvine Repository of Machine Learning Databases. Among other contributions is a method for automated extraction of features from cardiac SPECT images based on the creation of models of normal left ventricles, software for visualization of cardiac SPECT images, automated feature extraction, and creation of Bayesian network classifiers.

[1]  A. Cuarón,et al.  Interobserver variability in the interpretation of myocardial images with Tc-99m-labeled diphosphonate and pyrophosphate. , 1980, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[2]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[3]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[4]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[5]  Weiru Liu,et al.  Learning belief networks from data: an information theory based approach , 1997, CIKM '97.

[6]  Peter Auer,et al.  Theory and Applications of Agnostic PAC-Learning with Small Decision Trees , 1995, ICML.

[7]  Jérôme Declerck,et al.  Automatic registration and alignment on a template of cardiac stress and rest reoriented SPECT images , 1997, IEEE Transactions on Medical Imaging.

[8]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[9]  Michael I. Jordan,et al.  Neural networks , 1996, CSUR.

[10]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.

[11]  R. Pettigrew,et al.  Three-dimensional displays of left ventricular epicardial surface from standard cardiac SPECT perfusion quantification techniques. , 1995, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[12]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[13]  Pat Langley,et al.  Tractable Average-Case Analysis of Naive Bayesian Classifiers , 1999, ICML.

[14]  Gregory M. Provan,et al.  A Comparison of Induction Algorithms for Selective and non-Selective Bayesian Classifiers , 1995, ICML.

[15]  Moninder Singh,et al.  Construction of Bayesian network structures from data: A brief survey and an efficient algorithm , 1995, Int. J. Approx. Reason..

[16]  Gürsel Serpen,et al.  A novel algorithm for classification of SPECT images of a human heart , 1996, Proceedings Ninth IEEE Symposium on Computer-Based Medical Systems.

[17]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[18]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[19]  Gregory F. Cooper,et al.  A Bayesian Network Classifier that Combines a Finite Mixture Model and a NaIve Bayes Model , 1999, UAI.

[20]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[21]  S. Lauritzen Propagation of Probabilities, Means, and Variances in Mixed Graphical Association Models , 1992 .

[22]  Nir Friedman,et al.  Bayesian Network Classification with Continuous Attributes: Getting the Best of Both Discretization and Parametric Fitting , 1998, ICML.

[23]  Michael I. Jordan,et al.  Variational methods for inference and estimation in graphical models , 1997 .

[24]  Adnan Darwiche,et al.  Inference in belief networks: A procedural guide , 1996, Int. J. Approx. Reason..

[25]  Krzysztof J. Cios,et al.  On the synthesis and complexity of feedforward networks , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[26]  Ross D. Shachter Probabilistic Inference and Influence Diagrams , 1988, Oper. Res..

[27]  Wolfgang Maass,et al.  Efficient agnostic PAC-learning with simple hypothesis , 1994, COLT '94.

[28]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[29]  H. Akaike A new look at the statistical model identification , 1974 .

[30]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[31]  P. Spirtes,et al.  An Algorithm for Fast Recovery of Sparse Causal Graphs , 1991 .

[32]  Jason Catlett,et al.  On Changing Continuous Attributes into Ordered Discrete Attributes , 1991, EWSL.

[33]  N. Wermuth,et al.  Graphical and recursive models for contingency tables , 1983 .

[34]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[35]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[36]  Remco R. Bouckaert,et al.  Properties of Bayesian Belief Network Learning Algorithms , 1994, UAI.

[37]  E. Hoffman,et al.  Contour/surface registration using a physically deformable model , 1996, Proceedings of the Workshop on Mathematical Methods in Biomedical Image Analysis.

[38]  S. M. Collins,et al.  Tomographic Thallium‐201 Myocardial Perfusion Scintigrams After Maximal Coronary Artery Vasodilation with Intravenous Dipyridamole: Comparison of Qualitative and Quantitative Approaches , 1982, Circulation.

[39]  Constantin F. Aliferis,et al.  An Evaluation of an Algorithm for Inductive Learning of Bayesian Belief Networks Using Simulated Data Sets , 1994, UAI.

[40]  Krzysztof J. Cios,et al.  Application of Fuzzy Reasoning for Filtering and Enhancement of Ultrasonic Images , 1994 .

[41]  Igor Kononenko,et al.  Semi-Naive Bayesian Classifier , 1991, EWSL.

[42]  Paul J. Krause,et al.  Learning probabilistic networks , 1999, The Knowledge Engineering Review.

[43]  M. Goris,et al.  Two-dimensional mapping of three-dimensional SPECT data: a preliminary step to the quantitation of thallium myocardial perfusion single photon emission tomography. , 1987, American journal of physiologic imaging.

[44]  K.J. Cios,et al.  Issues in automating cardiac SPECT diagnosis , 2000, IEEE Engineering in Medicine and Biology Magazine.

[45]  Wai Lam,et al.  LEARNING BAYESIAN BELIEF NETWORKS: AN APPROACH BASED ON THE MDL PRINCIPLE , 1994, Comput. Intell..

[46]  K. Cios,et al.  A knowledge discovery approach to diagnosing myocardial perfusion , 2000, IEEE Engineering in Medicine and Biology Magazine.

[47]  Krzysztof J. Cios,et al.  Structured region growing and recognition algorithm for nondestructive evaluation , 1996, Optics & Photonics.

[48]  Christopher Meek,et al.  Learning Bayesian Networks with Discrete Variables from Data , 1995, KDD.

[49]  David J. Spiegelhalter,et al.  Sequential updating of conditional probabilities on directed graphical structures , 1990, Networks.

[50]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[51]  David J. Spiegelhalter,et al.  Bayesian analysis in expert systems , 1993 .

[52]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[53]  Denise Draper,et al.  Localized Partial Evaluation of Belief Networks , 1994, UAI.

[54]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[55]  Bruce D'Ambrosio,et al.  Local expression languages for probabilistic dependence , 1995, Int. J. Approx. Reason..

[56]  Henry Tirri,et al.  On Supervised Selection of Bayesian Networks , 1999, UAI.