Causal Explorer: A Causal Probabilistic Network Learning Toolkit for Biomedical Discovery

Causal Probabilistic Networks (CPNs), (a.k.a. Bayesian Networks, or Belief Networks) are well-established representations in biomedical applications such as decision support systems and predictive modeling or mining of causal hypotheses. CPNs (a) have well-developed theory for induction of causal relationships, and (b) are suitable for creating sound and practical decision support systems. While several public domain and commercial tools exist for modeling and inference with CPNs, very few software tools and libraries exist currently that give access to algorithms for CPN induction. To that end, we have developed a software library, called Causal Explorer, that implements a suit of global, local and partial CPN induction algorithms. The toolkit emphasizes causal discovery algorithms. Approximately half of the algorithms are enhanced implementations of well-established algorithms, and the remaining ones are novel local and partial algorithms that scale to thousands of variables and thus are particularly suitable for modeling in massive datasets.

[1]  Gregory F. Cooper,et al.  A temporal representation and reasoning model for medical decision-support systems , 1998 .

[2]  D E Heckerman,et al.  Toward Normative Expert Systems: Part II Probability-Based Representations for Efficient Knowledge Acquisition and Inference , 1992, Methods of Information in Medicine.

[3]  David Page,et al.  KDD Cup 2001 report , 2002, SKDD.

[4]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[5]  Sebastian Thrun,et al.  Bayesian Network Induction via Local Neighborhoods , 1999, NIPS.

[6]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[7]  Peter J. Haug,et al.  Automatic identification of pneumonia related concepts on chest x-ray reports , 1999, AMIA.

[8]  Constantin F. Aliferis,et al.  Towards Principled Feature Selection: Relevancy, Filters and Wrappers , 2003 .

[9]  P Haddawy,et al.  Efficient temporal probabilistic reasoning via context-sensitive model construction , 1997, Comput. Biol. Medicine.

[10]  Andrew P. Sage,et al.  Uncertainty in Artificial Intelligence , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  Kevin P. Murphy,et al.  Learning the Structure of Dynamic Probabilistic Networks , 1998, UAI.

[12]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Peter Haddawy,et al.  Generating Bayesian Networks from Probablity Logic Knowledge Bases , 1994, UAI.

[14]  D. Heckerman,et al.  Toward Normative Expert Systems: Part I The Pathfinder Project , 1992, Methods of Information in Medicine.

[15]  Constantin F. Aliferis,et al.  Temporal representation design principles: an assessment in the domain of liver transplantation , 1998, AMIA.

[16]  Mtw,et al.  Computation, causation, and discovery , 2000 .

[17]  Gregory F. Cooper,et al.  An Empirical Analysis of Likelihood-Weighting Simulation on a Large, Multiply-Connected Belief Network , 1991, Computers and biomedical research, an international journal.

[18]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[19]  Rina Dechter,et al.  Identifying Independencies in Causal Graphs with Feedback , 1996, UAI.

[20]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[21]  Constantin F. Aliferis,et al.  Algorithms for Large Scale Markov Blanket Discovery , 2003, FLAIRS.

[22]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[23]  Avi Pfeffer,et al.  Object-Oriented Bayesian Networks , 1997, UAI.

[24]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.