Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm

We consider the PC-algorithm (Spirtes et al., 2000) for estimating the skeleton and equivalence class of a very high-dimensional directed acyclic graph (DAG) with corresponding Gaussian distribution. The PC-algorithm is computationally feasible and often very fast for sparse problems with many nodes (variables), and it has the attractive property of automatically achieving high computational efficiency as a function of the sparseness of the true underlying DAG. We prove uniform consistency of the algorithm for very high-dimensional, sparse DAGs, where the number of nodes is allowed to grow quickly with the sample size n, as fast as O(n^a) for any 0 < a < ∞. The sparseness assumption is rather minimal, requiring only that the neighborhoods in the DAG are of lower order than the sample size n. We also demonstrate the PC-algorithm on simulated data.
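The skeleton-estimation step of the PC-algorithm described above can be sketched in a few lines. This is a minimal illustrative implementation, not the authors' code: for Gaussian data the conditional-independence tests are the standard ones based on sample partial correlations and Fisher's z-transform; the function names and the significance level `alpha` are our own choices. Starting from the complete graph, edges are deleted as conditioning sets of growing size render a partial correlation insignificant, which is what ties the running time to the sparseness of the true DAG.

```python
import itertools
import math
from statistics import NormalDist

import numpy as np


def partial_corr(corr, i, j, S):
    """Sample partial correlation of variables i and j given the set S,
    read off from the inverse of the relevant correlation submatrix."""
    idx = [i, j] + list(S)
    prec = np.linalg.pinv(corr[np.ix_(idx, idx)])
    return -prec[0, 1] / math.sqrt(prec[0, 0] * prec[1, 1])


def pc_skeleton(data, alpha=0.01):
    """Estimate the skeleton of a DAG from Gaussian data (n x p array).

    Start from the complete graph; for conditioning sets S of growing
    size ell, delete edge i-j as soon as some S within adj(i) without j
    makes the partial correlation insignificant (Fisher z test)."""
    n, p = data.shape
    corr = np.corrcoef(data, rowvar=False)
    crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)  # two-sided threshold
    adj = {i: set(range(p)) - {i} for i in range(p)}
    ell = 0
    while any(len(adj[i]) - 1 >= ell for i in range(p)):
        for i in range(p):
            for j in sorted(adj[i]):
                if j not in adj[i]:  # edge may already have been removed
                    continue
                for S in itertools.combinations(sorted(adj[i] - {j}), ell):
                    r = partial_corr(corr, i, j, S)
                    r = min(max(r, -0.9999999), 0.9999999)
                    # Fisher z-transform: sqrt(n - |S| - 3) * atanh(r)
                    # is approximately N(0, 1) under independence.
                    z = math.sqrt(n - ell - 3) * math.atanh(r)
                    if abs(z) <= crit:
                        adj[i].discard(j)
                        adj[j].discard(i)
                        break
        ell += 1
    return adj
```

On data simulated from a chain X1 → X2 → X3, for instance, the estimated skeleton should keep the edges 1–2 and 2–3 and drop 1–3 once the algorithm conditions on X2; neighborhoods never grow, so only small conditioning sets are ever tested, mirroring the efficiency-from-sparsity property noted in the abstract.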

[1]  R. Neapolitan  Learning Bayesian networks, 2007, KDD '07.

[2]  Peng Zhao, et al.  On Model Selection Consistency of Lasso, 2006, J. Mach. Learn. Res.

[3]  Constantin F. Aliferis, et al.  The max-min hill-climbing Bayesian network structure learning algorithm, 2006, Machine Learning.

[4]  Eytan Domany, et al.  On the Number of Samples Needed to Learn the Correct Structure of a Bayesian Network, 2006, UAI.

[5]  N. Meinshausen, et al.  High-dimensional graphs and variable selection with the Lasso, 2006, math/0608017.

[6]  Anna Goldenberg, et al.  Tractable learning of large Bayes net structures from sparse data, 2004, ICML.

[7]  J. Robins, et al.  Uniform consistency in causal inference, 2003.

[8]  P. Spirtes, et al.  Causation, Prediction, and Search, 2nd ed., 2000, MIT Press.

[9]  David Maxwell Chickering, et al.  Optimal Structure Identification With Greedy Search, 2003, J. Mach. Learn. Res.

[10]  Jiji Zhang, et al.  Strong Faithfulness and Uniform Consistency in Causal Inference, 2002, UAI.

[11]  D. Edwards  Introduction to Graphical Modelling, 2000, Springer.

[12]  Michael D. Perlman, et al.  Enumerating Markov Equivalence Classes of Acyclic Digraph Models, 2001, UAI.

[13]  Andrew Y. Ng, et al.  On Feature Selection: Learning with Exponentially Many Irrelevant Features as Training Examples, 1998, ICML.

[14]  David Maxwell Chickering, et al.  Learning Equivalence Classes of Bayesian Network Structures, 1996, UAI.

[15]  Christopher Meek, et al.  Strong completeness and faithfulness in Bayesian networks, 1995, UAI.

[16]  Christopher Meek, et al.  Causal inference and causal explanation with background knowledge, 1995, UAI.

[17]  D. Geiger, et al.  Learning Bayesian networks: The combination of knowledge and statistical data, 1994, Machine Learning.

[18]  David J. Spiegelhalter, et al.  Bayesian analysis in expert systems, 1993.

[19]  Judea Pearl, et al.  Equivalence and Synthesis of Causal Models, 1990, UAI.

[20]  C. Chow, et al.  Approximating discrete probability distributions with dependence trees, 1968, IEEE Trans. Inf. Theory.

[21]  H. Hotelling  New Light on the Correlation Coefficient and its Transforms, 1953.

[22]  T. W. Anderson  An Introduction to Multivariate Statistical Analysis, 2003, Wiley.

[23]  M. Tarsi, et al.  A simple algorithm to construct a consistent extension of a partially oriented graph, 1992.

[24]  Judea Pearl, et al.  A Theory of Inferred Causation, 1991, KR.

[25]  F. Harary  New directions in the theory of graphs, 1973.

[26]  C. Quensel  The distribution of the partial correlation coefficient in samples from multivariate universes in a special case of non-normally distributed random variables, 1953.

[27]  R. Fisher  The Distribution of the Partial Correlation Coefficient, 1924.