On Penalty Parameter Selection for Estimating Network Models

Network models are gaining popularity as a way to estimate direct effects among psychological variables and investigate the structure of constructs. A key feature of network estimation is determining which edges are likely to be non-zero. In psychology, this is commonly achieved through the graphical lasso regularization method that estimates a precision matrix of Gaussian variables using an [Formula: see text]-penalty to push small values to zero. A tuning parameter, λ, controls the sparsity of the network. There are many methods to select λ, which can lead to vastly different graphs. The most common approach in psychological network applications is to minimize the extended Bayesian information criterion, but the consistency of this method for model selection has primarily been examined in high dimensional settings (i.e., n < p) that are uncommon in psychology. Further, there is some evidence that alternative selection methods may have superior performance. Here, using simulation, we compare four different methods for selecting λ, including the stability approach to regularization selection (StARS), K-fold cross-validation, the rotation information criterion (RIC), and the extended Bayesian information criterion (EBIC). Our results demonstrate that penalty parameter selection should be made based on data characteristics and the inferential goal (e.g., to increase sensitivity versus to avoid false positives). We end with recommendations for selecting the penalty parameter when using the graphical lasso.

[1]  R. McNally,et al.  Can network analysis transform psychopathology? , 2016, Behaviour research and therapy.

[2]  Verena D. Schmittmann,et al.  Qgraph: Network visualizations of relationships in psychometric data , 2012 .

[3]  Daniel J. McDonald,et al.  Risk-consistency of cross-validation with lasso-type procedures , 2013, 1308.0810.

[4]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[5]  Ping Zhang Model Selection Via Multifold Cross Validation , 1993 .

[6]  Zehua Chen,et al.  EXTENDED BIC FOR SMALL-n-LARGE-P SPARSE GLM , 2012 .

[7]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[8]  D. Borsboom,et al.  State of the aRt personality research: A tutorial on network analysis of personality data in R , 2015 .

[9]  Larry A. Wasserman,et al.  Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models , 2010, NIPS.

[10]  Yunan Zhu,et al.  Sparse Graphical Models for Functional Connectivity Networks: Best Methods and the Autocorrelation Issue , 2017, bioRxiv.

[11]  Donald R. Williams,et al.  On Nonregularized Estimation of Psychological Networks , 2019, Multivariate behavioral research.

[12]  Abdolreza Mohammadi,et al.  BDgraph: An R Package for Bayesian Structure Learning in Graphical Models , 2015, Journal of Statistical Software.

[13]  Victor Chernozhukov,et al.  On cross-validated Lasso , 2016 .

[14]  John S. J. Hsu,et al.  Bayesian estimation of a covariance matrix with flexible prior specification , 2012 .

[15]  D. Borsboom A network theory of mental disorders , 2017, World psychiatry : official journal of the World Psychiatric Association.

[16]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[17]  Denny Borsboom,et al.  Network Psychometrics. , 2016, 1609.02818.

[18]  Eiko I. Fried,et al.  Network analysis of depression and anxiety symptom relationships in a psychiatric sample , 2016, Psychological Medicine.

[19]  Mikko J Sillanpää,et al.  Use of Wishart Prior and Simple Extensions for Sparse Precision Matrix Estimation , 2016, PloS one.

[20]  Donald R. Williams,et al.  Back to the basics: Rethinking partial correlation network methodology. , 2020, The British journal of mathematical and statistical psychology.

[21]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[22]  Wenjiang J. Fu,et al.  Asymptotics for lasso-type estimators , 2000 .

[23]  Larry A. Wasserman,et al.  The huge Package for High-dimensional Undirected Graph Estimation in R , 2012, J. Mach. Learn. Res..

[24]  Daniel J. McDonald,et al.  Leave-one-out cross-validation is risk consistent for lasso , 2012, Machine Learning.

[25]  Weidong Liu Gaussian graphical model estimation with false discovery rate control , 2013, 1306.0976.

[26]  Yang Feng,et al.  Modified Cross-Validation for Penalized High-Dimensional Linear Regression Models , 2013, 1309.2068.

[27]  J. Lafferty,et al.  High-dimensional Ising model selection using ℓ1-regularized logistic regression , 2010, 1010.0311.

[28]  T. Yarkoni,et al.  Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning , 2017, Perspectives on psychological science : a journal of the Association for Psychological Science.

[29]  Claudia D van Borkulo,et al.  The application of a network approach to Health-Related Quality of Life (HRQoL): introducing a new method for assessing HRQoL in healthy adults and cancer patients , 2015, Quality of Life Research.

[30]  Mikko J. Sillanpää,et al.  Estimation of covariance and precision matrix, network structure, and a view toward systems biology , 2017 .

[31]  Eiko I. Fried,et al.  A Tutorial on Regularized Partial Correlation Networks , 2016, Psychological methods.

[32]  A. Mohammadi,et al.  Bayesian Structure Learning in Sparse Gaussian Graphical Models , 2012, 1210.5371.

[33]  R. Tibshirani,et al.  Sparse estimation of a covariance matrix. , 2011, Biometrika.

[34]  P. Linkowski,et al.  Network analysis of empathy items from the interpersonal reactivity index in 1973 young adults , 2018, Psychiatry Research.

[35]  Roger W. Johnson,et al.  An Introduction to the Bootstrap , 2001 .

[36]  Denny Borsboom,et al.  Mental Disorders as Causal Systems , 2015 .

[37]  Peng Zhao,et al.  On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[38]  A. Raftery Bayesian Model Selection in Social Research , 1995 .

[39]  S. Epskamp Brief Report on Estimating Regularized Gaussian Networks from Continuous and Ordinal Data , 2016, 1606.05771.

[40]  A. Adan,et al.  Network analysis of multiple risk factors for mental health in young Colombian adults , 2019, Journal of mental health.

[41]  Richard A. Bryant,et al.  Acute and Chronic Posttraumatic Stress Symptoms in the Emergence of Posttraumatic Stress Disorder: A Network Analysis , 2017, JAMA psychiatry.

[42]  D. Borsboom,et al.  Network analysis: an integrative approach to the structure of psychopathology. , 2013, Annual review of clinical psychology.