Structure detection of semiparametric structural equation models with Bayesian adaptive group lasso

Structural equation models (SEMs) are widely recognized as the most important statistical tool for assessing the interrelationships among latent variables. This study develops a Bayesian adaptive group least absolute shrinkage and selection operator procedure to perform simultaneous model selection and estimation for semiparametric SEMs, wherein the structural equation is formulated using the additive nonparametric functions of observed and latent variables. We propose the use of basis expansions to approximate the unknown functions. By introducing adaptive penalties to the groups of basis expansions, the nonlinear, linear, or non-existent effects of observed and latent variables in the structural equation can be automatically detected. A simulation study demonstrates that the proposed method performs satisfactorily. This paper presents an application of revealing the observed and latent risk factors of diabetic kidney disease.

[1]  Sik-Yum Lee,et al.  Bayesian analysis of structural equation models with multinomial variables and an application to type 2 diabetic nephropathy , 2007, Statistics in medicine.

[2]  Chenlei Leng,et al.  Bayesian adaptive Lasso , 2010, Annals of the Institute of Statistical Mathematics.

[3]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[4]  Sik-Yum Lee,et al.  Structural equation modelling: A Bayesian approach. , 2007 .

[5]  L. Fahrmeir,et al.  A Bayesian Semiparametric Latent Variable Model for Mixed Responses , 2007 .

[6]  Edward H. Ip,et al.  A Bayesian Modeling Approach for Generalized Semiparametric Structural Equation Models , 2013, Psychometrika.

[7]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  M. Ng,et al.  Phenotype–genotype interactions on renal function in type 2 diabetes: an analysis using structural equation modelling , 2009, Diabetologia.

[9]  G. Casella,et al.  Penalized regression, standard errors, and Bayesian lassos , 2010 .

[10]  Esben Budtz-Jørgensen,et al.  Structural Equation Models , 2005 .

[11]  Wenjiang J. Fu Penalized Regressions: The Bridge versus the Lasso , 1998 .

[12]  C. Schwarz,et al.  Smoothing Population Size Estimates for Time‐Stratified Mark–Recapture Experiments Using Bayesian P‐Splines , 2011, Biometrics.

[13]  Daniel J. Bauer A Semiparametric Approach to Modeling Nonlinear Relations Among Latent Variables , 2005 .

[14]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[15]  Francis R. Bach,et al.  Consistency of the group Lasso and multiple kernel learning , 2007, J. Mach. Learn. Res..

[16]  M. Sillanpää,et al.  Bayesian mapping of genotype × expression interactions in quantitative and qualitative traits , 2006, Heredity.

[17]  D. A. Kenny,et al.  Estimating the nonlinear and interactive effects of latent variables. , 1984 .

[18]  R. Tibshirani,et al.  On the “degrees of freedom” of the lasso , 2007, 0712.0881.

[19]  Xiangnan Feng,et al.  Latent variable models with nonparametric interaction effects of latent variables , 2014, Statistics in medicine.

[20]  J. Tuomilehto,et al.  The metabolic syndrome defined by factor analysis and incident type 2 diabetes in a chinese population with high postprandial glucose. , 2004, Diabetes care.

[21]  Hongtu Zhu,et al.  Bayesian Lasso for Semiparametric Structural Equation Models , 2012, Biometrics.

[22]  Sara van de Geer,et al.  Statistics for High-Dimensional Data , 2011 .

[23]  Chih-Ling Tsai,et al.  Regression coefficient and autoregressive order shrinkage and selection via the lasso , 2007 .

[24]  R. Kass,et al.  Bayesian curve-fitting with free-knot splines , 2001 .

[25]  B. Tomlinson,et al.  Factor analysis of the metabolic syndrome: obesity vs insulin resistance as the central abnormality , 2001, International Journal of Obesity.

[26]  Xinyuan Song,et al.  Semiparametric Latent Variable Models With Bayesian P-Splines , 2010 .

[27]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[28]  Hansheng Wang,et al.  Computational Statistics and Data Analysis a Note on Adaptive Group Lasso , 2022 .

[29]  S Y Lee,et al.  Latent variable models with mixed continuous and polytomous data , 2001, Biometrics.

[30]  Xin-Yuan Song,et al.  Bayesian model selection for mixtures of structural equation models with an unknown number of components. , 2003, The British journal of mathematical and statistical psychology.

[31]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[32]  Xinyuan Song,et al.  Bayesian Analysis of the Functional-Coefficient Autoregressive Heteroscedastic Model , 2014 .

[33]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[34]  Dries F. Benoit,et al.  Bayesian adaptive Lasso quantile regression , 2012 .

[35]  David B Dunson,et al.  Bayesian Modeling of Time‐Varying and Waning Exposure Effects , 2003, Biometrics.

[36]  Sik-Yum Lee,et al.  On Bayesian estimation and model comparison of an integrated structural equation model , 2008, Comput. Stat. Data Anal..

[37]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[38]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[39]  B. Brenner,et al.  The risk of developing end-stage renal disease in patients with type 2 diabetes and nephropathy: the RENAAL study. , 2003, Kidney international.

[40]  Volker Roth,et al.  The Bayesian group-Lasso for analyzing contingency tables , 2009, ICML '09.

[41]  G. Bakris,et al.  Preserving renal function in adults with hypertension and diabetes: a consensus approach. National Kidney Foundation Hypertension and Diabetes Executive Committees Working Group. , 2000, American journal of kidney diseases : the official journal of the National Kidney Foundation.