strum: an R package for structural modeling of latent variables for general pedigrees

BackgroundStructural equation modeling (SEM) is an extremely general and powerful approach to account for measurement error and causal pathways when analyzing data, and it has been used in wide range of applied sciences. There are many commercial and freely available software packages for SEM. However, it is difficult to use any of the packages to analyze general pedigree data, and SEM packages for genetics are limited in their application.ResultsWe present the new R package strum to serve the need of a suitable SEM software tool for genetic analysis. It implements a general framework for SEM within the context of general pedigree data. This context requires specialized considerations such as familial correlations and ascertainment. Our package is an extraordinarily flexible tool capable of modeling genetic association, linkage analysis, polygenic effects, shared environment, and ascertainment combined with confirmatory factor analysis and general SEM. It also provides a convenient tool for model visualization, and integrates tools for simulating pedigree data. The various features of this package are tested through a simulation study to evaluate performance, and our results show that strum is very reliable and robust in terms of the accuracy and coverage of parameter estimates.Conclusionsstrum is a valuable new tool for genetic analysis. It can be easily used with general pedigree data, incorporating both measurement and structural models, giving it some significant advantages over other software packages. It also includes a built-in approach for handling ascertainment, a helpful integrated tool for genetic data simulation, and built-in tools for model visualization, providing a significant addition to biomedical research.

[1]  Jeffrey R. Spies,et al.  OpenMx - Advanced Structural Equation Modeling , 2011 .

[2]  Shuying Sun,et al.  Statistical human genetics : methods and protocols , 2012 .

[3]  Bengt Muthén,et al.  Technical aspects of Muthén's liscomp approach to estimation of latent variable relations with a comprehensive measurement model , 1995 .

[4]  Andreas M. Brandmaier,et al.  Structural Equation Modeling With Ωnyx , 2015 .

[5]  C. Pipper,et al.  [''R"--project for statistical computing]. , 2008, Ugeskrift for laeger.

[6]  Albert Satorra,et al.  A scaled difference chi-square test statistic for moment structure analysis , 1999 .

[7]  R. Elston,et al.  Calculating Asymptotic Significance Levels of the Constrained Likelihood Ratio Test with Application to Multivariate Genetic Linkage Analysis , 2009, Statistical applications in genetics and molecular biology.

[8]  C. Stein,et al.  Structural equation modeling. , 2012, Methods in molecular biology.

[9]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[10]  Andreas Ritter,et al.  Structural Equations With Latent Variables , 2016 .

[11]  F. Agakov,et al.  Abundant pleiotropy in human complex diseases and traits. , 2011, American journal of human genetics.

[12]  A. Thapar,et al.  Methodology for Genetic Studies of Twins and Families , 1993 .

[13]  R. Elston,et al.  Ascertainment adjustment: where does it take us? , 2000, American journal of human genetics.

[14]  B. Muthén A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators , 1984 .

[15]  Robert C. Elston,et al.  A Framework for Structural Equation Models in General Pedigrees , 2011, Human Heredity.

[16]  C. Amos Robust variance-components approach for assessing genetic linkage in pedigrees. , 1994, American journal of human genetics.

[17]  N. Laird,et al.  Fitting ACE structural equation models to case‐control family data , 2009, Genetic epidemiology.

[18]  Yves Rosseel,et al.  lavaan: An R Package for Structural Equation Modeling , 2012 .

[19]  John Fox,et al.  TEACHER'S CORNER: Structural Equation Modeling With the sem Package in R , 2006 .

[20]  C Kooperberg,et al.  The use of phenome‐wide association studies (PheWAS) for exploration of novel genotype‐phenotype relationships and pleiotropy discovery , 2011, Genetic epidemiology.

[21]  Karl G. Jöreskog,et al.  Lisrel 8: User's Reference Guide , 1997 .

[22]  A. Narayanan,et al.  A Review of Eight Software Packages for Structural Equation Modeling , 2012 .

[23]  B. Muthén,et al.  Robust inference using weighted least squares and quadratic estimating equations in latent variable modeling with categorical and continuous outcomes , 1997 .

[24]  Peter M. Bentler,et al.  EQS : structural equations program manual , 1989 .

[25]  Esben Budtz-Jørgensen,et al.  Linear latent variable models: the lava-package , 2012, Comput. Stat..