Estimating heritability of complex traits from genome-wide association studies using IBS-based Haseman–Elston regression

Exploring heritability of complex traits is a central focus of statistical genetics. Among various previously proposed methods to estimate heritability, variance component methods are advantageous when estimating heritability using markers. Due to the high-dimensional nature of data obtained from genome-wide association studies (GWAS) in which genetic architecture is often unknown, the most appropriate heritability estimator model is often unclear. The Haseman–Elston (HE) regression is a variance component method that was initially only proposed for linkage studies. However, this study presents a theoretical basis for a modified HE that models linkage disequilibrium for a quantitative trait, and consequently can be used for GWAS. After replacing identical by descent (IBD) scores with identity by state (IBS) scores, we applied the IBS-based HE regression to single-marker association studies (scenario I) and estimated the variance component using multiple markers (scenario II). In scenario II, we discuss the circumstances in which the HE regression and the mixed linear model are equivalent; the disparity between these two methods is observed when a covariance component exists for the additive variance. When we extended the IBS-based HE regression to case-control studies in a subsequent simulation study, we found that it provided a nearly unbiased estimate of heritability, more precise than that estimated via the mixed linear model. Thus, for the case-control scenario, the HE regression is preferable. GEnetic Analysis Repository (GEAR; http://sourceforge.net/p/gbchen/wiki/GEAR/) software implemented the HE regression method and is freely available.

[1]  J. Ott Statistical properties of the haplotype relative risk , 1989, Genetic epidemiology.

[2]  P. Visscher,et al.  Estimation and partition of heritability in human populations using whole-genome analysis methods. , 2013, Annual review of genetics.

[3]  R. Elston,et al.  The investigation of linkage between a quantitative trait and a marker locus , 1972, Behavior genetics.

[4]  Jennifer R. Harris,et al.  Combined Genome Scans for Body Stature in 6,602 European Twins: Evidence for Common Caucasian Loci , 2007, PLoS genetics.

[5]  Doug Speed,et al.  Improved heritability estimation from genome-wide SNPs. , 2012, American journal of human genetics.

[6]  P. Visscher,et al.  Estimating missing heritability for disease from genome-wide association studies. , 2011, American journal of human genetics.

[7]  P. Visscher,et al.  Reconciling the analysis of IBD and IBS in complex trait studies , 2010, Nature Reviews Genetics.

[8]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[9]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.

[10]  B S Weir,et al.  Variation in actual relationship as a consequence of Mendelian sampling and linkage. , 2011, Genetics research.

[11]  K. Lange The affected sib-pair method using identity by state relations. , 1986, American journal of human genetics.

[12]  W. Ewens,et al.  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). , 1993, American journal of human genetics.

[13]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[14]  Shaun Purcell,et al.  Powerful regression-based quantitative-trait linkage analysis of general pedigrees. , 2002, American journal of human genetics.

[15]  E. Dempster,et al.  Heritability of Threshold Characters. , 1950, Genetics.

[16]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[17]  K. Lange,et al.  The affected-pedigree-member method of linkage analysis. , 1988, American journal of human genetics.

[18]  D. Bishop,et al.  The power of identity-by-state methods for linkage analysis. , 1990, American journal of human genetics.

[19]  Kermit Ritland,et al.  Estimators for pairwise relatedness and individual inbreeding coefficients , 1996 .

[20]  N. Risch,et al.  A comparison of linkage disequilibrium measures for fine-scale mapping. , 1995, Genomics.

[21]  W. G. Hill,et al.  Linkage disequilibrium in finite populations , 1968, Theoretical and Applied Genetics.

[22]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[23]  K. Lange A test statistic for the affected‐sib‐set method , 1986, Annals of human genetics.

[24]  Manuel A. R. Ferreira,et al.  Assumption-Free Estimation of Heritability from Genome-Wide Identity-by-Descent Sharing between Full Siblings , 2006, PLoS genetics.

[25]  Mark I McCarthy,et al.  Genomic inflation factors under polygenic inheritance , 2011, European Journal of Human Genetics.

[26]  N. Wray Allele frequencies and the r2 measure of linkage disequilibrium: impact on design and interpretation of association studies. , 2005, Twin research and human genetics : the official journal of the International Society for Twin Studies.

[27]  Stephan Ripke,et al.  Estimation of SNP heritability from dense genotype data. , 2013, American journal of human genetics.

[28]  Alkes L. Price,et al.  Quantifying Missing Heritability at Known GWAS Loci , 2013, PLoS genetics.

[29]  R. Elston,et al.  Possible linkage relationships between certain blood groups and schizophrenia or other psychoses , 1973, Behavior genetics.

[30]  S Purcell,et al.  Equivalence between Haseman-Elston and variance-components linkage analyses for sib pairs. , 2001, American journal of human genetics.

[31]  W. Ewens Genetics and analysis of quantitative traits , 1999 .

[32]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[33]  D. Falconer The inheritance of liability to certain diseases, estimated from the incidence among relatives , 1965 .