Linkage analysis without defined pedigrees

The need to collect accurate and complete pedigree information has been a drawback of family-based linkage and association studies. Even in case-control studies, investigators should be aware of, and condition on, familial relationships. In single nucleotide polymorphism (SNP) genome scans, relatedness can be inferred directly from the genetic data rather than determined through interviews. Various methods of estimating relatedness have previously been implemented, most notably in PLINK. We present new fast and accurate algorithms for estimating global and local kinship coefficients from dense SNP genotypes. These algorithms require only a single pass through the SNP genotype data. We also show that these estimates can be used to cluster individuals into pedigrees. With these estimates in hand, quantitative trait locus linkage analysis proceeds via traditional variance components methods without any prior relationship information. We demonstrate the success of our algorithms on simulated and real data sets. Our procedures make linkage analysis as easy as a typical genome-wide association study. Genet. Epidemiol. 35:360-370, 2011. © 2011 Wiley-Liss, Inc.
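To make the idea of estimating global kinship from SNP genotypes and then grouping individuals into pedigrees concrete, here is a minimal sketch. It is not the algorithm of the paper, which the abstract does not specify. It assumes a generic method-of-moments estimator (the average over SNPs of (x_i - 2p)(x_j - 2p) / (4p(1 - p)), with x an allele count and p an allele frequency) and a simple threshold-plus-connected-components clustering. The function names, the genotype coding (0, 1, or 2 copies of a reference allele), and the 0.1 threshold are all illustrative assumptions.

```python
from itertools import combinations


def allele_frequencies(genotypes):
    """Estimate the counted-allele frequency at each SNP.

    genotypes: one list per individual of allele counts (0, 1, or 2).
    Assumption: frequencies are taken from the sample itself.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    return [sum(ind[k] for ind in genotypes) / (2 * n) for k in range(m)]


def kinship(gi, gj, freqs):
    """Method-of-moments global kinship estimate for one pair.

    Averages (x_i - 2p)(x_j - 2p) / (4p(1 - p)) over polymorphic SNPs.
    Monomorphic SNPs (p equal to 0 or 1) carry no information and are skipped.
    """
    total, used = 0.0, 0
    for xi, xj, p in zip(gi, gj, freqs):
        if 0.0 < p < 1.0:
            total += (xi - 2 * p) * (xj - 2 * p) / (4 * p * (1 - p))
            used += 1
    return total / used if used else 0.0


def cluster_pedigrees(genotypes, threshold=0.1):
    """Group individuals whose pairwise kinship exceeds a threshold.

    Uses union-find so that relatives of relatives end up in one cluster.
    The threshold is a hypothetical tuning value, not taken from the paper.
    """
    freqs = allele_frequencies(genotypes)
    parent = list(range(len(genotypes)))

    def find(a):
        # Follow parent links to the root, shortening the path as we go.
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    for i, j in combinations(range(len(genotypes)), 2):
        if kinship(genotypes[i], genotypes[j], freqs) > threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(genotypes)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

This sketch compares all pairs and rescans the genotypes for each one, so it does not reproduce the single-pass property claimed in the abstract. Estimating allele frequencies from a sample that contains close relatives also biases the kinship values, which a real analysis would need to address.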
