Association statistics under the PPL framework

In this paper, we extend the PPL framework to the analysis of case‐control (CC) data and introduce three new linkage disequilibrium (LD) statistics. These statistics measure the evidence for or against LD, rather than testing the null hypothesis of no LD, and they therefore avoid the need for multiple testing corrections. They are suitable not only for CC designs but also can be used in application to family data, ranging from trios to complex pedigrees, all under the same statistical framework, allowing for the seamless analysis of disparate data structures. They also provide other core advantages of the PPL framework, including the use of sequential updating to accumulate LD evidence across potentially heterogeneous sets or subsets of data; parameterization in terms of a very general trait likelihood, which simultaneously considers dominant, recessive, and additive models; and a straightforward mechanism for modeling two‐locus epistasis. Finally, by implementing the new statistics within the PPL framework, we have a ready mechanism for incorporating linkage information, obtained from distinct data, into LD analyses in the form of a prior distribution. Here we examine the performance of the proposed LD statistics using simulated data, as well as assessing the effects of key modeling violations on this performance. Genet. Epidemiol. 34:835‐845, 2010. © 2010 Wiley‐Liss, Inc.

[1]  Where’s the Evidence? , 2011, Human Heredity.

[2]  Steven Buyske,et al.  Identification of a schizophrenia-associated functional noncoding variant in NOS1AP. , 2009, The American journal of psychiatry.

[3]  Edwin Liu,et al.  Type 1 diabetes: recent developments , 2004, BMJ : British Medical Journal.

[4]  Veronica J Vieland,et al.  Thermometers: Something for Statistical Geneticists to Think about , 2006, Human Heredity.

[5]  C. A. Smith,et al.  Testing for heterogeneity of recombination fraction values in Human Genetics , 1963, Annals of human genetics.

[6]  Ellen M Wijsman,et al.  A score for Bayesian genome screening , 2003, Genetic epidemiology.

[7]  Jian Huang,et al.  The Posterior Probability of Linkage Allowing for Linkage Disequilibrium and a New Estimate of Disequilibrium between a Trait and a Marker , 2005, Human Heredity.

[8]  S A Seuchter,et al.  Linkage analysis in nuclear families. 1: Optimality criteria for affected sib-pair tests. , 1994, Human heredity.

[9]  Veronica J. Vieland,et al.  The Incorporation of Prior Genomic Information Does Not Necessarily Improve the Performance of Bayesian Linkage Methods: An Example Involving Sex-Specific Recombination and the Two-Point PPL , 2006, Human Heredity.

[10]  G. Lathrop,et al.  Easy calculations of lod scores and genetic risks on small computers. , 1984, American journal of human genetics.

[11]  N J Cox,et al.  Allele-sharing models: LOD scores and accurate linkage tests. , 1997, American journal of human genetics.

[12]  R. Elston,et al.  A general model for the genetic analysis of pedigree data. , 1971, Human heredity.

[13]  J. Kaprio,et al.  Genetic liability of type 1 diabetes and the onset age among 22,650 young Finnish twin pairs: a nationwide follow-up study. , 2003, Diabetes.

[14]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[15]  A. Gupta,et al.  A Bayesian Approach to , 1997 .

[16]  R. Lewontin The Interaction of Selection and Linkage. I. General Considerations; Heterotic Models. , 1964, Genetics.

[17]  R. Elston,et al.  Detecting rare variants for complex traits using family and unrelated data , 2010, Genetic epidemiology.

[18]  N. Laird,et al.  Family-based designs in the age of large-scale gene-association studies , 2006, Nature Reviews Genetics.

[19]  C. Bonaïti‐pellié,et al.  Effects of misspecifying genetic parameters in lod score analysis. , 1986, Biometrics.

[20]  Larry Wasserman,et al.  Using linkage genome scans to improve power of association in genome scans. , 2006, American journal of human genetics.

[21]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[22]  N. Laird,et al.  Meta-analysis in clinical trials. , 1986, Controlled clinical trials.

[23]  Adrian Vella,et al.  Localization of a type 1 diabetes locus in the IL2RA/CD25 region by use of tag single-nucleotide polymorphisms. , 2005, American journal of human genetics.

[24]  Alberto Maria Segre,et al.  MLIP: using multiple processors to compute the posterior probability of linkage , 2008, BMC Bioinformatics.

[25]  Xin Xu,et al.  Implementing a unified approach to family‐based tests of association , 2000, Genetic epidemiology.

[26]  Nunzio Bottini,et al.  A functional variant of lymphoid tyrosine phosphatase is associated with type I diabetes , 2004, Nature Genetics.

[27]  D. Weeks,et al.  Two-locus models of disease: comparison of likelihood and nonparametric linkage methods. , 1993, American journal of human genetics.

[28]  C. Lewis,et al.  A method for meta‐analysis of genome searches: Application to simulated data , 1999, Genetic epidemiology.

[29]  Jian Huang,et al.  Power to Detect Linkage Based on Multiple Sets of Data in the Presence of Locus Heterogeneity: Comparative Evaluation of Model-Based Linkage Methods for Affected Sib Pair Data , 2001, Human Heredity.

[30]  P. Visscher,et al.  Family-based genome-wide association studies. , 2009, Pharmacogenomics.

[31]  David J Balding,et al.  Logistic regression protects against population structure in genetic association studies. , 2005, Genome research.

[32]  J. Ioannidis Calibration of credibility of agnostic genome‐wide associations , 2008, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[33]  W J Ewens,et al.  A resolution of the ascertainment sampling problem. I. Theory. , 1986, Theoretical population biology.

[34]  A. Morris,et al.  Data quality control in genetic case-control association studies , 2010, Nature Protocols.

[35]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[36]  C. Falk,et al.  Haplotype relative risks: an easy reliable way to construct a proper control sample for risk calculations , 1987, Annals of human genetics.

[37]  D. Greenberg,et al.  Inferring mode of inheritance by comparison of lod scores. , 1989, American journal of medical genetics.

[38]  Adrian Vella,et al.  Replication of an association between the lymphoid tyrosine phosphatase locus (LYP/PTPN22) with type 1 diabetes, and evidence for its role as a general autoimmunity locus. , 2004, Diabetes.

[39]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[40]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[41]  D R Goldstein,et al.  Meta‐analysis by combining parameter estimates: Simulated linkage studies , 1999, Genetic epidemiology.

[42]  Veronica J Vieland,et al.  Bayesian analysis of a previously published genome screen for panic disorder reveals new and compelling evidence for linkage to chromosome 7 , 2003, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[43]  J. Todd,et al.  The CTLA-4 gene region of chromosome 2q33 is linked to, and associated with, type 1 diabetes. Belgian Diabetes Registry. , 1996, Human molecular genetics.

[44]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[45]  J. Todd,et al.  A correlation between the relative predisposition of MHC class II alleles to type 1 diabetes and the structure of their proteins. , 2001, Human molecular genetics.

[46]  P. Gorroochurn Perils in the Use of Linkage Disequilibrium for Fine Gene Mapping: Simple Insights from Population Genetics , 2008, Cancer Epidemiology Biomarkers & Prevention.

[47]  Gonçalo R. Abecasis,et al.  PEDSTATS: descriptive statistics, graphics and quality assessment for gene mapping data , 2005, Bioinform..

[48]  D. Clayton,et al.  A genome-wide association study of nonsynonymous SNPs identifies a type 1 diabetes locus in the interferon-induced helicase (IFIH1) region , 2006, Nature Genetics.

[49]  Albert D. G. de Roos,et al.  Origins of introns based on the definition of exon modules and their conserved interfaces , 2005, Bioinform..

[50]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[51]  J. O’Connell,et al.  An optimal algorithm for automatic genotype elimination. , 1999, American journal of human genetics.

[52]  D. Rao,et al.  An overview of the genetic dissection of complex traits. , 2008, Advances in genetics.

[53]  S A Seuchter,et al.  Linkage analysis in nuclear families. 2: Relationship between affected sib-pair tests and lod score analysis. , 1994, Human heredity.

[54]  S Richardson,et al.  A Bayesian approach to multipoint mapping in nuclear families , 1997, Genetic epidemiology.

[55]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[56]  Jian Huang,et al.  Comparison of ‘Model-Free’ and ‘Model-Based’ Linkage Statistics in the Presence of Locus Heterogeneity: Single Data Set and Multiple Data Set Applications , 2001, Human Heredity.

[57]  Jian Huang,et al.  A bayesian approach to replication of linkage findings , 1999, Genetic epidemiology.

[58]  K. Eguchi,et al.  Identification and Functional Analysis of Mutations in the Hepatocyte Nuclear Factor-1α Gene in Anti-Islet Autoantibody-Negative Japanese Patients with Type 1 Diabetes , 2000 .

[59]  Ellen M Wijsman,et al.  Joint oligogenic segregation and linkage analysis using bayesian Markov chain Monte Carlo methods , 2004, Molecular biotechnology.

[60]  Helen Schuilenburg,et al.  Genome-wide association study and meta-analysis finds over 40 loci affect risk of type 1 diabetes , 2009, Nature Genetics.

[61]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[62]  J A Badner,et al.  Meta‐analysis of linkage studies , 1999, Genetic epidemiology.

[63]  Alberto Maria Segre,et al.  Exploiting gene × gene interaction in linkage analysis , 2007, BMC proceedings.

[64]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[65]  J. Witte,et al.  Genetic dissection of complex traits. , 1994, Nature genetics.

[66]  Jon Wakefield,et al.  Bayes factors for genome‐wide association studies: comparison with P‐values , 2009, Genetic epidemiology.

[67]  M. Daly,et al.  Transferability of tag SNPs in genetic association studies in multiple populations , 2006, Nature Genetics.

[68]  S. Gabriel,et al.  Assessing the impact of population stratification on genetic association studies , 2004, Nature Genetics.

[69]  S E Hodge,et al.  The problem of ascertainment for linkage analysis. , 1996, American journal of human genetics.

[70]  N. Reid,et al.  Likelihood , 1993 .

[71]  Luc J. Smink,et al.  Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease , 2003, Nature.

[72]  J. Ott Analysis of Human Genetic Linkage , 1985 .

[73]  S. Garg,et al.  Genetic determination of islet cell autoimmunity in monozygotic twin, dizygotic twin, and non-twin siblings of patients with type 1 diabetes: prospective twin study , 1999, BMJ.

[74]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[75]  Ian Hacking Logic of Statistical Inference , 1965 .

[76]  B. Yandell,et al.  A markov chain monte carlo approach to detect polygene loci for complex traits , 1995 .

[77]  S E Hodge,et al.  Inherent intractability of the ascertainment problem for pedigree data: a general likelihood framework. , 1995, American journal of human genetics.

[78]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[79]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[80]  J. Karam,et al.  A Polymorphic Locus Near the Human Insulin Gene Is Associated with Insulin-dependent Diabetes Melliitus , 1984, Diabetes.

[81]  N. Morton Segregation analysis in human genetics. , 1958, Science.

[82]  V J Vieland,et al.  Bayesian linkage analysis, or: how I learned to stop worrying and love the posterior probability of linkage. , 1998, American journal of human genetics.

[83]  W J Ewens,et al.  A resolution of the ascertainment sampling problem. II. Generalizations and numerical results. , 1988, American journal of human genetics.

[84]  G. Abecasis,et al.  Optimal designs for two‐stage genome‐wide association studies , 2007, Genetic epidemiology.

[85]  V. Vieland,et al.  Two novel quantitative trait linkage analysis statistics based on the posterior probability of linkage: application to the COGA families , 2005, BMC Genetics.

[86]  V. Vieland,et al.  The consistency of the posterior probability of linkage , 2000, Annals of human genetics.

[87]  V. Vieland,et al.  Practical Considerations for Dividing Data into Subsets Prior to PPL Analysis , 2008, Human Heredity.

[88]  D. Greenberg,et al.  Summary of analyses of problem 2 simulated data for GAW11 , 1999, Genetic epidemiology.

[89]  Michael Evans,et al.  Fast and Accurate Calculation of a Computationally Intensive Statistic for Mapping Disease Genes , 2009, J. Comput. Biol..

[90]  W. Haenszel,et al.  Statistical aspects of the analysis of data from retrospective studies of disease. , 1959, Journal of the National Cancer Institute.

[91]  Expected Monotonicity – A Desirable Property for Evidence Measures? , 2010, Human Heredity.

[92]  R C Elston,et al.  Man bites dog? The validity of maximizing lod scores to determine mode of inheritance. , 1989, American journal of medical genetics.

[93]  V. Vieland,et al.  Two-locus heterogeneity cannot be distinguished from two-locus epistasis on the basis of affected-sib-pair data. , 2003, American journal of human genetics.

[94]  H. Cordell Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. , 2002, Human molecular genetics.

[95]  J. Ott,et al.  A computer program for linkage analysis of general human pedigrees. , 1976, American journal of human genetics.

[96]  Lisa J. Strug,et al.  An Alternative Foundation for the Planning and Evaluation of Linkage Analysis , 2006, Human Heredity.

[97]  Veronica J Vieland,et al.  A multilocus model of the genetic architecture of autoimmune thyroid disorder, with clinical implications. , 2008, American journal of human genetics.

[98]  V. Vieland,et al.  Accumulating quantitative trait linkage evidence across multiple datasets using the posterior probability of linkage , 2007, Genetic epidemiology.

[99]  I. Pe’er,et al.  Optimal two‐stage genotyping designs for genome‐wide association scans , 2006, Genetic epidemiology.

[100]  Terje O. Espelid,et al.  Algorithm 698: DCUHRE: an adaptive multidemensional integration routine for a vector of integrals , 1991, TOMS.

[101]  M. Spence,et al.  Simulated data for a complex genetic trait (Problem 2 for GAW11): How the model was developed, and why , 1999, Genetic epidemiology.

[102]  E. Lander,et al.  Complete multipoint sib-pair analysis of qualitative and quantitative traits. , 1995, American journal of human genetics.

[103]  J. Turtle,et al.  Diabetes in the New Millennium , 1999 .

[104]  W. Ewens,et al.  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). , 1993, American journal of human genetics.

[105]  D R Goldstein,et al.  Meta‐analysis by combining p‐values: Simulated linkage studies , 1999, Genetic epidemiology.

[106]  V. Vieland,et al.  Statistical Evidence: A Likelihood Paradigm , 1998 .

[107]  J. O’Connell,et al.  The VITESSE algorithm for rapid exact multilocus linkage analysis via genotype set–recoding and fuzzy inheritance , 1995, Nature Genetics.

[108]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[109]  Y. Tomer,et al.  Autoimmune thyroiditis and diabetes: dissecting the joint genetic susceptibility in a large cohort of multiplex families. , 2009, The Journal of clinical endocrinology and metabolism.

[110]  P. Visscher,et al.  The value of relatives with phenotypes but missing genotypes in association studies for quantitative traits , 2006, Genetic epidemiology.

[111]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[112]  R. A. Bailey,et al.  Robust associations of four new chromosome regions from genome-wide analyses of type 1 diabetes , 2007, Nature Genetics.

[114]  Alistair T. Pagnamenta,et al.  Novel method for combined linkage and genome-wide association analysis finds evidence of distinct genetic architecture for two subtypes of autism , 2011, Journal of Neurodevelopmental Disorders.