Relaxed Significance Criteria for Linkage Analysis

Linkage analysis involves performing significance tests at many loci located throughout the genome. Traditional criteria for declaring a linkage statistically significant have been formulated with the goal of controlling the rate at which any single false positive occurs, called the genomewise error rate (GWER). As complex traits have become the focus of linkage analysis, it is increasingly common to expect that a number of loci are truly linked to the trait. This is especially true in mapping quantitative trait loci (QTL), where sometimes dozens of QTL may exist. Therefore, alternatives to the strict goal of preventing any single false positive have recently been explored, such as the false discovery rate (FDR) criterion. Here, we characterize some of the challenges that arise when defining relaxed significance criteria that allow for at least one false positive linkage to occur. In particular, we show that the FDR suffers from several problems when applied to linkage analysis of a single trait. We therefore conclude that the general applicability of FDR for declaring significant linkages in the analysis of a single trait is dubious. Instead, we propose a significance criterion that is more relaxed than the traditional GWER, but does not appear to suffer from the problems of the FDR. A generalized version of the GWER is proposed, called GWERk, that allows one to provide a more liberal balance between true positives and false positives at no additional cost in computation or assumptions.

[1]  I. Borecki,et al.  Common variants in the 5' region of the leptin gene are associated with body mass index in men from the National Heart, Lung, and Blood Institute Family Heart Study. , 2004, American journal of human genetics.

[2]  R. N. Curnow,et al.  Estimating the locations and the sizes of the effects of quantitative trait loci using flanking markers , 1992, Theoretical and Applied Genetics.

[3]  Z. Zeng Precision mapping of quantitative trait loci. , 1994, Genetics.

[4]  R. Doerge,et al.  Permutation tests for multiple loci affecting a quantitative character. , 1996, Genetics.

[5]  Eric S. Lander,et al.  A comprehensive genetic map of the mouse genome , 1996, Nature.

[6]  M. Soller,et al.  A whole genome scan for quantitative trait loci affecting milk protein percentage in Israeli-Holstein cattle, by means of selective milk DNA pooling in a daughter design, using an adjusted false discovery rate criterion. , 2001, Genetics.

[7]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Allen Rodrigo GoDIVA: a pipeline for the design of bioinformatics applications , 2004 .

[9]  Eric S. Lander,et al.  A comprehensive genetic map of the mouse genome , 1996, Nature.

[10]  John D. Storey,et al.  Genetic interactions between polymorphisms that affect gene expression in yeast , 2005, Nature.

[11]  B. Sorić Statistical “Discoveries” and Effect-Size Estimation , 1989 .

[12]  A. Farcomeni Some Results on the Control of the False Discovery Rate under Dependence , 2007 .

[13]  John D. Storey The positive false discovery rate: a Bayesian interpretation and the q-value , 2003 .

[14]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[15]  T P Speed,et al.  Modeling interference in genetic recombination. , 1995, Genetics.

[16]  C. Haley,et al.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers , 1992, Heredity.

[17]  D. Siegmund,et al.  Statistical methods for linkage analysis of complex traits from high-resolution maps of identity by descent. , 1995, Genetics.

[18]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[19]  Hao Wu,et al.  R/qtl: QTL Mapping in Experimental Crosses , 2003, Bioinform..

[20]  R. Jansen A general mixture model for mapping quantitative trait loci by using molecular markers , 1992, Theoretical and Applied Genetics.

[21]  J. Weller,et al.  A new approach to the problem of multiple comparisons in the genetic dissection of complex traits. , 1998, Genetics.

[22]  John D. Storey,et al.  Multiple Locus Linkage Analysis of Genomewide Expression in Yeast , 2005, PLoS biology.

[23]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[24]  Z B Zeng,et al.  Theoretical basis for separation of multiple linked gene effects in mapping quantitative trait loci. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Y. Benjamini,et al.  Quantitative Trait Loci Analysis Using the False Discovery Rate , 2005, Genetics.

[26]  W. Ewens Genetics and analysis of quantitative traits , 1999 .

[27]  E Feingold,et al.  Gaussian models for genetic linkage analysis using complete high-resolution maps of identity by descent. , 1993, American journal of human genetics.

[28]  R. Jansen,et al.  University of Groningen High Resolution of Quantitative Traits Into Multiple Loci via Interval Mapping , 2022 .

[29]  John D. Storey,et al.  Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach , 2004 .

[30]  John D. Storey A direct approach to false discovery rates , 2002 .

[31]  R. Fernando,et al.  Controlling the Proportion of False Positives in Multiple Dependent Tests , 2004, Genetics.

[32]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Rachel B. Brem,et al.  Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors , 2003, Nature Genetics.

[34]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[35]  D. Zaykin,et al.  Using the false discovery rate approach in the genetic dissection of complex traits: a response to Weller et al. , 2000, Genetics.

[36]  A. Kong,et al.  Linkage mapping in experimental crosses: the robustness of single-gene models. , 1997, Genetics.

[37]  V. Guiard,et al.  Multiple Quantitative Trait Loci Mapping With Cofactors and Application of Alternative Variants of the False Discovery Rate in an Enlarged Granddaughter Design , 2004, Genetics.

[38]  M. Spence,et al.  Analysis of human genetic linkage , 1986 .

[39]  N. Morton Sequential tests for the detection of linkage. , 1955, American journal of human genetics.

[40]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[41]  W. Wong,et al.  GoSurfer: a graphical interactive tool for comparative analysis of large gene sets in Gene Ontology space. , 2004, Applied bioinformatics.