Comparison of mixed-model approaches for association mapping in rapeseed, potato, sugar beet, maize, and Arabidopsis

BackgroundIn recent years, several attempts have been made in plant genetics to detect QTL by using association mapping methods. The objectives of this study were to (i) evaluate various methods for association mapping in five plant species and (ii) for three traits in each of the plant species compare the Topt, the restricted maximum likelihood (REML) estimate of the conditional probability that two genotypes carry at the same locus alleles that are identical in state but not identical by descent. In order to compare the association mapping methods based on scenarios with realistic estimates of population structure and familial relatedness, we analyzed phenotypic and genotypic data of rapeseed, potato, sugar beet, maize, and Arabidopsis. For the same reason, QTL effects were simulated on top of the observed phenotypic values when examining the adjusted power for QTL detection.ResultsThe correlation between the Toptvalues identified using REML deviance profiles and profiles of the mean of squared difference between observed and expected P values was 0.83.ConclusionThe mixed-model association mapping approaches using a kinship matrix, which was based on Topt, were more appropriate for association mapping than the recently proposed QK method with respect to the adherence to the nominal α level and the adjusted power for QTL detection. Furthermore, we showed that Toptdiffers considerably among the five plant species but only marginally among different traits.

[1]  J. Reif,et al.  Linkage disequilibrium in European elite maize germplasm investigated with SSRs , 2005, Theoretical and Applied Genetics.

[2]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[3]  Robin Thompson,et al.  ASREML user guide release 1.0 , 2002 .

[4]  E. Buckler,et al.  Structure of linkage disequilibrium in plants. , 2003, Annual review of plant biology.

[5]  C. Calestani,et al.  A high-density genetic map of hexaploid wheat (Triticum aestivum L.) from the cross Chinese Spring × SQ1 and its use to compare QTLs for grain yield across a range of environments , 2005, Theoretical and Applied Genetics.

[6]  Mattias Jakobsson,et al.  The Pattern of Polymorphism in Arabidopsis thaliana , 2005, PLoS biology.

[7]  J. Holland,et al.  Estimating and Interpreting Heritability for Plant Breeding: An Update , 2010 .

[8]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[9]  E. Buckler,et al.  Using natural allelic diversity to evaluate gene function. , 2003, Methods in molecular biology.

[10]  G. Evanno,et al.  Detecting the number of clusters of individuals using the software structure: a simulation study , 2005, Molecular ecology.

[11]  C. Gebhardt,et al.  Single nucleotide polymorphism (SNP) genotyping as basis for developing a PCR-based marker highly diagnostic for potato varieties with high resistance to Globodera pallida pathotype Pa2/3 , 2006, Molecular Breeding.

[12]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[13]  A. Gallais Quantitative genetics and breeding methods in autopolyploid plants , 2004 .

[14]  H. Monod,et al.  Marker-assisted selection with spatial analysis of unreplicated field trials , 1999, Theoretical and Applied Genetics.

[15]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[16]  Edward S. Buckler,et al.  Dwarf8 polymorphisms associate with variation in flowering time , 2001, Nature Genetics.

[17]  M. Lynch,et al.  Estimation of relatedness by DNA fingerprinting. , 1988, Molecular biology and evolution.

[18]  P. N. Fox,et al.  Statistical Methods for Plant Variety Evaluation , 1997, Plant Breeding Series 3 Statistical Methods for Plant Variety Evaluation.

[19]  S. Muse,et al.  Genetic structure and diversity among maize inbred lines as inferred from DNA microsatellites. , 2003, Genetics.

[20]  Keyan Zhao,et al.  Genome-Wide Association Mapping in Arabidopsis Identifies Previously Known Flowering Time and Pathogen Resistance Genes , 2005, PLoS genetics.

[21]  O. Hardy,et al.  spagedi: a versatile computer program to analyse spatial genetic structure at the individual or population levels , 2002 .

[22]  William E. Fry,et al.  Quantification of General Resistance of Potato Cultivars and Fungicide Effects for Integrated Control of Potato Late Blight , 1978 .

[23]  M. Purugganan,et al.  Linkage Disequilibrium Mapping of Arabidopsis CRY2 Flowering Time Alleles Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos. AY576055, AY576271. , 2004, Genetics.

[24]  R. Bernardo,et al.  Estimation of coefficient of coancestry using molecular markers in maize , 1993, Theoretical and Applied Genetics.

[25]  Keyan Zhao,et al.  An Arabidopsis Example of Association Mapping in Structured Samples , 2006, PLoS genetics.

[26]  A. Melchinger,et al.  Genetic diversity in European winter triticale determined with SSR markers and coancestry coefficient , 2004, Theoretical and Applied Genetics.

[27]  S. Tanksley Mapping polygenes. , 1993, Annual review of genetics.

[28]  A. Paterson,et al.  Molecular dissection of quantitative traits: progress and prospects. , 1995, Genome research.

[29]  Hans-Peter Piepho,et al.  Comparison of Mixed-Model Approaches for Association Mapping , 2008, Genetics.

[30]  P. Stam,et al.  Association between relationship measures based on AFLP markers, pedigree data and morphological traits in barley , 1997, Theoretical and Applied Genetics.

[31]  F. V. van Eeuwijk,et al.  Linkage Disequilibrium Mapping of Yield and Yield Stability in Modern Spring Barley Cultivars , 2004, Genetics.

[32]  R. Bernardo,et al.  In silico mapping of quantitative trait loci in maize , 2004, Theoretical and Applied Genetics.

[33]  Amanda J. Garris,et al.  Genetic Structure and Diversity in Oryza sativa L. , 2005, Genetics.

[34]  Steen Magnussen,et al.  An Algorithm for Generating Positively Correlated Beta Distributed Random Variables , 2002, Comput. Stat. Data Anal..

[35]  F. V. van Eeuwijk,et al.  A Mixed-Model Approach to Association Mapping Using Pedigree Information With an Illustration of Resistance to Phytophthora infestans in Potato , 2007, Genetics.

[36]  R. Collins,et al.  Newly identified loci that influence lipid concentrations and risk of coronary artery disease , 2008, Nature Genetics.

[37]  Brian R. Cullis,et al.  Spatial analysis of multi-environment early generation variety trials , 1998 .

[38]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[39]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[40]  M. Yano,et al.  Genetic and molecular dissection of naturally occurring variation. , 2001, Current opinion in plant biology.

[41]  H. Piepho,et al.  Mixed modelling for QTL × environment interaction analysis , 2004, Euphytica.

[42]  M. Sorrells,et al.  Association Mapping of Kernel Size and Milling Quality in Wheat (Triticum aestivum L.) Cultivars , 2006, Genetics.

[43]  E S Buckler,et al.  Structure of linkage disequilibrium and phenotypic associations in the maize genome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[44]  A. Melchinger,et al.  Diversity and Relationships among U.S. Maize Inbreds Revealed by Restriction Fragment Length Polymorphisms , 1991 .

[45]  Hiroshi Sato,et al.  Functional SNPs in the lymphotoxin-α gene that are associated with susceptibility to myocardial infarction , 2002, Nature Genetics.

[46]  F. Salamini,et al.  Mapping QTLs for sucrose content, yield and quality in a sugar beet population fingerprinted by EST-related markers , 2002, Theoretical and Applied Genetics.

[47]  R. Bernardo,et al.  Marker-based estimates of identity by descent and alikeness in state among maize inbreds , 1996, Theoretical and Applied Genetics.

[48]  A. Long,et al.  The power of association studies to detect the contribution of candidate genetic loci to variation in complex traits. , 1999, Genome research.

[49]  E. Buckler,et al.  Genetic association mapping and genome organization of maize. , 2006, Current opinion in biotechnology.

[50]  Hans-Peter Piepho,et al.  A Note on the Analysis of Designed Experiments with Complex Treatment Structure , 2006 .

[51]  J. Veyrieras,et al.  Evaluating the Reliability of Structure Outputs in Case of Relatedness between Individuals , 2007 .

[52]  Robin Thompson,et al.  Analyzing Variety by Environment Data Using Multiplicative Mixed Models and Adjustments for Spatial Field Trend , 2001, Biometrics.