Derived SNP Alleles are Used More Frequently than Ancestral Alleles as Risk-associated Variants in Common Human Diseases

Evolutionary aspects of the genetic architecture of common human diseases remain enigmatic. The results of more than 200 genome-wide association studies published to date were compiled in a catalog (). We used cataloged data to determine whether derived (mutant) alleles are associated with higher risk of human disease more frequently than ancestral alleles. We placed all allelic variants into ten categories of population frequency (0%-100%) in 10% increments. We then analyzed the relationship between allelic frequency, evolutionary status of the polymorphic site (ancestral versus derived), and disease risk status (risk versus protection). Given the same population frequency, derived alleles are more likely to be risk associated than ancestral alleles, as are rarer alleles. The common interpretation of this association is that negative selection prevents fixation of the risk variants. However, disease stratification as early or late onset suggests that weak selection against risk-associated alleles is unlikely a major factor shaping genetic architecture of common diseases. Our results clearly suggest that the duration of existence of an allele in a population is more important. Alleles existing longer tend to show weaker linkage disequilibrium with neighboring alleles, including the causal alleles, and are less likely to tag a SNP-disease association.

[1]  B. Shastry SNPs in disease gene mapping, medicinal drug development and evolution , 2007, Journal of Human Genetics.

[2]  Andrew D. Johnson,et al.  Bmc Medical Genetics an Open Access Database of Genome-wide Association Results , 2009 .

[3]  Marek Kimmel,et al.  Forward-Time Simulations of Human Populations with Complex Diseases , 2007, PLoS genetics.

[4]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[5]  R. Elston,et al.  Finding genes underlying human disease , 2009, Clinical genetics.

[6]  S. Pavard,et al.  Negative Selection on BRCA1 Susceptibility Alleles Sheds Light on the Population Genetics of Late-Onset Diseases and Aging Theory , 2007, PloS one.

[7]  M. Slatkin,et al.  The Joint Allele-Frequency Spectrum in Closely Related Species , 2007, Genetics.

[8]  B. Charlesworth,et al.  A polygenic basis for late-onset disease. , 2003, Trends in genetics : TIG.

[9]  N. Morton,et al.  Into the post-HapMap era. , 2008, Advances in genetics.

[10]  J. Lachance Disease-associated alleles in genome-wide association studies are enriched for derived low frequency alleles relative to HapMap and neutral expectations , 2010, BMC Medical Genomics.

[11]  Robert J. Klein,et al.  Predicting functionally important SNP classes based on negative selection , 2011, BMC Bioinformatics.

[12]  K. Kidd,et al.  Nonsynonymous SNPs: validation characteristics, derived allele frequency patterns, and suggestive evidence for natural selection , 2006, Human mutation.

[13]  M. Spitz,et al.  Shifting paradigm of association studies: value of rare single-nucleotide polymorphisms. , 2008, American journal of human genetics.

[14]  Peng Yue,et al.  SNPs3D: Candidate gene and SNP selection for association studies , 2006, BMC Bioinformatics.

[15]  T. Ohta,et al.  Probability of gene fixation in an expanding finite population. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[16]  K. Lange,et al.  Prioritizing GWAS results: A review of statistical methods and recommendations for their application. , 2010, American journal of human genetics.