Evolutionary Dynamics of Co-Segregating Gene Clusters Associated with Complex Diseases

Background The distribution of human disease-associated mutations is not random across the human genome. Despite the fact that natural selection continually removes disease-associated mutations, an enrichment of these variants can be observed in regions of low recombination. There are a number of mechanisms by which such a clustering could occur, including genetic perturbations or demographic effects within different populations. Recent genome-wide association studies (GWAS) suggest that single nucleotide polymorphisms (SNPs) associated with complex disease traits are not randomly distributed throughout the genome, but tend to cluster in regions of low recombination. Principal Findings Here we investigated whether deleterious mutations have accumulated in regions of low recombination due to the impact of recent positive selection and genetic hitchhiking. Using publicly available data on common complex diseases and population demography, we observed an enrichment of hitchhiked disease associations in conserved gene clusters subject to selection pressure. Evolutionary analysis revealed that these conserved gene clusters arose by multiple concerted rearrangements events across the vertebrate lineage. We observed distinct clustering of disease-associated SNPs in evolutionary rearranged regions of low recombination and high gene density, which harbor genes involved in immunity, that is, the interleukin cluster on 5q31 or RhoA on 3p21. Conclusions Our results suggest that multiple lineage specific rearrangements led to a physical clustering of functionally related and linked genes exhibiting an enrichment of susceptibility loci for complex traits. This implies that besides recent evolutionary adaptations other evolutionary dynamics have played a role in the formation of linked gene clusters associated with complex disease traits.

[1]  Tariq Ahmad,et al.  Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci , 2010, Nature Genetics.

[2]  K. M. Wegner Clustering of Drosophila melanogaster Immune Genes in Interplay with Recombination Rate , 2008, PloS one.

[3]  S. Snelling,et al.  An SNP in the 5′-UTR of GDF5 is associated with osteoarthritis susceptibility in Europeans and with in vivo differences in allelic expression in articular cartilage , 2007 .

[4]  D. Reich,et al.  Human Population Differentiation Is Strongly Correlated with Local Recombination Rate , 2010, PLoS genetics.

[5]  Ryan D. Hernandez,et al.  Proportionally more deleterious genetic variation in European than in African populations , 2008, Nature.

[6]  Mark Daly,et al.  Haploview: analysis and visualization of LD and haplotype maps , 2005, Bioinform..

[7]  Camilo Salazar,et al.  Chromosomal rearrangements maintain a polymorphic supergene controlling butterfly mimicry , 2011, Nature.

[8]  Chandler D. Gatenbee,et al.  Crohn's disease and genetic hitchhiking at IBD5. , 2012, Molecular biology and evolution.

[9]  Andrew D. Johnson,et al.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap , 2008, Bioinform..

[10]  L. Duret,et al.  Evolutionary origin and maintenance of coexpressed gene clusters in mammals. , 2006, Molecular biology and evolution.

[11]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[12]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[13]  Pardis C Sabeti,et al.  Linkage disequilibrium in the human genome , 2001, Nature.

[14]  L. Quintana-Murci,et al.  From evolutionary genetics to human immunology: how selection shapes host defence genes , 2010, Nature Reviews Genetics.

[15]  Joshua M Akey,et al.  Where do we go from here? Constructing genomic maps of positive selection in humans: , 2009 .

[16]  B. Charlesworth Measures of divergence between populations and the effect of forces that reduce variability. , 1998, Molecular biology and evolution.

[17]  Anders Albrechtsen,et al.  Natural Selection Affects Multiple Aspects of Genetic Variation at Putatively Neutral Sites across the Human Genome , 2011, PLoS genetics.

[18]  Yusuke Nakamura,et al.  Association analysis of genetic variants in IL23R, ATG16L1 and 5p13.1 loci with Crohn's disease in Japanese patients , 2007, Journal of Human Genetics.

[19]  A. Gamian,et al.  Impaired erythrocyte antioxidant defense in active inflammatory bowel disease: Impact of anemia and treatment , 2010, Inflammatory bowel diseases.

[20]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[21]  Chandler D. Gatenbee,et al.  Crohn ’ s Disease and Genetic Hitchhiking at IBD 5 , 2011 .

[22]  A. Cutter,et al.  Natural selection shapes nucleotide polymorphism across the genome of the nematode Caenorhabditis briggsae. , 2010, Genome research.

[23]  Philip Rosenstiel,et al.  Genome-wide association study for Crohn's disease in the Quebec Founder Population identifies multiple validated disease loci , 2007, Proceedings of the National Academy of Sciences.

[24]  P. Donnelly,et al.  A Fine-Scale Map of Recombination Rates and Hotspots Across the Human Genome , 2005, Science.

[25]  Andreas Prlic,et al.  Ensembl 2008 , 2007, Nucleic Acids Res..

[26]  A. Caballero,et al.  Variation After a Selective Sweep in a Subdivided Population , 2005, Genetics.

[27]  Justin C. Fay,et al.  Evidence for Hitchhiking of Deleterious Mutations within the Human Genome , 2011, PLoS genetics.

[28]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[29]  J. Akey,et al.  Fitting background-selection predictions to levels of nucleotide variation and divergence along the human autosomes. , 2005, Genome research.

[30]  Judy H. Cho,et al.  [Letters to Nature] , 1975, Nature.

[31]  John Loughlin,et al.  An SNP in the 5'-UTR of GDF5 is associated with osteoarthritis susceptibility in Europeans and with in vivo differences in allelic expression in articular cartilage. , 2007, Human molecular genetics.

[32]  K. Marder,et al.  The APOE- (cid:101) 4 Allele and the Risk of Alzheimer Disease Among African Americans, Whites, and Hispanics , 2001 .

[33]  P. Green,et al.  Widespread Genomic Signatures of Natural Selection in Hominid Evolution , 2009, PLoS genetics.

[34]  Andrew D. Johnson,et al.  Bmc Medical Genetics an Open Access Database of Genome-wide Association Results , 2009 .

[35]  W. G. Hill,et al.  The effect of linkage on limits to artificial selection. , 1966, Genetical research.

[36]  Laurence D. Hurst,et al.  Evidence for co-evolution of gene order and recombination rate , 2003, Nature Genetics.

[37]  Christian Gieger,et al.  A genome-wide meta-analysis identifies 22 loci associated with eight hematological parameters in the HaemGen consortium , 2009, Nature Genetics.

[38]  Kazuho Ikeo,et al.  Rapid Evolution of Major Histocompatibility Complex Class I Genes in Primates Generates New Disease Alleles in Humans via Hitchhiking Diversity , 2006, Genetics.

[39]  S. Tishkoff,et al.  Positive Selection Can Create False Hotspots of Recombination , 2006, Genetics.

[40]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[41]  Sinead B. O'Leary,et al.  Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease , 2001, Nature Genetics.

[42]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[43]  Wei Chen,et al.  SNP@Evolution: a hierarchical database of positive selection on the human genome , 2009, BMC Evolutionary Biology.

[44]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[45]  P. O’Reilly,et al.  Confounding between recombination and selection, and the Ped/Pop method for detecting selection. , 2008, Genome research.

[46]  Mourad Sahbatou,et al.  Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease , 2001, Nature.

[47]  W. Cresko,et al.  Extensive linkage disequilibrium and parallel adaptive divergence across threespine stickleback genomes , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[48]  Benjamin Meder,et al.  HBEGF, SRA1, and IK: Three cosegregating genes as determinants of cardiomyopathy. , 2009, Genome research.

[49]  Edwin Cuppen,et al.  Haplotype Block Structure Is Conserved across Mammals , 2006, PLoS genetics.

[50]  Yaakov Stern,et al.  The APOE-∊4 Allele and the Risk of Alzheimer Disease Among African Americans, Whites, and Hispanics , 1998 .

[51]  S. Chanock,et al.  Polymorphism analysis of six selenoprotein genes: support for a selective sweep at the glutathione peroxidase 1 locus (3p21) in Asian populations , 2006, BMC Genetics.