Emergence and influence of sequence bias in evolutionarily malleable, mammalian tandem arrays

The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms—as diverse as bats, dolphins, and elephants—in only 10-20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these “outward-looking” families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression. Here, we show that outward-looking genes are clustered in tandem arrays, enriched in AT-biased isochores, and lack CpG islands in their promoters. Models of mammalian genome evolution have not incorporated the sharply different functions and transcriptional patterns of genes in AT-versus GC-biased regions. To examine the relationship between gene family expansion, sequence content, and functional diversification, we use population genetic data and comparative analysis. First, we find that AT bias can emerge with gene family expansion in cis. Second, human genes in AT-biased isochores or with GC-poor promoters experience relatively low rates of de novo point mutation today but are enriched for functional variants. Finally, we find that isochores containing gene clusters exhibit low rates of recombination. We hypothesize that the depletion of GC bases in outward-facing gene clusters results from tolerance of sequence variation and low recombination. In turn, high AT content exerts a profound effect on their chromatin organization and transcriptional regulation.

[1]  Voichita D. Marinescu,et al.  Evolutionary constraint and innovation across hundreds of placental mammals , 2023, bioRxiv.

[2]  Jonathan M. Mudge,et al.  A joint NCBI and EMBL-EBI transcript set for clinical genomics and research , 2022, Nature.

[3]  Yan Zhang,et al.  Highly enriched BEND3 prevents the premature activation of bivalent genes during differentiation , 2022, Science.

[4]  T. Wiehe,et al.  Recombination, selection, and the evolution of tandem gene arrays , 2022, bioRxiv.

[5]  Ruth I. Tennen,et al.  The UGT2A1/UGT2A2 locus is associated with COVID-19-related loss of smell or taste , 2022, Nature Genetics.

[6]  D. Weigel,et al.  Mutation bias reflects natural selection in Arabidopsis thaliana , 2022, Nature.

[7]  Aleksandra A. Kolodziejczyk,et al.  Cell-type specialization is encoded by specific chromatin topologies , 2021, Nature.

[8]  Kam Y. J. Zhang,et al.  A loss-of-function variant in SUV39H2 identified in autism-spectrum disorder causes altered H3K9 trimethylation and dysregulation of protocadherin β-cluster genes in the developing brain , 2021, Molecular Psychiatry.

[9]  Lukas Burger,et al.  BANP opens chromatin and activates CpG-island-regulated genes , 2021, Nature.

[10]  G. Sella,et al.  Broad-scale variation in human genetic diversity levels is predicted by purifying selection on coding and non-coding elements , 2021, bioRxiv.

[11]  Rhett M. Rautsaw,et al.  Phylogenetically diverse diets favor more complex venoms in North American pitvipers , 2021, Proceedings of the National Academy of Sciences.

[12]  X. Xie,et al.  Changes in genome architecture and transcriptional dynamics progress independently of sensory experience during post-natal brain development , 2021, Cell.

[13]  Ethan K. Scott,et al.  Deep conservation of the enhancer regulatory code in animals , 2020, Science.

[14]  A. Barbadilla,et al.  Germline de novo mutation rates on exons versus introns in humans , 2020, Nature Communications.

[15]  C. Smadja,et al.  The role of copy-number variation in the reinforcement of sexual isolation between the two European subspecies of the house mouse , 2020, Philosophical Transactions of the Royal Society B.

[16]  C. Hetz,et al.  Mechanisms, regulation and functions of the unfolded protein response , 2020, Nature Reviews Molecular Cell Biology.

[17]  S. Carroll,et al.  The origin and diversification of a novel protein family in venomous snakes , 2020, Proceedings of the National Academy of Sciences.

[18]  J. Boeke,et al.  Widespread Transcriptional Scanning in the Testis Modulates Gene Evolution Rates , 2020, Cell.

[19]  T. Wiehe,et al.  DNA sequence-dependent chromatin architecture and nuclear hubs formation , 2019, Scientific Reports.

[20]  P. Fraser,et al.  Long-range enhancer–promoter contacts in gene expression control , 2019, Nature Reviews Genetics.

[21]  M. J. Sheehan,et al.  Evolutionary patterns of major urinary protein scent signals in house mice and relatives , 2019, Molecular ecology.

[22]  D. Duboule,et al.  The constrained architecture of mammalian Hox gene clusters , 2019, Proceedings of the National Academy of Sciences.

[23]  W. Bickmore Patterns in the genome , 2019, Heredity.

[24]  X. Xie,et al.  Three-dimensional genome structures of single sensory neurons in mouse visual and olfactory systems , 2019, Nature Structural & Molecular Biology.

[25]  E. Bruford,et al.  Update on the human and mouse lipocalin (LCN) gene family, including evidence the mouse Mup cluster is result of an “evolutionary bloom” , 2019, Human Genomics.

[26]  Ryan L. Collins,et al.  The mutational constraint spectrum quantified from variation in 141,456 humans , 2020, Nature.

[27]  Hannes P. Eggertsson,et al.  Characterizing mutagenic effects of recombination through a sequence-level genetic map , 2019, Science.

[28]  B. Garcia,et al.  H3K9me3-heterochromatin loss at protein-coding genes enables developmental lineage specification , 2019, Science.

[29]  S. Lomvardas,et al.  LHX2- and LDB1-mediated trans interactions regulate olfactory receptor choice , 2019, Nature.

[30]  W. Low,et al.  Rapid birth-death evolution and positive selection in detoxification-type glutathione S-transferases in mammals , 2018, PloS one.

[31]  T. Wiehe,et al.  A common genomic code for chromatin architecture and recombination landscape , 2018, bioRxiv.

[32]  Emmanuel Paradis,et al.  ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R , 2018, Bioinform..

[33]  Graham J. Etherington,et al.  Adaptation and conservation insights from the koala genome , 2018, Nature Genetics.

[34]  T. Kepler,et al.  The Egyptian Rousette Genome Reveals Unexpected Features of Bat Antiviral Immunity , 2018, Cell.

[35]  Aaron J. Wilk,et al.  Diversification of human NK cells: Lessons from deep profiling , 2018, Journal of leukocyte biology.

[36]  Graham M. Hughes,et al.  The Birth and Death of Olfactory Receptor Gene Families in Mammalian Niche Adaptation , 2018, Molecular biology and evolution.

[37]  J. Pritchard,et al.  Frequent nonallelic gene conversion on the human lineage and its effect on the divergence of gene duplicates , 2017, Proceedings of the National Academy of Sciences.

[38]  J. Mainland,et al.  Genetic variation across the human olfactory receptor repertoire alters odor perception , 2017, Proceedings of the National Academy of Sciences.

[39]  Hannes P. Eggertsson,et al.  Parental influence on human germline de novo mutations in 1,548 trios from Iceland , 2017, Nature.

[40]  Stavros Lomvardas,et al.  Cooperative interactions enable singular olfactory receptor expression in mouse olfactory neurons , 2017, bioRxiv.

[41]  Shixia Xu,et al.  Comparative genomics analyses of alpha-keratins reveal insights into evolutionary adaptation of marine mammals , 2017, Frontiers in Zoology.

[42]  Loretta Auvil,et al.  Reconstruction and evolutionary history of eutherian chromosomes , 2017, Proceedings of the National Academy of Sciences.

[43]  Y. Gao,et al.  Structural Modeling of Chromatin Integrates Genome Features and Reveals Chromosome Folding Principle , 2017, bioRxiv.

[44]  H. Musto,et al.  The Isochores as a Fundamental Level of Genome Structure and Organization: A General Overview , 2017, Journal of Molecular Evolution.

[45]  S. Koren,et al.  The evolution of the natural killer complex; a comparison between mammals using new high-quality genome assemblies and targeted annotation , 2016, bioRxiv.

[46]  Kamel Jabbari,et al.  An Isochore Framework Underlies Chromatin Architecture , 2016, bioRxiv.

[47]  Jay Shendure,et al.  Understanding Spatial Genome Organization: Methods and Insights , 2016, Genom. Proteom. Bioinform..

[48]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[49]  T. de Lange A loopy view of telomere evolution , 2015, Frontiers in genetics.

[50]  P. Ahlberg,et al.  New genomic and fossil data illuminate the origin of enamel , 2015, Nature.

[51]  Rebekah L. Rogers Chromosomal Rearrangements as Barriers to Genetic Homogenization between Archaic and Modern Humans. , 2015, Molecular biology and evolution.

[52]  Morris Swertz,et al.  Genome-wide patterns and properties of de novo mutations in humans , 2015, Nature Genetics.

[53]  H. Ellegren,et al.  Evolutionary Consequences of DNA Methylation on the GC Content in Vertebrate Genomes , 2015, G3: Genes, Genomes, Genetics.

[54]  Luciano Milanesi,et al.  Segmenting the Human Genome into Isochores , 2015, Evolutionary bioinformatics online.

[55]  R. Camerini-Otero,et al.  Recombination initiation maps of individual human genomes , 2014, Science.

[56]  Nadav Ahituv,et al.  Enhancer Interaction Networks as a Means for Singular Olfactory Receptor Expression , 2014, Cell.

[57]  Philipp W. Messer,et al.  Quantification of GC-biased gene conversion in the human genome , 2014, bioRxiv.

[58]  S. O’Brien,et al.  Mammalian keratin associated proteins (KRTAPs) subgenomes: disentangling hair diversity and adaptation to terrestrial and aquatic environments , 2014, BMC Genomics.

[59]  Molly Przeworski,et al.  Determinants of mutation rate variation in the human germline. , 2014, Annual review of genomics and human genetics.

[60]  Kazushige Touhara,et al.  Extreme expansion of the olfactory receptor gene repertoire in African elephants and evolutionary dynamics of orthologous gene groups in 13 placental mammals , 2014, Genome research.

[61]  Andreas Keller,et al.  The Missense of Smell: Functional Variability in the Human Odorant Receptor Repertoire , 2013, Nature Neuroscience.

[62]  S. Lomvardas,et al.  Co-Opting the Unfolded Protein Response to Elicit Olfactory Receptor Feedback , 2013, Cell.

[63]  Matthew D. Schultz,et al.  Global Epigenomic Reconfiguration During Mammalian Brain Development , 2013, Science.

[64]  David Sweatt,et al.  Faculty Opinions recommendation of Global epigenomic reconfiguration during mammalian brain development. , 2013 .

[65]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[66]  Katherine S. Pollard,et al.  A Model-Based Analysis of GC-Biased Gene Conversion in the Human and Chimpanzee Genomes , 2013, PLoS genetics.

[67]  D. Nelson,et al.  The cytochrome P450 genesis locus: the origin and evolution of animal cytochrome P450s , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[68]  S. Cockroft,et al.  Transcription forms and remodels supercoiling domains unfolding large-scale chromatin structures , 2013, Nature Structural &Molecular Biology.

[69]  Zachary D. Smith,et al.  DNA methylation: roles in mammalian development , 2013, Nature Reviews Genetics.

[70]  A. Jeffreys,et al.  Recombination regulator PRDM9 influences the instability of its own coding sequence in humans , 2012, Proceedings of the National Academy of Sciences.

[71]  R. Hardison Evolution of hemoglobin and its genes. , 2012, Cold Spring Harbor perspectives in medicine.

[72]  Carolyn A. Larabell,et al.  Nuclear Aggregation of Olfactory Receptor Genes Governs Their Monogenic Expression , 2012, Cell.

[73]  H. Willard,et al.  Evidence for sequence biases associated with patterns of histone methylation , 2012, BMC Genomics.

[74]  B. King,et al.  Clusters of Ancestrally Related Genes That Show Paralogy in Whole or in Part Are a Major Feature of the Genomes of Humans and Other Species , 2012, PloS one.

[75]  Wouter de Laat,et al.  A Regulatory Archipelago Controls Hox Genes Transcription in Digits , 2011, Cell.

[76]  Ananda L Roy,et al.  Enhancer-promoter communication and transcriptional regulation of Igh. , 2011, Trends in immunology.

[77]  Jamal Tazi,et al.  Regulated functional alternative splicing in Drosophila , 2011, Nucleic acids research.

[78]  Yun-Shien Lee,et al.  Analysis of human meiotic recombination events with a parent-sibling tracing approach , 2011, BMC Genomics.

[79]  S. Lomvardas,et al.  High-throughput mapping of the promoters of the mouse olfactory receptor genes reveals a new type of mammalian promoter and provides insight into olfactory receptor gene regulation. , 2011, Genome research.

[80]  K. Kawasaki,et al.  The evolution of milk casein genes from tooth genes before the origin of mammals. , 2011, Molecular biology and evolution.

[81]  M. Noor,et al.  Recombination rate variation in closely related species , 2011, Heredity.

[82]  A. Bird,et al.  CpG islands and the regulation of transcription. , 2011, Genes & development.

[83]  Manolis Kellis,et al.  An Epigenetic Signature for Monoallelic Olfactory Receptor Expression , 2011, Cell.

[84]  A. Gylfason,et al.  Fine-scale recombination rate differences between sexes, populations and individuals , 2010, Nature.

[85]  D. Petrov,et al.  Evidence That Mutation Is Universally Biased towards AT in Bacteria , 2010, PLoS genetics.

[86]  K. Paigen,et al.  Prdm9 Controls Activation of Mammalian Recombination Hotspots , 2010, Science.

[87]  P. Donnelly,et al.  Drive Against Hotspot Motifs in Primates Implicates the PRDM9 Gene in Meiotic Recombination , 2010, Science.

[88]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[89]  M. Lynch Rate, molecular spectrum, and consequences of human mutation , 2010, Proceedings of the National Academy of Sciences.

[90]  Laurent Duret,et al.  Biased gene conversion and the evolution of mammalian genomic landscapes. , 2009, Annual review of genomics and human genetics.

[91]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[92]  P. Green,et al.  Widespread Genomic Signatures of Natural Selection in Hominid Evolution , 2009, PLoS genetics.

[93]  M. Hultén Chiasma distribution at diakinesis in the normal human male. , 2009, Hereditas.

[94]  R. Lane,et al.  Exceptional LINE Density at V1R Loci: The Lyon Repeat Hypothesis Revisited on Autosomes , 2009, Journal of Molecular Evolution.

[95]  Masatoshi Nei,et al.  The evolution of animal chemosensory receptor gene repertoires: roles of chance and necessity , 2008, Nature Reviews Genetics.

[96]  B. Trask,et al.  Extensive copy-number variation of the human olfactory receptor gene family. , 2008, American journal of human genetics.

[97]  Martin W Simmen,et al.  Genome-scale relationships between cytosine methylation and dinucleotide abundances in animals. , 2008, Genomics.

[98]  Laurent Duret,et al.  The Impact of Recombination on Nucleotide Substitutions in the Human Genome , 2008, PLoS genetics.

[99]  D. Ramsden,et al.  End-bridging is required for pol μ to efficiently promote repair of noncomplementary ends by nonhomologous end joining , 2008, Nucleic acids research.

[100]  Fernando A. Villanea,et al.  Diet and the evolution of human amylase gene copy number variation , 2007, Nature Genetics.

[101]  K. Schwarz,et al.  Single-stranded DNA ligation and XLF-stimulated incompatible DNA end ligation by the XRCC4-DNA ligase IV complex: influence of terminal DNA sequence , 2007, Nucleic acids research.

[102]  M. Nei,et al.  Extensive Gains and Losses of Olfactory Receptor Genes in Mammalian Evolution , 2007, PloS one.

[103]  Job Dekker,et al.  GC- and AT-rich chromatin domains differ in conformation and histone modification status and are differentially modulated by Rpd3p , 2007, Genome Biology.

[104]  James H. Thomas Rapid Birth–Death Evolution Specific to Xenobiotic Cytochrome P450 Genes in Vertebrates , 2007, PLoS genetics.

[105]  Tom Maniatis,et al.  Identification of long-range regulatory elements in the protocadherin-α gene cluster , 2006, Proceedings of the National Academy of Sciences.

[106]  R. Feyereisen,et al.  Evolution of insect P450. , 2006, Biochemical Society transactions.

[107]  B. Malnic,et al.  Identification of potential regulatory motifs in odorant receptor genes by analysis of promoter sequences. , 2006, Genome research.

[108]  Feng Gao,et al.  GC-Profile: a web-based tool for visualizing and analyzing the variation of GC content in genomic sequences , 2006, Nucleic Acids Res..

[109]  Giorgio Bernardi,et al.  An isochore map of human chromosomes. , 2006, Genome research.

[110]  David C. Schwartz,et al.  DNA sequence of human chromosome 17 and analysis of rearrangement in the human lineage , 2006, Nature.

[111]  T. Haaf,et al.  7E olfactory receptor gene clusters and evolutionary chromosome rearrangements , 2005, Cytogenetic and Genome Research.

[112]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[113]  Barbara J. Trask,et al.  Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication , 2005, Nature.

[114]  David M. A. Martin,et al.  The Genome of the African Trypanosome Trypanosoma brucei , 2005, Science.

[115]  Dan Graur,et al.  GC composition of the human genome: in search of isochores. , 2005, Molecular biology and evolution.

[116]  M. Bucan,et al.  Promoter features related to tissue specificity as measured by Shannon entropy , 2005, Genome Biology.

[117]  K. J. Fryxell,et al.  CpG mutation rates in the human genome are highly dependent on local GC content. , 2005, Molecular biology and evolution.

[118]  M. Uhrberg The KIR gene family: life in the fast lane of evolution , 2005, European journal of immunology.

[119]  M. Hultén,et al.  Inter-sex variation in synaptonemal complex lengths largely determine the different recombination rates in male and female germ cells , 2004, Cytogenetic and Genome Research.

[120]  J. Comeron Faculty Opinions recommendation of Comparative recombination rates in the rat, mouse, and human genomes. , 2004 .

[121]  B. Trask,et al.  Species specificity in rodent pheromone receptor repertoires. , 2004, Genome research.

[122]  Jeffrey H. Chuang,et al.  Functional Bias and Spatial Organization of Genes in Mutational Hot and Cold Regions in the Human Genome , 2004, PLoS biology.

[123]  H. Riethman,et al.  Mapping and initial analysis of human subtelomeric sequence assemblies. , 2003, Genome research.

[124]  V. Corces,et al.  Extensive exon reshuffling over evolutionary time coupled to trans-splicing in Drosophila. , 2003, Genome research.

[125]  Toshiro Aigaki,et al.  Alternative splicing of lola generates 19 transcription factors controlling axon guidance in Drosophila , 2003, Nature Neuroscience.

[126]  Jianzhi Zhang,et al.  Evolutionary deterioration of the vomeronasal pheromone transduction pathway in catarrhine primates , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[127]  B. Trask,et al.  Complex evolution of 7E olfactory receptor genes in segmental duplications. , 2003, Genome research.

[128]  Xiangdong Fang,et al.  Locus control regions. , 2002, Blood.

[129]  Takashi Gojobori,et al.  In silico chromosome staining: Reconstruction of Giemsa bands from the whole human genome sequence , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[130]  B. Trask,et al.  Comparative sequencing of a multicopy subtelomeric region containing olfactory receptor genes reveals multiple interactions between non-homologous chromosomes. , 2001, Human molecular genetics.

[131]  Gustavo Glusman,et al.  The complete human olfactory subgenome. , 2001, Genome research.

[132]  M. Nachman,et al.  Estimate of the mutation rate per nucleotide in humans. , 2000, Genetics.

[133]  K. J. Fryxell,et al.  Cytosine deamination plays a primary role in the evolution of mammalian isochores. , 2000, Molecular biology and evolution.

[134]  Francisco Antequera,et al.  CpG islands as genomic footprints of promoters that are associated with replication origins , 1999, Current Biology.

[135]  B. Trask,et al.  Distribution of olfactory receptor genes in the human genome , 1998, Nature Genetics.

[136]  R. Mann,et al.  Why are Hox genes clustered? , 1997, BioEssays : news and reviews in molecular, cellular and developmental biology.

[137]  G. Holmquist,et al.  Organization of mutations along the genome: a prime determinant of genome evolution. , 1994, Trends in ecology & evolution.

[138]  G. Holmquist,et al.  Chromosome bands, their chromatin flavors, and their functional features. , 1992, American journal of human genetics.

[139]  A. Bird,et al.  The expected equilibrium of the CpG dinucleotide in vertebrate genomes under a mutation model. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[140]  Mary C. Rykowski,et al.  Human genome organization: Alu, LINES, and the molecular structure of metaphase chromosome bands , 1988, Cell.

[141]  A. Bird CpG-rich islands and the function of DNA methylation , 1986, Nature.

[142]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.

[143]  G. Bernardi,et al.  Isolation and characterization of mouse and guinea pig satellite deoxyribonucleic acids. , 1968, Biochemistry.

[144]  Erik Kaestner,et al.  The Origins Of Genome Architecture , 2016 .

[145]  A. R. Wagner Molecular Biology and Evolution , 2001 .

[146]  J. Filipski Evolution of DNA Sequence Contributions of Mutational Bias and Selection to the Origin of Chromosomal Compartments , 1990 .