Heterogeneous Tempo and Mode of Conserved Noncoding Sequence Evolution among Four Mammalian Orders

Conserved noncoding sequences (CNSs) of vertebrates are considered to be closely linked with protein-coding gene regulatory functions. We examined the abundance and genomic distribution of CNSs in four mammalian orders: primates, rodents, carnivores, and cetartiodactyls. We defined the two thresholds for CNS using conservation level of coding genes; using all the three coding positions and using only first and second codon positions. The abundance of CNSs varied among lineages, with primates and rodents having highest and lowest number of CNSs, respectively, whereas carnivores and cetartiodactyls had intermediate values. These CNSs cover 1.3–5.5% of the mammalian genomes and have signatures of selective constraints that are stronger in more ancestral than the recent ones. Evolution of new CNSs as well as retention of ancestral CNSs contribute to the differences in abundance. The genomic distribution of CNSs is dynamic with higher proportions of rodent and primate CNSs located in the introns compared with carnivores and cetartiodactyls. In fact, 19% of orthologous single-copy CNSs between human and dog are located in different genomic regions. If CNSs can be considered as candidates of gene expression regulatory sequences, heterogeneity of CNSs among the four mammalian orders may have played an important role in creating the order-specific phenotypes. Fewer CNSs in rodents suggest that rodent diversity is related to lower regulatory conservation. With CNSs shown to cluster around genes involved in nervous systems and the higher number of primate CNSs, our result suggests that CNSs may be involved in the higher complexity of the primate nervous system.

[1]  Koichiro Tamura,et al.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. , 2013, Molecular biology and evolution.

[2]  R. B. Azevedo,et al.  On the Immortality of Television Sets: “Function” in the Human Genome According to the Evolution-Free Gospel of ENCODE , 2013, Genome biology and evolution.

[3]  Hani Z. Girgis,et al.  A High-Resolution Enhancer Atlas of the Developing Telencephalon , 2013, Cell.

[4]  Naruya Saitou,et al.  Vertebrate Paralogous Conserved Noncoding Sequences May Be Related to Gene Expressions in Brain , 2012, Genome biology and evolution.

[5]  Timothy Wells,et al.  A novel long-range enhancer regulates postnatal expression of Zeb2: implications for Mowat-Wilson syndrome phenotypes. , 2012, Human molecular genetics.

[6]  Michael Hiller,et al.  Hundreds of conserved non-coding genomic regions are independently lost in mammals , 2012, Nucleic acids research.

[7]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[8]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[9]  Lee E. Edsall,et al.  A map of the cis-regulatory sequences in the mouse genome , 2012, Nature.

[10]  Gabriel Kreiman,et al.  Integrated genome analysis suggests that most conserved non-coding sequences are regulatory factor binding sites , 2012, Nucleic acids research.

[11]  N. Saitou,et al.  Identification and Characterization of Lineage-Specific Highly Conserved Noncoding Sequences in Mammalian Genomes , 2012, Genome biology and evolution.

[12]  P. Wittkopp,et al.  Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence , 2011, Nature Reviews Genetics.

[13]  M. Nei,et al.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. , 2011, Molecular biology and evolution.

[14]  Ross C Hardison,et al.  What fraction of the human genome is functional? , 2011, Genome research.

[15]  S. Brenner,et al.  Ancient vertebrate conserved noncoding elements have been evolving rapidly in teleost fishes. , 2011, Molecular biology and evolution.

[16]  Y. Gondo,et al.  Reptiles and Mammals Have Differentially Retained Long Conserved Noncoding Sequences from the Amniote Ancestor , 2010, Genome biology and evolution.

[17]  C. Ponting,et al.  Massive turnover of functional sequence in human and other mammalian genomes. , 2010, Genome research.

[18]  G. K. Davis,et al.  Phenotypic robustness conferred by apparently redundant transcriptional enhancers , 2010, Nature.

[19]  Michael D. Wilson,et al.  Five-Vertebrate ChIP-seq Reveals the Evolutionary Dynamics of Transcription Factor Binding , 2010, Science.

[20]  G. Elgar Pan-vertebrate conserved non-coding sequences associated with developmental regulation. , 2009, Briefings in functional genomics & proteomics.

[21]  Tal Pupko,et al.  Rodent phylogeny revised: analysis of six nuclear genes from all major rodent clades , 2009, BMC Evolutionary Biology.

[22]  A. Visel,et al.  ChIP-seq accurately predicts tissue-specific activity of enhancers , 2009, Nature.

[23]  R. Vinton,et al.  Asymmetrical distribution of non-conserved regulatory sequences at PHOX2B is reflected at the ENCODE loci and illuminates a possible genome-wide trend , 2009, BMC Genomics.

[24]  J. Bloom,et al.  Screening reveals conserved and nonconserved transcriptional regulatory elements including an E3/E4 allele-dependent APOE coding region enhancer. , 2008, Genomics.

[25]  William Stafford Noble,et al.  Predicting Human Nucleosome Occupancy from Primary Sequence , 2008, PLoS Comput. Biol..

[26]  S. Carroll Evo-Devo and an Expanding Evolutionary Synthesis: A Genetic Theory of Morphological Evolution , 2008, Cell.

[27]  R C Hardison,et al.  Exponential decay of GC content detected by strand-symmetric substitution rates influences the evolution of isochore structure. , 2008, Molecular biology and evolution.

[28]  Eric D Green,et al.  Regulatory divergence modifies limb length between mammals. , 2008, Genes & development.

[29]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[30]  Axel Visel,et al.  Deletion of Ultraconserved Elements Yields Viable Mice , 2007, PLoS biology.

[31]  David Haussler,et al.  Human Genome Ultraconserved Elements Are Ultraselected , 2007, Science.

[32]  Klaudia Walter,et al.  Parallel evolution of conserved non-coding elements that target a common set of developmental regulatory genes from worms to humans , 2007, Genome Biology.

[33]  Andy Purvis,et al.  A higher-level MRP supertree of placental mammals , 2006, BMC Evolutionary Biology.

[34]  T. Peterson,et al.  Gene Conversion Between Direct Noncoding Repeats Promotes Genetic and Phenotypic Diversity at a Regulatory Locus of Zea mays (L.) , 2006, Genetics.

[35]  Tad S Sonstegard,et al.  Genomic divergences among cattle, dog and human estimated from large-scale alignments of genomic sequences , 2006, BMC Genomics.

[36]  Laurent Excoffier,et al.  Conserved noncoding sequences are selectively constrained and not mutation cold spots , 2006, Nature Genetics.

[37]  M. Daly,et al.  Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). , 2005, Genome research.

[38]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[39]  Boris Lenhard,et al.  Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes , 2004, BMC Genomics.

[40]  Klaudia Walter,et al.  Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2004, PLoS biology.

[41]  Michael P. Cummings,et al.  MEGA (Molecular Evolutionary Genetics Analysis) , 2004 .

[42]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[43]  Edward M. Rubin,et al.  Megabase deletions of gene deserts result in viable mice , 2004, Nature.

[44]  G. Wray Transcriptional regulation and the evolution of development. , 2003, The International journal of developmental biology.

[45]  A. Meyer,et al.  Recent Advances in the (Molecular) Phylogeny of Vertebrates , 2003 .

[46]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[47]  B. Oostra,et al.  A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. , 2003, Human molecular genetics.

[48]  Sridhar Hannenhalli,et al.  Enrichment of regulatory signals in conserved non-coding genomic sequence , 2001, Bioinform..

[49]  R. Hardison Conserved noncoding sequences are reliable guides to regulatory elements. , 2000, Trends in genetics : TIG.

[50]  D. Stern PERSPECTIVE: EVOLUTIONARY DEVELOPMENTAL BIOLOGY AND THE PROBLEM OF VARIATION , 2000, Evolution; international journal of organic evolution.

[51]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[52]  J Krushkal,et al.  Rates of nucleotide substitution in primates and rodents and the generation-time effect hypothesis. , 1996, Molecular phylogenetics and evolution.

[53]  L. Duret,et al.  Strong conservation of non-coding sequences during vertebrates evolution: potential involvement in post-transcriptional regulation of gene expression. , 1993, Nucleic acids research.

[54]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[55]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[56]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[57]  Albert J. Vilella,et al.  EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates. , 2009, Genome research.

[58]  Tanya Vavouri,et al.  Defining a genomic radius for long-range enhancer action: duplicated conserved non-coding elements hold the key. , 2006, Trends in genetics : TIG.

[59]  Peer Bork,et al.  Comparative architectures of mammalian and chicken genomes reveal highly variable rates of genomic rearrangements across different lineages. , 2005, Genome research.

[60]  Webb Miller,et al.  Evolution and functional classification of vertebrate gene deserts. , 2005, Genome research.

[61]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[62]  J. R. Scotti,et al.  Available From , 1973 .

[63]  L. Pauling,et al.  Evolutionary Divergence and Convergence in Proteins , 1965 .