A novel test for selection on cis-regulatory elements reveals positive and negative selection acting on mammalian transcriptional enhancers.

Measuring natural selection on genomic elements involved in the cis-regulation of gene expression--such as transcriptional enhancers and promoters--is critical for understanding the evolution of genomes, yet it remains a major challenge. Many studies have attempted to detect positive or negative selection in these noncoding elements by searching for those with the fastest or slowest rates of evolution, but this can be problematic. Here, we introduce a new approach to this issue, and demonstrate its utility on three mammalian transcriptional enhancers. Using results from saturation mutagenesis studies of these enhancers, we classified all possible point mutations as upregulating, downregulating, or silent, and determined which of these mutations have occurred on each branch of a phylogeny. Applying a framework analogous to Ka/Ks in protein-coding genes, we measured the strength of selection on upregulating and downregulating mutations, in specific branches as well as entire phylogenies. We discovered distinct modes of selection acting on different enhancers: although all three have experienced negative selection against downregulating mutations, the selection pressures on upregulating mutations vary. In one case, we detected positive selection for upregulation, whereas the other two had no detectable selection on upregulating mutations. Our methodology is applicable to the growing number of saturation mutagenesis data sets, and provides a detailed picture of the mode and strength of natural selection acting on cis-regulatory elements.

[1]  Hunter B. Fraser,et al.  Gene expression drives local adaptation in humans , 2013, Genome research.

[2]  M. Hurles,et al.  Fast-evolving noncoding sequences in the human genome , 2007, Genome Biology.

[3]  Philipp W. Messer,et al.  Strong Purifying Selection at Synonymous Sites in D. melanogaster , 2013, PLoS genetics.

[4]  J. Plotkin,et al.  Synonymous but not the same: the causes and consequences of codon bias , 2011, Nature Reviews Genetics.

[5]  Wei Zheng,et al.  Genetic Analysis of Variation in Transcription Factor Binding in Yeast , 2010, Nature.

[6]  A. E. Hirsh,et al.  Adjusting for selection on synonymous sites in estimates of evolutionary distance. , 2005, Molecular biology and evolution.

[7]  P. Andolfatto Controlling Type-I Error of the McDonald–Kreitman Test in Genomewide Scans for Selection on Noncoding DNA , 2008, Genetics.

[8]  P. Kwok,et al.  Functional Characterization of Liver Enhancers That Regulate Drug‐Associated Transporters , 2011, Clinical pharmacology and therapeutics.

[9]  Jay Shendure,et al.  High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis , 2009, Nature Biotechnology.

[10]  Michael D. Wilson,et al.  Species-Specific Transcription in Mice Carrying Human Chromosome 21 , 2008, Science.

[11]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[12]  Albert J. Vilella,et al.  A high-resolution map of human evolutionary constraint using 29 mammals , 2011, Nature.

[13]  Alan M. Moses,et al.  Statistical tests for natural selection on regulatory regions based on the strength of transcription factor binding sites , 2009, BMC Evolutionary Biology.

[14]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[15]  T. Mikkelsen,et al.  Rapid dissection and model-based optimization of inducible enhancers in human cells using a massively parallel reporter assay , 2012, Nature Biotechnology.

[16]  Kate E. Jones,et al.  The delayed rise of present-day mammals , 1990, Nature.

[17]  Jeffrey H. Chuang,et al.  The importance of being cis: evolution of orthologous fish and mammalian enhancer activity. , 2010, Molecular biology and evolution.

[18]  A. Eyre-Walker,et al.  Synonymous codon usage in Escherichia coli: selection for translational accuracy. , 2006, Molecular biology and evolution.

[19]  A. Kahn,et al.  In Vivo Functional Characterization of the Aldolase B Gene Enhancer* , 2002, The Journal of Biological Chemistry.

[20]  D. Haussler,et al.  Biased clustered substitutions in the human genome: the footprints of male-driven biased gene conversion. , 2007, Genome research.

[21]  D. Haussler,et al.  An RNA gene expressed during cortical development evolved rapidly in humans , 2006, Nature.

[22]  Ziheng Yang,et al.  Statistical methods for detecting molecular adaptation , 2000, Trends in Ecology & Evolution.

[23]  T. Hughes,et al.  Mapping pathways and phenotypes by systematic gene overexpression. , 2006, Molecular cell.

[24]  Olivier Fedrigo,et al.  Promoter regions of many neural- and nutrition-related genes have experienced positive selection during human evolution , 2007, Nature Genetics.

[25]  C. Zheng,et al.  ; 0 ; , 1951 .

[26]  Joseph B Hiatt,et al.  Activity-enhancing mutations in an E3 ubiquitin ligase identified by high-throughput mutagenesis , 2013, Proceedings of the National Academy of Sciences.

[27]  M. Gerstein,et al.  Variation in Transcription Factor Binding Among Humans , 2010, Science.

[28]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[29]  Barak A. Cohen,et al.  Complex effects of nucleotide variants in a mammalian cis-regulatory element , 2012, Proceedings of the National Academy of Sciences.

[30]  M. Kreitman,et al.  Adaptive protein evolution at the Adh locus in Drosophila , 1991, Nature.

[31]  Ari Löytynoja,et al.  webPRANK: a phylogeny-aware multiple sequence aligner with interactive alignment browser , 2010, BMC Bioinformatics.

[32]  Alex A. Pollen,et al.  The genomic basis of adaptive evolution in threespine sticklebacks , 2012, Nature.

[33]  Hunter B. Fraser,et al.  Differences in enhancer activity in mouse and zebrafish reporter assays are often associated with changes in gene expression , 2012, BMC Genomics.

[34]  P. Andolfatto,et al.  Methods to detect selection on noncoding DNA. , 2012, Methods in molecular biology.

[35]  D. Fowler,et al.  Deep mutational scanning: assessing protein function on a massive scale. , 2011, Trends in biotechnology.

[36]  S. Pääbo,et al.  Accelerated Evolution of Conserved Noncoding Sequences in Humans , 2006, Science.

[37]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[38]  T. Mitchell-Olds Faculty Opinions recommendation of Population genetic and phylogenetic evidence for positive selection on regulatory mutations at the factor VII locus in humans. , 2005 .

[39]  Ilan Gronau,et al.  Genome-wide inference of natural selection on human transcription factor binding sites , 2013, Nature Genetics.

[40]  Su Yeon Kim,et al.  Adaptive Evolution of Conserved Noncoding Elements in Mammals , 2007, PLoS genetics.

[41]  Joseph B Hiatt,et al.  Massively parallel functional dissection of mammalian enhancers in vivo , 2012, Nature Biotechnology.

[42]  Mathieu Blanchette,et al.  Ancestors 1.0: a web server for ancestral sequence reconstruction , 2010, Bioinform..

[43]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[44]  M. Kimura Preponderance of synonymous changes as evidence for the neutral theory of molecular evolution , 1977, Nature.

[45]  Ioannis Xenarios,et al.  T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension , 2011, Nucleic Acids Res..

[46]  C. Kallen,et al.  The mRNA‐Binding Protein Zfp36 Is Upregulated by β‐Adrenergic Stimulation and Represses IL‐6 Production in 3T3‐L1 Adipocytes , 2012, Obesity.

[47]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[48]  L. Duret,et al.  Adaptation or biased gene conversion? Extending the null hypothesis of molecular evolution. , 2007, Trends in genetics : TIG.

[49]  D. Baker,et al.  High Resolution Mapping of Protein Sequence–Function Relationships , 2010, Nature Methods.