Early Evolution of Conserved Regulatory Sequences Associated with Development in Vertebrates

Comparisons between diverse vertebrate genomes have uncovered thousands of highly conserved non-coding sequences, an increasing number of which have been shown to function as enhancers during early development. Despite their extreme conservation over 500 million years from humans to cartilaginous fish, these elements appear to be largely absent in invertebrates, and, to date, there has been little understanding of their mode of action or the evolutionary processes that have modelled them. We have now exploited emerging genomic sequence data for the sea lamprey, Petromyzon marinus, to explore the depth of conservation of this type of element in the earliest diverging extant vertebrate lineage, the jawless fish (agnathans). We searched for conserved non-coding elements (CNEs) at 13 human gene loci and identified lamprey elements associated with all but two of these gene regions. Although markedly shorter and less well conserved than within jawed vertebrates, identified lamprey CNEs are able to drive specific patterns of expression in zebrafish embryos, which are almost identical to those driven by the equivalent human elements. These CNEs are therefore a unique and defining characteristic of all vertebrates. Furthermore, alignment of lamprey and other vertebrate CNEs should permit the identification of persistent sequence signatures that are responsible for common patterns of expression and contribute to the elucidation of the regulatory language in CNEs. Identifying the core regulatory code for development, common to all vertebrates, provides a foundation upon which regulatory networks can be constructed and might also illuminate how large conserved regulatory sequence blocks evolve and become fixed in genomic DNA.

[1]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[2]  N. Holland,et al.  Expression of AmphiCoe, an amphioxus COE/EBF gene, in the developing central nervous system and epidermal sensory neurons , 2004, Genesis.

[3]  A. Vincent,et al.  collier, a novel regulator of Drosophila head development, is expressed in a single mitotic domain , 1996, Current Biology.

[4]  P. Bovolenta,et al.  Comprehensive characterization of the cis-regulatory code responsible for the spatio-temporal expression of olSix3.2 in the developing medaka forebrain , 2007, Genome Biology.

[5]  F. Delsuc,et al.  Tunicates and not cephalochordates are the closest living relatives of vertebrates , 2006, Nature.

[6]  Tanya Vavouri,et al.  Ancient duplicated conserved noncoding elements in vertebrates: a genomic and functional analysis. , 2006, Genome research.

[7]  B. Ye,et al.  unc-3, a gene required for axonal guidance in Caenorhabditis elegans, encodes a member of the O/E family of transcription factors. , 1998, Development.

[8]  Sarah F. Smith,et al.  Highly conserved regulatory elements around the SHH gene may contribute to the maintenance of conserved synteny across human chromosome 7q36.3. , 2005, Genomics.

[9]  A. Visel,et al.  Ultraconservation identifies a small subset of extremely constrained developmental enhancers , 2008, Nature Genetics.

[10]  M. Bronner‐Fraser,et al.  Conservation of Pax gene expression in ectodermal placodes of the lamprey. , 2002, Gene.

[11]  Nicholas H. Putnam,et al.  The amphioxus genome illuminates vertebrate origins and cephalochordate biology. , 2008, Genome research.

[12]  Paul Richardson,et al.  The Draft Genome of Ciona intestinalis: Insights into Chordate and Vertebrate Origins , 2002, Science.

[13]  M. Brand,et al.  Characterization of three novel members of the zebrafish Pax2/5/8 family: dependency of Pax5 and Pax8 expression on the Pax2.1 (noi) function. , 1998, Development.

[14]  Justin Johnson,et al.  Ancient Noncoding Elements Conserved in the Human Genome , 2006, Science.

[15]  T. Sauka-Spengler,et al.  Insights From a Sea Lamprey Into the Evolution of Neural Crest Gene Regulatory Network , 2008, The Biological Bulletin.

[16]  K. Grzeschik,et al.  Human GLI3 Intragenic Conserved Non-Coding Sequences Are Tissue-Specific Enhancers , 2007, PloS one.

[17]  M. Mattei,et al.  Family of Ebf/Olf‐1‐related genes potentially involved in neuronal differentiation and regional specification in the central nervous system , 1997, Developmental dynamics : an official publication of the American Association of Anatomists.

[18]  Alan M. Moses,et al.  In vivo enhancer analysis of human conserved non-coding sequences , 2006, Nature.

[19]  Chuong B. Do,et al.  Access the most recent version at doi: 10.1101/gr.926603 References , 2003 .

[20]  S. Hedges,et al.  Molecular phylogeny and divergence times of deuterostome animals. , 2005, Molecular biology and evolution.

[21]  J. Tena,et al.  A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts. , 2005, Genome research.

[22]  Jeffrey S. Levinton,et al.  Molecular Evidence for Deep Precambrian Divergences Among Metazoan Phyla , 1996, Science.

[23]  C. Amemiya,et al.  Evolutionary constraint on Otx2 neuroectoderm enhancers-deep conservation from skate to mouse and unique divergence in teleost , 2006, Proceedings of the National Academy of Sciences.

[24]  Vincent Laudet,et al.  Analysis of lamprey and hagfish genes reveals a complex history of gene duplications during early vertebrate evolution. , 2002, Molecular biology and evolution.

[25]  Klaudia Walter,et al.  Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2004, PLoS biology.

[26]  M. Nóbrega,et al.  Scanning Human Gene Deserts for Long-Range Enhancers , 2003, Science.

[27]  Gill Bejerano,et al.  Ultraconserved elements in insect genomes: a highly conserved intronic sequence implicated in the control of homothorax mRNA splicing. , 2005, Genome research.

[28]  C. Tickle,et al.  Long-range conserved non-coding SHOX sequences regulate expression in developing chicken limb and are associated with short stature phenotypes in human patients. , 2006, Human molecular genetics.

[29]  Boris Lenhard,et al.  Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes , 2004, BMC Genomics.

[30]  Nicholas H. Putnam,et al.  The amphioxus genome and the evolution of the chordate karyotype , 2008, Nature.

[31]  Klaudia Walter,et al.  Parallel evolution of conserved non-coding elements that target a common set of developmental regulatory genes from worms to humans , 2007, Genome Biology.

[32]  Charles E. Chapple,et al.  Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype , 2004, Nature.

[33]  M. Goulding,et al.  PAX2 is expressed in multiple spinal cord interneurons, including a population of EN1+ interneurons that require PAX6 for their development. , 1997, Development.

[34]  K. Robertson,et al.  An EBF3-mediated transcriptional program that induces cell cycle arrest and apoptosis. , 2006, Cancer research.

[35]  N. M. Brooke,et al.  A molecular timescale for vertebrate evolution , 1998, Nature.

[36]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[37]  Maximilian Muenke,et al.  A functional screen for sonic hedgehog regulatory elements across a 1 Mb interval identifies long-range ventral forebrain enhancers , 2006, Development.

[38]  Sonja J. Prohaska,et al.  Independent Hox-cluster duplications in lampreys. , 2003, Journal of experimental zoology. Part B, Molecular and developmental evolution.

[39]  L. Holland,et al.  The Ciona intestinalis genome: when the constraints are off. , 2003, BioEssays : news and reviews in molecular, cellular and developmental biology.

[40]  K. Grzeschik,et al.  Ultraconserved non‐coding sequence element controls a subset of spatiotemporal GLI3 expression , 2007, Development, growth & differentiation.

[41]  Bin Ma,et al.  PatternHunter: faster and more sensitive homology search , 2002, Bioinform..