A distal enhancer and an ultraconserved exon are derived from a novel retroposon

Hundreds of highly conserved distal cis-regulatory elements have been characterized so far in vertebrate genomes. Many thousands more are predicted on the basis of comparative genomics. However, in stark contrast to the genes that they regulate, in invertebrates virtually none of these regions can be traced by using sequence similarity, leaving their evolutionary origins obscure. Here we show that a class of conserved, primarily non-coding regions in tetrapods originated from a previously unknown short interspersed repetitive element (SINE) retroposon family that was active in the Sarcopterygii (lobe-finned fishes and terrestrial vertebrates) in the Silurian period at least 410 million years ago (ref. 4), and seems to be recently active in the ‘living fossil’ Indonesian coelacanth, Latimeria menadoensis. Using a mouse enhancer assay we show that one copy, 0.5 million bases from the neuro-developmental gene ISL1, is an enhancer that recapitulates multiple aspects of Isl1 expression patterns. Several other copies represent new, possibly regulatory, alternatively spliced exons in the middle of pre-existing Sarcopterygian genes. One of these, a more than 200-base-pair ultraconserved region, 100% identical in mammals, and 80% identical to the coelacanth SINE, contains a 31-amino-acid-residue alternatively spliced exon of the messenger RNA processing gene PCBP2 (ref. 6). These add to a growing list of examples in which relics of transposable elements have acquired a function that serves their host, a process termed ‘exaptation’, and provide an origin for at least some of the many highly conserved vertebrate-specific genomic sequences.

[1]  B. Mcclintock The origin and behavior of mutable loci in maize , 1950, Proceedings of the National Academy of Sciences.

[2]  R. Britten,et al.  Repetitive and Non-Repetitive DNA Sequences and a Speculation on the Origins of Evolutionary Novelty , 1971, The Quarterly Review of Biology.

[3]  S. Gould,et al.  Exaptation—a Missing Term in the Science of Form , 1982, Paleobiology.

[4]  J. Rossant,et al.  A transgene containing lacZ inserted into the dystonia locus is expressed in neural tube , 1988, Nature.

[5]  Wen-Hsiung Li,et al.  Fundamentals of molecular evolution , 1990 .

[6]  T. Jessell,et al.  Requirement for LIM Homeobox Gene Isl1 in Motor Neuron Generation Reveals a Motor Neuron– Dependent Step in Interneuron Differentiation , 1996, Cell.

[7]  John G Flanagan,et al.  Topographic Guidance Labels in a Sensory Projection to the Forebrain , 1998, Neuron.

[8]  Dan Graur,et al.  Fundamentals of Molecular Evolution, 2nd Edition , 2000 .

[9]  A. Eyre-Walker Fundamentals of Molecular Evolution (2nd edn) , 2000, Heredity.

[10]  S. Jang,et al.  Protein-protein interaction among hnRNPs shuttling between nucleus and cytoplasm. , 2000, Journal of molecular biology.

[11]  J. Livet,et al.  The branchial arches and HGF are growth-promoting and chemoattractant for cranial motor axons. , 2000, Development.

[12]  A. Weiner SINEs and LINEs: the art of biting the hand that feeds you. , 2002, Current opinion in cell biology.

[13]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[14]  S. Brenner,et al.  Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Batzer,et al.  Mammalian retroelements. , 2002, Genome research.

[16]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[17]  S. Liebhaber,et al.  The poly(C)-binding proteins: a multiplicity of functions and a search for mechanisms. , 2002, RNA.

[18]  S. Liebhaber,et al.  Identification of mRNAs Associated with αCP2-Containing RNP Complexes , 2003, Molecular and Cellular Biology.

[19]  W. Makałowski Not Junk After All , 2003, Science.

[20]  A. Chkheidze,et al.  A Novel Set of Nuclear Localization Signals Determine Distributions of the αCP RNA-Binding Proteins , 2003, Molecular and Cellular Biology.

[21]  Wojciech Makalowski,et al.  Genomics. Not junk after all. , 2003, Science.

[22]  Noam Shomron,et al.  The Birth of an Alternatively Spliced Exon: 3' Splice-Site Selection in Alu Exons , 2003, Science.

[23]  H. Kazazian Mobile Elements: Drivers of Genome Evolution , 2004, Science.

[24]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[25]  J. Brosius The Contribution of RNAs and Retroposition to Evolutionary Novelties , 2003, Genetica.

[26]  C. Amemiya,et al.  Genome resource for the Indonesian coelacanth, Latimeria menadoensis. , 2004, Journal of experimental zoology. Part A, Comparative experimental biology.

[27]  David Haussler,et al.  Into the heart of darkness: large-scale clustering of human non-coding DNA , 2004, ISMB/ECCB.

[28]  Klaudia Walter,et al.  Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2004, PLoS biology.

[29]  S. Higashijima,et al.  Comparative functional genomics revealed conservation and diversification of three enhancers of the isl1 gene for motor and sensory neuron-specific expression. , 2005, Developmental biology.

[30]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[31]  Yonghe Li,et al.  Striking differences of LDL receptor-related protein 1B expression in mouse and human. , 2005, Biochemical and biophysical research communications.

[32]  M. Nóbrega,et al.  In vivo characterization of a vertebrate ultraconserved enhancer. , 2005, Genomics.

[33]  H. Nagatsuka,et al.  Genetic and epigenetic alterations of BRG1 promote oral cancer development. , 2005, International journal of oncology.

[34]  Tanya Vavouri,et al.  Ancient duplicated conserved noncoding elements in vertebrates: a genomic and functional analysis. , 2006, Genome research.