Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition.

Numerous inbred mouse strains serve as models for human disease and diversity, but the molecular differences between them are mostly unknown. Several mammalian genomes have been assembled, providing a framework for identifying structural variations. To identify variants between inbred mouse strains at single-nucleotide resolution, we aligned 26 million individual sequence traces from four laboratory mouse strains to the C57BL/6J reference genome. We discovered and analyzed over 10,000 intermediate-length genomic variants (from 100 nucleotides to 10 kilobases) that distinguish these strains from the C57BL/6J reference. Approximately 85% of such variants are due to recent mobilization of endogenous retrotransposons, predominantly L1 elements, a fraction that greatly exceeds that reported in humans. Polymorphic L1 retrotransposons directly alter the structure and expression of many genes, among them Drosha (also called Rnasen), Parp8, Scn1a, and Arhgap15, as well as novel genes. L1 polymorphisms are distributed nonrandomly across the genome: they are significantly excluded from the X chromosome and from genes associated with the cell cycle, but are enriched in receptor genes. Thus, recent endogenous L1 retrotransposition has extensively diversified genomic structures and transcripts, distinguishing mouse lineages and driving a major portion of natural genetic variation.
