Molecular archeology of L1 insertions in the human genome

BackgroundAs the rough draft of the human genome sequence nears a finished product and other genome-sequencing projects accumulate sequence data exponentially, bioinformatics is emerging as an important tool for studies of transposon biology. In particular, L1 elements exhibit a variety of sequence structures after insertion into the human genome that are amenable to computational analysis. We carried out a detailed analysis of the anatomy and distribution of L1 elements in the human genome using a new computer program, TSDfinder, designed to identify transposon boundaries precisely.ResultsStructural variants of L1 elements shared similar trends in the length and quality of their target site duplications (TSDs) and poly(A) tails. Furthermore, we found no correlation between the composition and genomic location of the pre-insertion locus and the resulting anatomy of the L1 insertion. We verified that L1 insertions with TSDs have the 5'-TTAAAA-3' cleavage site associated with L1 endonuclease activity. In addition, the second target DNA cut required for L1 insertion weakly matches the consensus pattern TTAAAA. On the other hand, the L1-internal breakpoints of deleted and inverted L1 elements do not resemble L1 endonuclease cleavage sites. Finally, the genome sequence data indicate that whereas singly inverted elements are common, doubly inverted elements are almost never found.ConclusionsThe sequence data give no indication that the creation of L1 structural variants depends on characteristics of the insertion locus. In addition, the formation of 5' truncated and 5' inverted L1s are probably not due to the action of the L1 endonuclease.

[1]  J. Boeke,et al.  Targeting of human retrotransposon integration is directed by the specificity of the L1 endonuclease for regions of unusual DNA structure. , 1998, Biochemistry.

[2]  S. Boissinot,et al.  Adaptive evolution in LINE-1 retrotransposons. , 2001, Molecular biology and evolution.

[3]  D. Mccormick Sequence the Human Genome , 1986, Bio/Technology.

[4]  D. Page,et al.  Four evolutionary strata on the human X chromosome. , 1999, Science.

[5]  J. V. Moran,et al.  Many human L1 elements are capable of retrotransposition , 1997, Nature Genetics.

[6]  R. E. Thayer,et al.  LINE-1: a human transposable element. , 1993, Gene.

[7]  J. Jurka,et al.  Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Boissinot,et al.  L1 (LINE-1) retrotransposon evolution and amplification in recent human history. , 2000, Molecular biology and evolution.

[9]  Marie-Christine Chaboissier,et al.  Retrotransposition of the I factor, a non-long terminal repeat retrotransposon of Drosophila, generates tandem repeats at the 3' end , 2000, Nucleic Acids Res..

[10]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[11]  H. Nishio,et al.  Insertion of a 5' truncated L1 element into the 3' end of exon 44 of the dystrophin gene resulted in skipping of the exon during splicing in a case of Duchenne muscular dystrophy. , 1993, The Journal of clinical investigation.

[12]  M. Boguski,et al.  Frequent human genomic DNA transduction driven by LINE-1 retrotransposition. , 2000, Genome research.

[13]  M. Hattori,et al.  Sequence analysis of a Kpnl family member near the 3′ end of human β-globin gene , 1985 .

[14]  W. Berger,et al.  Full-length human L1 insertions retain the capacity for high frequency retrotransposition in cultured cells. , 1999, Human molecular genetics.

[15]  Thierry Heidmann,et al.  Human LINE retrotransposons generate processed pseudogenes , 2000, Nature Genetics.

[16]  J. Boeke,et al.  Target DNA chromatinization modulates nicking by L1 endonuclease. , 2001, Nucleic acids research.

[17]  T. Heidmann,et al.  Functional differences between the human LINE retrotransposon and retroviral reverse transcriptases for in vivo mRNA reverse transcription , 1997, The EMBO journal.

[18]  E. Ostertag,et al.  Transduction of 3'-flanking sequences is common in L1 retrotransposition. , 2000, Human molecular genetics.

[19]  A. F. Scott,et al.  Isolation of an active human transposable element. , 1991, Science.

[20]  Jef D Boeke,et al.  Human L1 Retrotransposon Encodes a Conserved Endonuclease Required for Retrotransposition , 1996, Cell.

[21]  Y. Ilyin,et al.  jockey, a mobile drosophila element similar to mammalian LINEs, is transcribed from the internal promoter by RNA polymerase II , 1988, Cell.

[22]  R. Deberardinis,et al.  Rapid amplification of a retrotransposon subfamily is evolving the mouse genome , 1998, Nature Genetics.

[23]  Jef D Boeke,et al.  High Frequency Retrotransposition in Cultured Mammalian Cells , 1996, Cell.

[24]  M F Singer,et al.  LINE-1: a mammalian transposable element. , 1987, Biochimica et biophysica acta.

[25]  M. Hattori,et al.  Identification of an internal cis-element essential for the human L1 transcription and a nuclear factor(s) binding to the element. , 1992, Nucleic acids research.

[26]  C. Schmid,et al.  Recently transposed Alu repeats result from multiple source genes. , 1990, Nucleic acids research.

[27]  H H Kazazian,et al.  HUGO—a midlife crisis? , 1998, Nature Genetics.

[28]  A. Smit,et al.  The origin of interspersed repeats in the human genome. , 1996, Current opinion in genetics & development.

[29]  J. Boeke,et al.  Reverse transcriptase encoded by a human transposable element. , 1991, Science.

[30]  A. F. Scott,et al.  Two additional potential retrotransposons isolated from a human L1 subfamily that contains an active retrotransposable element. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[31]  T. Eickbush Exon Shuffling in Retrospect , 1999, Science.

[32]  T. Eickbush,et al.  Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: A mechanism for non-LTR retrotransposition , 1993, Cell.

[33]  E. Ostertag,et al.  Twin priming: a proposed mechanism for the creation of inversions in L1 retrotransposition. , 2001, Genome research.

[34]  S. Boissinot,et al.  Selection against deleterious LINE-1-containing loci in the human lineage. , 2001, Molecular biology and evolution.

[35]  J. Boeke,et al.  Retrotransposon R1Bm endonuclease cleaves the target sequence. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[36]  A. Smit,et al.  Ancestral, mammalian-wide subfamilies of LINE-1 repetitive sequences. , 1995, Journal of molecular biology.

[37]  S. Scherer,et al.  Germline and somatic mutations in the tyrosine kinase domain of the MET proto-oncogene in papillary renal carcinomas , 1997, Nature Genetics.

[38]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[39]  M. Hattori,et al.  Sequence analysis of a KpnI family member near the 3' end of human beta-globin gene. , 1985, Nucleic acids research.

[40]  A. Smit Interspersed repeats and other mementos of transposable elements in mammalian genomes. , 1999, Current opinion in genetics & development.

[41]  M. Singer,et al.  The human LINE-1 reverse transcriptase:effect of deletions outside the common reverse transcriptase domain. , 1998, Nucleic acids research.

[42]  A. F. Scott,et al.  Origin of the human L1 elements: Proposed progenitor genes deduced from a consensus DNA sequence☆ , 1987, Genomics.

[43]  Giovanni Parmigiani,et al.  Human L1 Retrotransposition Is Associated with Genetic Instability In Vivo , 2002, Cell.

[44]  A. Troxel,et al.  Genomic characterization of recent human LINE-1 insertions: evidence supporting random insertion. , 2001, Genome research.

[45]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[46]  J. Boeke,et al.  An in vivo assay for the reverse transcriptase of human retrotransposon L1 in Saccharomyces cerevisiae , 1994, Molecular and cellular biology.

[47]  R. Paro,et al.  The molecular basis of I-R hybrid Dysgenesis in drosophila melanogaster: Identification, cloning, and properties of the I factor , 1984, Cell.

[48]  H. Willard,et al.  A first-generation X-inactivation profile of the human X chromosome. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[49]  J. Weber,et al.  Alu repeats: a source for the genesis of primate microsatellites. , 1995, Genomics.

[50]  J A Bailey,et al.  Molecular evidence for a relationship between LINE-1 elements and X chromosome inactivation: the Lyon repeat hypothesis. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[51]  H. Kazazian,et al.  A new retrotransposable human L1 element from the LRE2 locus on chromosome 1q produces a chimaeric insertion , 1994, Nature Genetics.

[52]  G. Swergold Identification, characterization, and cell specificity of a human LINE-1 promoter , 1990, Molecular and cellular biology.