Automated characterization of potentially active retroid agents in the human genome.

Retroid agents are genomes that encode the reverse transcriptase (RT) and replicate by way of an RNA intermediate. Some retroid agents are implicated in disease via insertional mutagenesis, while others have been found to encode proteins essential to primate reproduction or provide regulatory sequences for host cell processes. The Genome Parsing Suite (GPS), a generic multistep automated process, was developed to characterize all RT-like sequences in the human genome database and to annotate the gene complement of the retroid agents that encode these sequences. In this report the GPS analyzes all significant WU-tBLASTn hits returned for 30 representative RT queries. A total of 128,779 unique RT signals were identified, and 7594 of these were retrieved by RTs not previously reported in the human genome. We have identified 9652 full-length long interspersed nuclear elements (LINEs). Only 159 LINEs are without stop codons or frameshifts.

[1]  R. Hull Classifying reverse transcribing elements: a proposal and a challenge to the ICTV , 2001, Archives of Virology.

[2]  J. F. Atkins,et al.  Recoding: dynamic reprogramming of translation. , 1996, Annual review of biochemistry.

[3]  M. A. McClure,et al.  Computer analysis of retroviral pol genes: assignment of enzymatic functions to specific sequences and homologies with nonviral enzymes. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Michael Tristem,et al.  Identification and Characterization of Novel Human Endogenous Retrovirus Families by Phylogenetic Screening of the Human Genome Mapping Project Database , 2000, Journal of Virology.

[5]  Jef D. Boeke,et al.  Human L1 Retrotransposition: cisPreference versus trans Complementation , 2001, Molecular and Cellular Biology.

[6]  I. Kanazawa,et al.  An ancient retrotransposal insertion causes Fukuyama-type congenital muscular dystrophy , 1998, Nature.

[7]  N. Tordo,et al.  Sequence comparison of five polymerases (L proteins) of unsegmented negative-strand RNA viruses: theoretical assignment of functional domains. , 1990, The Journal of general virology.

[8]  A. Ballabio,et al.  LINE-1 elements at the sites of molecular rearrangements in Alport syndrome-diffuse leiomyomatosis. , 1999, American journal of human genetics.

[9]  A. Nakamura,et al.  Insertional mutation by transposable element, L1, in the DMD gene results in X-linked dilated cardiomyopathy. , 1998, Human molecular genetics.

[10]  Kowalski,et al.  Low Identity, Low Similarity Protein Sequences: Independent Modeling of the Ordered-Series-of-Motifs and Motif-Intervening-Regions. , 1998, Genome informatics. Workshop on Genome Informatics.

[11]  A. Perl,et al.  Polymorphic genotypes of the HRES-1 human endogenous retrovirus locus correlate with systemic lupus erythematosus and autoreactivity , 1999, Immunogenetics.

[12]  Bernard Mandrand,et al.  An Envelope Glycoprotein of the Human Endogenous Retrovirus HERV-W Is Expressed in the Human Placenta and Fuses Cells Expressing the Type D Mammalian Retrovirus Receptor , 2000, Journal of Virology.

[13]  M. Meisler,et al.  Expression of the human amylase genes: recent origin of a salivary amylase promoter from an actin pseudogene. , 1988, Nucleic acids research.

[14]  H. Temin Retrons in bacteria , 1989, Nature.

[15]  Shun–ichi Kobayashi Tokyo campus rising , 1998, Nature.

[16]  Jerzy Jurka,et al.  HERVd: the Human Endogenous RetroViruses Database: update , 2004, Nucleic Acids Res..

[17]  J. Mccoy,et al.  Syncytin is a captive retroviral envelope protein involved in human placental morphogenesis , 2000, Nature.

[18]  D. Giedroc,et al.  Equilibrium unfolding pathway of an H-type RNA pseudoknot which promotes programmed −1 ribosomal frameshifting1 , 1999, Journal of Molecular Biology.

[19]  L. Brakier-Gingras,et al.  Characterization of the frameshift stimulatory signal controlling a programmed –1 ribosomal frameshift in the human immunodeficiency virus type 1 , 2002, Nucleic acids research.

[20]  K. Usadel,et al.  An endogenous retroviral long terminal repeat at the HLA-DQB1 gene locus confers susceptibility to rheumatoid arthritis. , 1999, Human immunology.

[21]  D. Mattson,et al.  Human T-cell lymphotropic virus (HTLV)-related endogenous sequence, HRES-1, encodes a 28-kDa protein: a possible autoantigen for HTLV-I gag-reactive autoantibodies. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[22]  L. Duret,et al.  The endogenous retroviral locus ERVWE1 is a bona fide gene involved in hominoid placental physiology. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[23]  K. Lowenhaupt,et al.  Drosophila telomeres: new views on chromosome evolution. , 1996, Trends in genetics : TIG.

[24]  C. Meischl,et al.  A new exon created by intronic insertion of a rearranged LINE-1 element as the cause of chronic granulomatous disease , 2000, European Journal of Human Genetics.

[25]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[26]  Tim Hubbard Finishing the euchromatic sequence of the human genome , 2004 .

[27]  A. Perl,et al.  Detection and cloning of new HTLV-related endogenous sequences in man. , 1989, Nucleic acids research.

[28]  E. Domingo,et al.  Origin and Evolution of Viruses , 2010, Virus Genes.

[29]  J. V. Moran,et al.  Hot L1s account for the bulk of retrotransposition in the human population , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  R. Hehlmann,et al.  Retrovirus-like particles from the human T47D cell line are related to mouse mammary tumour virus and are of human endogenous origin. , 1992, The Journal of general virology.

[31]  J Schröder,et al.  Retroviral RNA identified in the cerebrospinal fluids and brains of individuals with schizophrenia , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[33]  J. Landry,et al.  Long Terminal Repeats Are Used as Alternative Promoters for the Endothelin B Receptor and Apolipoprotein C-I Genes in Humans* , 2001, The Journal of Biological Chemistry.

[34]  Jef D Boeke,et al.  High Frequency Retrotransposition in Cultured Mammalian Cells , 1996, Cell.

[35]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[36]  Mcclure Ma,et al.  The effects of ordered-series-of-motifs anchoring and sub-class modeling on the generation of HMMs representing highly divergent protein sequences. , 1998 .

[37]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[38]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[39]  Pavel V Baranov,et al.  Recoding: translational bifurcations in gene expression. , 2002, Gene.

[40]  J. V. Moran,et al.  Many human L1 elements are capable of retrotransposition , 1997, Nature Genetics.

[41]  E. Ostertag,et al.  SVA elements are nonautonomous retrotransposons that cause disease in humans. , 2003, American journal of human genetics.

[42]  A. Smit Interspersed repeats and other mementos of transposable elements in mammalian genomes. , 1999, Current opinion in genetics & development.

[43]  A. M. Marcella,et al.  The Retroid Agents , 1999 .

[44]  R. Lawn,et al.  Apolipoprotein(a) Gene Enhancer Resides within a LINE Element* , 1998, The Journal of Biological Chemistry.

[45]  J. Haber,et al.  Capture of retrotransposon DNA at the sites of chromosomal double-strand breaks , 1996, Nature.

[46]  H. Temin Reverse transcription in the eukaryotic genome: retroviruses, pararetroviruses, retrotransposons, and retrotranscripts. , 1985, Molecular biology and evolution.

[47]  V. Najfeld,et al.  Identification of a proviral structure in human breast cancer. , 2001, Cancer research.

[48]  C. Pleij,et al.  Analysis of the role of the pseudoknot component in the SRV-1 gag-pro ribosomal frameshift signal: loop lengths and stability of the stem regions. , 1995, RNA.

[49]  M. A. McClure,et al.  Sequence comparisons of retroviral proteins: relative rates of change and general phylogeny. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[50]  D. Kordis,et al.  Unusual horizontal transfer of a long interspersed nuclear element between distant vertebrate classes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[51]  M. Martin,et al.  Identification and cloning of endogenous retroviral sequences present in human DNA. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Y. Sakaki,et al.  Identification of critical CpG sites for repression of L1 transcription by DNA methylation. , 1997, Gene.

[53]  Rajasekhar Raman,et al.  Parameterization studies of hidden Markov models representing highly divergent protein sequences , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[54]  A. Burt,et al.  Long-term reinfection of the human genome by endogenous retroviruses. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[55]  R. Hehlmann,et al.  HERV-IP-T47D, a novel type C-related human endogenous retroviral sequence derived from T47D particles. , 2000, AIDS research and human retroviruses.

[56]  M. A. McClure Evolution of retroposons by acquisition or deletion of retrovirus-like genes. , 1991, Molecular biology and evolution.

[57]  M. A. McClure,et al.  Origins and Evolutionary Relationships of Retroviruses , 1989, The Quarterly Review of Biology.

[58]  T. Steitz,et al.  Crystal structure at 3.5 A resolution of HIV-1 reverse transcriptase complexed with an inhibitor. , 1992, Science.