PLOTREP: a web tool for defragmentation and visual analysis of dispersed genomic repeats

Identification of dispersed or interspersed repeats, most of which are derived from transposons, retrotransposons or retrovirus-like elements, is an important step in genome annotation. Software tools that compare genomic sequences with precompiled repeat reference libraries using sensitive similarity-based methods provide reliable means of finding the positions of fragments homologous to known repeats. However, their output is often incomplete and fragmented owing to the mutations (nucleotide substitutions, deletions or insertions) that can result in considerable divergence from the reference sequence. Merging these fragments to identify the whole region that represents an ancient copy of a mobile element is challenging, particularly if the element is large and suffered multiple deletions or insertions. Here we report PLOTREP, a tool designed to post-process results obtained by sequence similarity search and merge fragments belonging to the same copy of a repeat. The software allows rapid visual inspection of the results using a dot-plot like graphical output. The web implementation of PLOTREP is available at .

[1]  Jianxin Ma,et al.  Genomic sequencing reveals gene content, genomic organization, and recombination relationships in barley , 2002, Functional & Integrative Genomics.

[2]  J. Jurka Repbase update: a database and an electronic journal of repetitive elements. , 2000, Trends in genetics : TIG.

[3]  Guojun Yang,et al.  MAK, a computational tool kit for automated MITE analysis , 2003, Nucleic Acids Res..

[4]  Pierre Sourdille,et al.  Molecular Basis of Evolutionary Events That Shaped the Hardness Locus in Diploid and Polyploid Wheat Species (Triticum and Aegilops)w⃞ , 2005, The Plant Cell Online.

[5]  C. Robin Buell,et al.  The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants , 2004, Nucleic Acids Res..

[6]  Jerzy Jurka,et al.  Censor - a Program for Identification and Elimination of Repetitive Elements From DNA Sequences , 1996, Comput. Chem..

[7]  Pavel Neumann,et al.  Highly abundant pea LTR retrotransposon Ogre is constitutively transcribed and partially spliced , 2003, Plant Molecular Biology.

[8]  J. Bennetzen,et al.  Plant retrotransposons. , 1999, Annual review of genetics.

[9]  J. Jurka,et al.  Repeats in genomic DNA: mining and meaning. , 1998, Current opinion in structural biology.

[10]  Pierre Sourdille,et al.  Updating of transposable element annotations from large wheat genomic sequences reveals diverse activities and gene associations , 2005, Molecular Genetics and Genomics.

[11]  R. Durbin,et al.  A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. , 1995, Gene.

[12]  Beat Keller,et al.  CACTA Transposons in Triticeae. A Diverse Family of High-Copy Repetitive Elements1 , 2003, Plant Physiology.

[13]  Z. Tu,et al.  Eight novel families of miniature inverted repeat transposable elements in the African malaria mosquito, Anopheles gambiae. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  John F. McDonald,et al.  LTR_STRUC: a novel search and identification program for LTR retrotransposons , 2003, Bioinform..

[15]  J. Bennetzen,et al.  Nested Retrotransposons in the Intergenic Regions of the Maize Genome , 1996, Science.

[16]  A. Schulman,et al.  Envelope-class retrovirus-like elements are widespread, transcribed and spliced, and insertionally polymorphic in plants. , 2001, Genome research.

[17]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[18]  H. Kazazian Mobile Elements: Drivers of Genome Evolution , 2004, Science.

[19]  Cédric Feschotte,et al.  Plant transposable elements: where genetics meets genomics , 2002, Nature Reviews Genetics.

[20]  Yong Qiang Gu,et al.  Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes , 2004, Plant Physiology.

[21]  Peer Bork,et al.  BLAST2GENE: a comprehensive conversion of BLAST output into independent genes and gene fragments , 2004, Bioinform..

[22]  S. Eddy,et al.  Automated de novo identification of repeat sequence families in sequenced genomes. , 2002, Genome research.

[23]  Eviatar Nevo,et al.  Retrotransposon BARE-1 and Its Role in Genome Evolution in the Genus Hordeum , 1999, Plant Cell.

[24]  Richard M. Bruskiewich,et al.  Transposable element annotation of the rice genome , 2004, Bioinform..

[25]  Pavel A. Pevzner,et al.  De novo identification of repeat families in large genomes , 2005, ISMB.

[26]  Eugene W. Myers,et al.  PILER: identification and classification of genomic repeats , 2005, ISMB.

[27]  David A Wright,et al.  Athila4 of Arabidopsis and Calypso of soybean define a lineage of endogenous plant retroviruses. , 2002, Genome research.