Genome ARTIST: a robust, high-accuracy aligner tool for mapping transposon insertions and self-insertions

A critical step in insertional mutagenesis experiments performed on model organisms is mapping the insertions of artificial transposons (ATs) with nucleotide-level accuracy. Mapping errors may occur when sequencing artifacts or mutations such as SNPs and small indels lie very close to the junction between a genomic sequence and a transposon inverted repeat (TIR). Another specific task in insertional mutagenesis is the mapping of transposon self-insertions; to the best of our knowledge, no publicly available mapping tool is designed to analyze such molecular events. We developed Genome ARTIST, a pairwise gapped aligner that addresses both issues by means of an original, robust mapping strategy. Genome ARTIST is not designed to handle NGS data but to analyze AT insertions obtained in small- to medium-scale mutagenesis experiments. It employs a heuristic approach to find DNA sequence similarities and uses a multi-step implementation of an adapted Smith-Waterman algorithm to compute the mapping alignments. The user experience is enhanced by easily customizable parameters and a user-friendly interface that describes the genomic landscape surrounding the insertion. Genome ARTIST works with many bacterial and eukaryotic genomes available in the Ensembl and GenBank repositories. The tool specifically exploits the sequence annotation data provided by FlyBase for Drosophila melanogaster (the fruit fly), which enables mapping of insertions relative to various genomic features such as natural transposons. Genome ARTIST was tested against other alignment tools using relevant query sequences derived from the D. melanogaster and Mus musculus (mouse) genomes. Real and simulated query sequences were also compared, showing that Genome ARTIST is a very robust solution for mapping transposon insertions.
Genome ARTIST is a stand-alone, user-friendly application designed for high-accuracy mapping of transposon insertions and self-insertions. The tool is also useful for routine alignment tasks such as detecting SNPs or checking the specificity of primers and probes. Genome ARTIST is open-source software and is available for download at www.genomeartist.ro and at www.bioinformatics.org.
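
To illustrate the kind of local alignment that underlies junction mapping, the following is a minimal sketch of the textbook Smith-Waterman algorithm with a linear gap penalty. It is not Genome ARTIST's multi-step adapted implementation, which is not described here. The function name `smith_waterman`, the scoring values, and the toy sequences are assumptions chosen for illustration only.

```python
def smith_waterman(query, target, match=2, mismatch=-1, gap=-2):
    """Textbook Smith-Waterman local alignment with a linear gap penalty.

    Returns (best_score, query_start, query_end, target_start, target_end),
    using 0-based, half-open coordinates. Scoring values are illustrative
    assumptions, not the parameters used by Genome ARTIST.
    """
    rows, cols = len(query) + 1, len(target) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    # Fill the dynamic-programming matrix; cells never drop below zero.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if query[i - 1] == target[j - 1] else mismatch
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + sub,  # match or mismatch
                score[i - 1][j] + gap,      # gap in target
                score[i][j - 1] + gap,      # gap in query
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero-score cell is reached.
    i, j = best_pos
    while i > 0 and j > 0 and score[i][j] > 0:
        sub = match if query[i - 1] == target[j - 1] else mismatch
        if score[i][j] == score[i - 1][j - 1] + sub:
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    return best, i, best_pos[0], j, best_pos[1]


# Toy example (hypothetical sequences): a read made of a genomic flank
# followed by a transposon terminal fragment.
genomic_flank = "GGCTTAAGCCGTA"
tir_fragment = "CATGATGAAATAACAT"
read = genomic_flank + tir_fragment

result = smith_waterman(tir_fragment, read)
junction = result[3]  # read coordinate where the transposon fragment starts
```

In this toy case the start of the transposon fragment in the read marks the genome/transposon junction. A mismatch or small indel close to that boundary can shorten the local alignment, which is the source of the mapping errors discussed above.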
