SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations

SNiPlay is a web-based tool for detection, management and analysis of genetic variants including both single nucleotide polymorphisms (SNPs) and InDels. Version 3 now extends functionalities in order to easily manage and exploit SNPs derived from next generation sequencing technologies, such as GBS (genotyping by sequencing), WGRS (whole gre-sequencing) and RNA-Seq technologies. Based on the standard VCF (variant call format) format, the application offers an intuitive interface for filtering and comparing polymorphisms using user-defined sets of individuals and then establishing a reliable genotyping data matrix for further analyses. Namely, in addition to the various scaled-up analyses allowed by the application (genomic annotation of SNP, diversity analysis, haplotype reconstruction and network, linkage disequilibrium), SNiPlay3 proposes new modules for GWAS (genome-wide association studies), population stratification, distance tree analysis and visualization of SNP density. Additionally, we developed a suite of Galaxy wrappers for each step of the SNiPlay3 process, so that the complete pipeline can also be deployed on a Galaxy instance using the Galaxy ToolShed procedure and then be computed as a Galaxy workflow. SNiPlay is accessible at http://sniplay.southgreen.fr.

[1]  Daniel J. Blankenberg,et al.  Galaxy: a platform for interactive large-scale genome analysis. , 2005, Genome research.

[2]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[3]  Dirk Walther,et al.  Matapax: An Online High-Throughput Genome-Wide Association Study Pipeline[C][W][OA] , 2012, Plant Physiology.

[4]  L. Stein,et al.  JBrowse: a next-generation genome browser. , 2009, Genome research.

[5]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[6]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[7]  Bjarni J. Vilhjálmsson,et al.  An efficient multi-locus mixed model approach for genome-wide association studies in structured populations , 2012, Nature Genetics.

[8]  Manuel Ruiz,et al.  SNiPlay: a web-based tool for detection, management and analysis of SNPs. Application to grapevine diversity projects , 2011, BMC Bioinformatics.

[9]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[10]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[11]  Edward S. Buckler,et al.  TASSEL: software for association mapping of complex traits in diverse samples , 2007, Bioinform..

[12]  Pierre Larmande,et al.  Gigwa—Genotype investigator for genome-wide analyses , 2016, GigaScience.

[13]  Pierre Larmande,et al.  Genotype investigator for Genome Wide Analysis (GIGwA) [P1111] , 2015 .

[14]  M. Siol,et al.  EggLib: processing, analysis and simulation tools for population genetics and genomics , 2012, BMC Genetics.

[15]  Olivier Gascuel,et al.  Fast and Accurate Phylogeny Reconstruction Algorithms Based on the Minimum-Evolution Principle , 2002, WABI.

[16]  Ron Shamir,et al.  GEVALT: An integrated software tool for genotype analysis , 2007, BMC Bioinformatics.

[17]  Daniel J. Blankenberg,et al.  Galaxy: A Web‐Based Genome Analysis Tool for Experimentalists , 2010, Current protocols in molecular biology.

[18]  Pierre Larmande,et al.  Erratum to: Gigwa-Genotype investigator for genome-wide analyses , 2016, GigaScience.

[19]  Bjarni J. Vilhjálmsson,et al.  GWAPP: A Web Application for Genome-Wide Association Mapping in Arabidopsis[W][OA] , 2012, Plant Cell.

[20]  Tae-Ho Lee,et al.  SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data , 2014, BMC Genomics.

[21]  D. Coltman,et al.  Detecting population structure using STRUCTURE software: effect of background linkage disequilibrium , 2007, Heredity.

[22]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[23]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[24]  Ali Al-Shahib,et al.  snp-search: simple processing, manipulation and searching of SNPs from high-throughput sequencing , 2013, BMC Bioinformatics.