VarioWatch: providing large-scale and comprehensive annotations on human genomic variants in the next generation sequencing era

VarioWatch (http://genepipe.ncgm.sinica.edu.tw/variowatch/) has been vastly improved since its predecessor, GenoWatch, was published in the 2008 Web Server Issue. It now annotates a variant at least 10 000 times faster. This drastic speed increase, achieved through a complete redesign of its working mechanism, makes VarioWatch capable of annotating millions of human genomic variants generated by next generation sequencing in minutes, if not seconds. While using the MegaQuery function of VarioWatch to annotate variants quickly, users can apply various filters to retrieve the subgroup of variants that satisfies their requirements, for example by risk level or region of interest. In addition to this leap in performance, many new features have been added, such as annotation of novel variants, functional analyses of splice sites and in/dels, detailed variant information in tabulated form, and a risk level decision tree for the analyzed variant. Up to 1000 target variants can be visualized with our carefully designed Genome View, Gene View, Transcript View and Variation View. Two commonly used reference versions, NCBI build 36.3 and NCBI build 37.2, are supported. VarioWatch is unique in its ability to annotate millions of variants online comprehensively and efficiently, deliver the results in real time, and visualize up to 1000 annotated variants.
