BS-SNPer: SNP calling in bisulfite-seq data

Summary: Sodium bisulfite conversion followed by sequencing (BS-Seq, such as whole genome bisulfite sequencing or reduced representation bisulfite sequencing) has become popular for studying human epigenetic profiles. Identifying single nucleotide polymorphisms (SNPs) is important for quantification of methylation levels and for study of allele-specific epigenetic events such as imprinting. However, SNP calling in such data is complex and time consuming. Here, we present an ultrafast and memory-efficient package named BS-SNPer for the exploration of SNP sites from BS-Seq data. Compared with Bis-SNP, a popular BS-Seq specific SNP caller, BS-SNPer is over 100 times faster and uses less memory. BS-SNPer also offers higher sensitivity and specificity compared with existing methods. Availability and implementation: BS-SNPer is written in C++ and Perl, and is freely available at https://github.com/hellbelly/BS-Snper. Contact: bolund@biomed.au.dk, kdso@clin.au.dk or orntoft@ki.au.dk Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  P. Laird Principles and challenges of genome-wide DNA methylation analysis , 2010, Nature Reviews Genetics.

[2]  J. Oliver,et al.  MethylExtract: High-Quality methylation maps and SNV calling from whole genome bisulfite sequencing data , 2013, F1000Research.

[3]  Huanming Yang,et al.  Multilayered molecular profiling supported the monoclonal origin of metastatic renal cell carcinoma , 2014, International journal of cancer.

[4]  P. Laird,et al.  Regions of focal DNA hypermethylation and long-range hypomethylation in colorectal cancer coincide with nuclear lamina–associated domains , 2011, Nature Genetics.

[5]  D. Bell,et al.  Sequence context at human single nucleotide polymorphisms: overrepresentation of CpG dinucleotide at polymorphic sites and suppression of variation in CpG islands. , 2003, Journal of molecular biology.

[6]  J. Wilkins,et al.  Genomic imprinting and methylation: epigenetic canalization and conflict. , 2005, Trends in genetics : TIG.

[7]  A. Feinberg,et al.  Increased methylation variation in epigenetic domains across cancer types , 2011, Nature Genetics.

[8]  Huanming Yang,et al.  SNP detection for massively parallel whole-genome resequencing. , 2009, Genome research.

[9]  Felix Krueger,et al.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications , 2011, Bioinform..

[10]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[11]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[12]  W. Reik,et al.  Genomic imprinting: parental influence on the genome , 2001, Nature Reviews Genetics.

[13]  Stefano Lonardi,et al.  BRAT-BW: efficient and accurate mapping of bisulfite-treated reads , 2012, Bioinform..

[14]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[15]  Wei Li,et al.  BSMAP: whole genome bisulfite sequence MAPping program , 2009, BMC Bioinformatics.

[16]  P. Laird,et al.  Bis-SNP: Combined DNA methylation and SNP calling for Bisulfite-seq data , 2012, Genome Biology.

[17]  Pao-Yang Chen,et al.  BS Seeker: precise mapping for bisulfite sequencing , 2010, BMC Bioinformatics.

[18]  Brent Pedersen,et al.  MethylCoder: software pipeline for bisulfite-treated sequences , 2011, Bioinform..

[19]  J. Oliver,et al.  MethylExtract: High-Quality methylation maps and SNV calling from whole genome bisulfite sequencing data. , 2013, F1000Research.