SNP2TFBS – a database of regulatory SNPs affecting predicted transcription factor binding site affinity

SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/.

[1]  David J. Arenillas,et al.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..

[2]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[3]  Helge G. Roider,et al.  Transcription factor binding predictions using TRAP for the analysis of ChIP-seq data and regulatory SNPs , 2011, Nature Protocols.

[4]  Leng Han,et al.  Investigating the relationship of DNA methylation with mutation rate and allele frequency in the human genome , 2012, BMC Genomics.

[5]  Sebo Withoff,et al.  Genetic variation in the non-coding genome: Involvement of micro-RNAs and long non-coding RNAs in disease. , 2014, Biochimica et biophysica acta.

[6]  James Bailey,et al.  is-rSNP: a novel technique for in silico regulatory SNP detection , 2010, Bioinform..

[7]  Jonathan K. Pritchard,et al.  The Genetic and Mechanistic Basis for Variation in Gene Regulation , 2015, PLoS genetics.

[8]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[9]  Chandler Zuo,et al.  atSNP: transcription factor binding affinity testing for regulatory SNP detection , 2015, Bioinform..

[10]  Jing Wu,et al.  Hidden Markov model and its applications in motif findings. , 2010, Methods in molecular biology.

[11]  Eurie L. Hong,et al.  Annotation of functional variation in personal genomes using RegulomeDB , 2012, Genome research.

[12]  James Bailey,et al.  is-rSNP: a novel technique for in silico regulatory SNP detection , 2010, BMC Bioinformatics.

[13]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[14]  P. Campbell,et al.  OncoCis: annotation of cis-regulatory mutations in cancer , 2014, Genome Biology.

[15]  David J. Arenillas,et al.  In Silico Detection of Sequence Variations Modifying Transcriptional Regulation , 2007, PLoS Comput. Biol..

[16]  A. Riva Large-scale computational identification of regulatory SNPs with rSNP-MAPPER , 2012, BMC Genomics.

[17]  Simon G. Coetzee,et al.  motifbreakR: an R/Bioconductor package for predicting variant effects at transcription factor binding sites , 2015, Bioinform..

[18]  M. Patti,et al.  Increased SRF transcriptional activity in human and mouse skeletal muscle is a signature of insulin resistance. , 2011, The Journal of clinical investigation.

[19]  Manolis Kellis,et al.  HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease , 2015, Nucleic Acids Res..

[20]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[21]  M. Daly,et al.  Genetic and Epigenetic Fine-Mapping of Causal Autoimmune Disease Variants , 2014, Nature.

[22]  M. Gerstein,et al.  AlleleSeq: analysis of allele-specific expression and binding in a network framework , 2011, Molecular systems biology.