Genome-wide prediction of splice-modifying SNPs in human genes using a new analysis pipeline called AASsites

BackgroundSome single nucleotide polymorphisms (SNPs) are known to modify the risk of developing certain diseases or the reaction to drugs. Due to next generation sequencing methods the number of known human SNPs has grown. Not all SNPs lead to a modified protein, which may be the origin of a disease. Therefore, the recognition of functional SNPs is needed. Because most SNP annotation tools look for SNPs which lead to an amino acid exchange or a premature stop, we designed a new tool called AASsites which searches for SNPs which modify splicing.ResultsAASsites uses several gene prediction programs and open reading frame prediction to compare the wild type (wt) and the variant gene sequence. The results of the comparison are combined by a handmade rule system to classify a change in splicing as “likely, probable, unlikely”. Having received good results from tests with SNPs known for changing the splicing pattern we checked 80,000 SNPs from the human genome which are located near splice sites for their ability to change the splicing pattern of the gene and hereby result in a different protein. We identified 301 “likely” and 985 “probable” classified SNPs with such characteristics. Within this set 33 SNPs are described in the ssSNP Target database to cause modified splicing.ConclusionsWith AASsites single SNPs can be checked for those causing splice modifications. Screening 80,000 known human SNPs we detected about 1,200 SNPs which probably modify splicing. AASsites is available at http://genius.embnet.dkfz-heidelberg.de/menu/biounit/open-husar using any web browser.

[1]  P. Bugert,et al.  MUTATION OF THE VHL GENE IS ASSOCIATED EXCLUSIVELY WITH THE DEVELOPMENT OF NON‐PAPILLARY RENAL CELL CARCINOMAS , 1996, The Journal of pathology.

[2]  K. Xia,et al.  A novel PRPF31 splice-site mutation in a Chinese family with autosomal dominant retinitis pigmentosa. , 2004, Molecular vision.

[3]  T. Degrauw,et al.  Comprehensive mutation analysis of GLDC, AMT, and GCSH in nonketotic hyperglycinemia , 2006, Human mutation.

[4]  O. Tollersrud,et al.  Spectrum of mutations in alpha-mannosidosis. , 1999, American journal of human genetics.

[5]  Jonathan E. Allen,et al.  JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the features of human genes in the ENCODE regions , 2006, Genome Biology.

[6]  John K Field,et al.  A T2517C polymorphism in the GSTM4 gene is associated with risk of developing lung cancer. , 2002, Lung cancer.

[7]  R. Guigó,et al.  GeneID in Drosophila. , 2000, Genome research.

[8]  Hagit Shatkay,et al.  An integrative scoring system for ranking SNPs by their potential deleterious effects , 2009, Bioinform..

[9]  A. De Siervi,et al.  Acute intermittent porphyria: Characterization of two novel mutations in the porphobilinogen deaminase gene, one amino acid deletion (453‐455delAGC) and one splicing aceptor site mutation (IVS8‐1G>T) , 1999, Human mutation.

[10]  R. Durbin,et al.  GeneWise and Genomewise. , 2004, Genome research.

[11]  C. Béroud,et al.  Human Splicing Finder: an online bioinformatics tool to predict splicing signals , 2009, Nucleic acids research.

[12]  M. Ingelman-Sundberg,et al.  Pharmacogenetic biomarkers as tools for improved drug therapy; emphasis on the cytochrome P450 system. , 2010, Biochemical and biophysical research communications.

[13]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[14]  Thilo Dörk,et al.  Nonclassical splicing mutations in the coding and noncoding regions of the ATM Gene: Maximum entropy estimates of splice junction strengths , 2004, Human mutation.

[15]  B. Morgenstern,et al.  AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome , 2006, Genome Biology.

[16]  J. Mallet,et al.  Deletion of 11 Amino Acids in Tuberin Associated with Severe Tuberous Sclerosis Phenotypes: Evidence for a New Essential Domain in the First Third of the Protein , 1997, European journal of human genetics : EJHG.

[17]  Hagit Shatkay,et al.  F-SNP: computationally predicted functional SNPs for disease association studies , 2007, Nucleic Acids Res..

[18]  Alexander G. Churbanov,et al.  A method of predicting changes in human gene splicing induced by genetic variants in context of cis-acting elements , 2010, BMC Bioinformatics.

[19]  Jinhua Wang,et al.  ESEfinder: a web resource to identify exonic splicing enhancers , 2003, Nucleic Acids Res..

[20]  David Haussler,et al.  LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources , 2005, Bioinform..

[21]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[22]  J. Mullikin,et al.  Genomic features defining exonic variants that modulate splicing , 2010, Genome Biology.

[23]  K. Weinberg,et al.  Novel splicing, missense, and deletion mutations in seven adenosine deaminase-deficient patients with late/delayed onset of combined immunodeficiency disease. Contribution of genotype to phenotype. , 1993, The Journal of clinical investigation.

[24]  Sándor Suhai,et al.  Automatic detection of exonic splicing enhancers (ESEs) using SVMs , 2008, BMC Bioinformatics.

[25]  A. Chakravarti Single nucleotide polymorphisms: . . .to a future of genetic medicine , 2001, Nature.

[26]  Z. Hall Cancer , 1906, The Hospital.

[27]  A. Krogh Two methods for improving performance of an HMM application for gene finding , 1997 .

[28]  Rachel Karchin,et al.  Next generation tools for the annotation of human SNPs , 2009, Briefings Bioinform..

[29]  Jong Bhak,et al.  ssSNPTarget: genome‐wide splice‐site single nucleotide polymorphism database , 2009, Human mutation.

[30]  Gert Matthijs,et al.  Impaired glycosylation and cutis laxa caused by mutations in the vesicular H+-ATPase subunit ATP6V0A2 , 2008, Nature Genetics.

[31]  C. Maier,et al.  Mutation screen and association study of EZH2 as a susceptibility gene for aggressive prostate cancer , 2005, The Prostate.