SNPselector: a web tool for selecting SNPs for genetic association studies

SUMMARY Single nucleotide polymorphisms (SNPs) are commonly used for association studies to find genes responsible for complex genetic diseases. With the recent advance of SNP technology, researchers are able to assay thousands of SNPs in a single experiment. But the process of manually choosing thousands of genotyping SNPs for tens or hundreds of genes is time consuming. We have developed a web-based program, SNPselector, to automate the process. SNPselector takes a list of gene names or a list of genomic regions as input and searches the Ensembl genes or genomic regions for available SNPs. It prioritizes these SNPs on their tagging for linkage disequilibrium, SNP allele frequencies and source, function, regulatory potential and repeat status. SNPselector outputs result in compressed Excel spreadsheet files for review by the user. AVAILABILITY SNPselector is freely available at http://primer.duhs.duke.edu/

[1]  M. Hammer,et al.  Hierarchical patterns of global human Y-chromosome diversity. , 2001, Molecular biology and evolution.

[2]  David Haussler,et al.  LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources , 2005, Bioinform..

[3]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[4]  David B. Goldstein,et al.  Pharmacogenetics goes genomic , 2003, Nature Reviews Genetics.

[5]  Simon C. Potter,et al.  An overview of Ensembl. , 2004, Genome research.

[6]  Alberto Riva,et al.  SNPper: retrieval and analysis of human SNPs , 2002, Bioinform..

[7]  J. Haines,et al.  Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease. , 2001, JAMA.

[8]  C. Carlson,et al.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. , 2004, American journal of human genetics.

[9]  G. Stormo,et al.  PromoLign: A database for upstream region analysis and SNPs , 2004, Human mutation.

[10]  D. Nickerson,et al.  Variation is the spice of life , 2001, Nature Genetics.

[11]  Li Jin,et al.  Y chromosome sequence variation and the history of human populations , 2000, Nature Genetics.

[12]  Geoffrey B. Nilsen,et al.  Whole-Genome Patterns of Common DNA Variation in Three Human Populations , 2005, Science.

[13]  Joaquín Dopazo,et al.  PupaSNP Finder: a web tool for finding SNPs with putative effect at transcriptional level , 2004, Nucleic Acids Res..

[14]  M. West,et al.  Gene Expression Phenotypes of Atherosclerosis , 2004, Arteriosclerosis, thrombosis, and vascular biology.

[15]  Simon Whelan,et al.  Statistical Methods in Molecular Evolution , 2005 .

[16]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[17]  Martin Frank,et al.  Carbohydrate Structure Suite (CSS): analysis of carbohydrate 3D structures derived from the PDB , 2004, Nucleic Acids Res..

[18]  H. Garchon,et al.  Association of a single nucleotide polymorphism in the TIGR/MYOCILIN gene promoter with the severity of primary open‐angle glaucoma , 2001, Clinical genetics.

[19]  David Haussler,et al.  Phylogenetic Hidden Markov Models , 2005 .