SNPkit: an efficient approach to systematic evaluation of candidate single nucleotide polymorphisms in public databases.

There is widespread interest in devising genotyping methods for SNPs that are robust, inexpensive, and simple to perform. Although several high-throughput SNP genotyping technologies have been developed, including the oligonucleotide ligation assay, real-time PCR, and mass spectrometry, the issues of simplicity and cost-effectiveness have not been adequately addressed. Here we describe the application of a novel computer software package, SNPkit, which designs SNP genotyping assays based on a classical approach for discriminating alleles, restriction enzyme digestion. SNPkit can be used in genotyping assays for almost any SNPs including those that do not alter "natural" restriction sites. Using this method, 164 SNPs have been evaluated in DNA samples from 48 immortalized cell lines of randomly selected Chinese subjects. Sixty-two (37.8%) of the SNPs appeared to be common (frequencies of the minor alleles are > or = 5%) and were subsequently applied to a larger population-based sample. Overall, by using SNPkit, we have been able to validate and genotype accurately a large fraction of publicly available SNPs without sophisticated instrumentation.

[1]  E. Winn-Deen,et al.  Introduction to PCR/OLA/SCS, a multiplex DNA test, and its application to cystic fibrosis. , 1997, Genetic testing.

[2]  A. Brookes HGBASE--a unified human SNP database. , 2001, Trends in genetics : TIG.

[3]  W. Gish,et al.  Rapid gene mapping in Caenorhabditis elegans using a high density polymorphism map , 2001, Nature Genetics.

[4]  Pui-Yan Kwok,et al.  Single-nucleotide polymorphisms in the public domain: how useful are they? , 2001, Nature Genetics.

[5]  P. Boutin,et al.  Rapid SNP allele frequency determination in genomic DNA pools by pyrosequencing. , 2002, BioTechniques.

[6]  Gabor T. Marth,et al.  A general approach to single-nucleotide polymorphism discovery , 1999, Nature Genetics.

[7]  J. Bader The relative power of SNPs and haplotype as genetic markers for association tests. , 2001, Pharmacogenomics.

[8]  B. Parsons,et al.  Genotypic selection methods for the direct analysis of point mutations. , 1997, Mutation research.

[9]  J. Lundeberg,et al.  A brief history of genetic variation analysis. , 2002, BioTechniques.

[10]  R. Fukuda,et al.  Mismatch PCR RFLP detection of DRD2 Ser311Cys polymorphism and schizophrenia. , 1994, Biochemical and biophysical research communications.

[11]  H. Niiro,et al.  Two polymorphisms within interleukin-3 (hIL3) gene detected by mismatch PCR/RFLP , 1999, Genes and Immunity.

[12]  A. Brookes The essence of SNPs. , 1999, Gene.

[13]  M. Permutt,et al.  Assessing allele frequencies of single nucleotide polymorphisms in DNA pools by pyrosequencing technology. , 2002, BioTechniques.

[14]  G. Gallagher,et al.  Polymorphisms in the TNF gene cluster and MHC serotypes in the West of Scotland , 1997, Immunogenetics.

[15]  S. Lee,et al.  Characterization of publicly available SNPs in the Korean population , 2001, Human mutation.

[16]  J. Stephens,et al.  Haplotype Variation and Linkage Disequilibrium in 313 Human Genes , 2001, Science.

[17]  J. Silver,et al.  Nanoliter scale PCR with TaqMan detection. , 1997, Nucleic acids research.

[18]  Leonid Kruglyak,et al.  The use of a genetic map of biallelic markers in linkage studies , 1997, Nature Genetics.

[19]  T. Niu,et al.  Blood pressure and the T174M and M235T polymorphisms of the angiotensinogen gene. , 1999, Annals of epidemiology.

[20]  Christopher J. Lee,et al.  Genome-wide analysis of single-nucleotide polymorphisms in human expressed sequences , 2000, Nature Genetics.

[21]  D. Cox,et al.  Data mining: Efficiency of using sequence databases for polymorphism discovery , 2001, Human mutation.

[22]  R. Crystal,et al.  Rapid, nonradioactive detection of mutations in the human genome by allele-specific amplification. , 1989, The Journal of laboratory and clinical medicine.