Mining SNPs from EST databases.

There is considerable interest in the discovery and characterization of single nucleotide polymorphisms (SNPs) to enable the analysis of the potential relationships between human genotype and phenotype. Here we present a strategy that permits the rapid discovery of SNPs from publicly available expressed sequence tag (EST) databases. From a set of ESTs derived from 19 different cDNA libraries, we assembled 300,000 distinct sequences and identified 850 mismatches from contiguous EST data sets (candidate SNP sites), without de novo sequencing. Through a polymerase-mediated, single-base, primer extension technique, Genetic Bit Analysis (GBA), we confirmed the presence of a subset of these candidate SNP sites and have estimated the allele frequencies in three human populations with different ethnic origins. Altogether, our approach provides a basis for rapid and efficient regional and genome-wide SNP discovery using data assembled from sequences from different libraries of cDNAs.

[1]  J C Murray,et al.  Pediatrics and , 1998 .

[2]  J. Trent,et al.  Clustering of non-major histocompatibility complex susceptibility candidate loci in human autoimmune diseases. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[3]  E. Boerwinkle,et al.  DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase gene , 1998, Nature Genetics.

[4]  Amanda J. Wilson,et al.  A search for type 1 diabetes susceptibility genes in families from the United Kingdom , 1998, Nature Genetics.

[5]  C. Nusbaum,et al.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. , 1998, Science.

[6]  K. Clément,et al.  A mutation in the human leptin receptor gene causes obesity and pituitary dysfunction , 1998, Nature.

[7]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[8]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[9]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[10]  Y. Rogers,et al.  Nested genetic bit analysis (N-GBA) for mutation detection in the p53 tumor suppressor gene. , 1997, Nucleic acids research.

[11]  Francis S. Collins,et al.  Variations on a Theme: Cataloging Human DNA Sequence Variation , 1997, Science.

[12]  Leonid Kruglyak,et al.  The use of a genetic map of biallelic markers in linkage studies , 1997, Nature Genetics.

[13]  D. Nickerson,et al.  PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. , 1997, Nucleic acids research.

[14]  R. Griffiths,et al.  Archaic African and Asian lineages in the genetic ancestry of modern humans. , 1997, American journal of human genetics.

[15]  J. Brockmöller,et al.  Cytochrome P450 2D6 variants in a Caucasian population: allele frequencies and phenotypic consequences. , 1997, American journal of human genetics.

[16]  Peter Beighton,et al.  de la Chapelle, A. , 1997 .

[17]  N. Schork Genetically complex cardiovascular traits. Origins, problems, and potential solutions. , 1997, Hypertension.

[18]  S. P. Fodor,et al.  Detection of heterozygous mutations in BRCA1 using high density oligonucleotide arrays and two–colour fluorescence analysis , 1996, Nature Genetics.

[19]  James L. Winkler,et al.  Accessing Genetic Information with High-Density DNA Arrays , 1996, Science.

[20]  E. Mardis,et al.  Generation and analysis of 280,000 human expressed sequence tags. , 1996, Genome research.

[21]  A. Mirzabekov,et al.  DNA analysis and diagnostics on oligonucleotide microchips. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Cécile Fizames,et al.  A comprehensive genetic map of the human genome based on 5,264 microsatellites , 1996, Nature.

[23]  Francis S. Collins,et al.  Positional cloning moves from perditional to traditional , 1995, Nature Genetics.

[24]  Y. Rogers,et al.  Genetic Bit Analysis: a solid phase method for typing single nucleotide polymorphisms. , 1994, Nucleic acids research.

[25]  Steven E. Bayer,et al.  A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. , 1994, Science.

[26]  D. Nickerson,et al.  Comparative analysis of human DNA variations by fluorescence-based sequencing of PCR products. , 1994, Genomics.

[27]  J. Lavergne,et al.  Human Gene Mutation , 1994 .

[28]  Y. Rogers,et al.  The use of phosphorothioate primers and exonuclease hydrolysis for the preparation of single-stranded PCR products and their detection by solid-phase hybridization. , 1994, PCR methods and applications.

[29]  L. Abraham,et al.  Analysis of MHC genomic structure and gene content between HLA-B and TNF using yeast artificial chromosomes. , 1993, Genomics.

[30]  J. Craig Venter,et al.  Rapid cDNA sequencing (expressed sequence tags) from a directionally cloned human infant brain cDNA library , 1993, Nature Genetics.

[31]  L. Hood,et al.  Sequence length and error analysis of Sequenase and automated Taq cycle sequencing methods. , 1993, BioTechniques.

[32]  Eric Lander,et al.  Linkage disequilibrium mapping in isolated founder populations: diastrophic dysplasia in Finland , 1992, Nature Genetics.

[33]  R Higuchi,et al.  Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. , 2013, BioTechniques.

[34]  Wen-Hsiung Li,et al.  Low nucleotide diversity in man. , 1991, Genetics.

[35]  J. Sambrook,et al.  Molecular Cloning: A Laboratory Manual , 2001 .