Gene-based SNP discovery as part of the Japanese Millennium Genome Project: identification of 190 562 genetic variations in the human genome

Abstract

To build an infrastructure for genome-wide association studies of common diseases and drug sensitivities, we have been systematically exploring common variants by resequencing gene-containing genomic regions in DNA from 24 Japanese individuals. We have analyzed a total of 154 Mb, corresponding to approximately 5% of the human genome, and have so far identified 174 269 single-nucleotide polymorphisms and 16 293 insertion/deletion polymorphisms within gene regions, i.e., one polymorphism per 807 bp on average. Our data are freely available via our web site (http://snp.ims.u-tokyo.ac.jp) and will facilitate studies to identify genes associated with susceptibility to common diseases and genes involved in sensitivity to therapeutic drugs.
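The headline figures can be cross-checked with a little arithmetic. The sketch below is illustrative only: it uses just the counts quoted in the abstract, and the variable names are our own. Because it starts from the rounded 154 Mb figure, it gives an average spacing of about 808 bp rather than exactly 807 bp; the reported value presumably reflects the unrounded number of bases screened.

```python
# Counts as quoted in the abstract.
snps = 174_269             # single-nucleotide polymorphisms
indels = 16_293            # insertion/deletion polymorphisms
screened_bp = 154_000_000  # "154 Mb" (rounded figure, an assumption about precision)

# Total variations: should match the 190 562 given in the title.
total_variants = snps + indels

# Average spacing between polymorphisms in the screened sequence.
bp_per_variant = screened_bp / total_variants

print(f"total variations: {total_variants}")
print(f"average spacing: {bp_per_variant:.1f} bp per polymorphism")
```

The sum of the two counts reproduces the 190 562 variations named in the title, and the spacing agrees with the reported density to within the rounding of the 154 Mb total.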
