A highly annotated whole-genome sequence of a Korean individual

Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China. Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shotgun sequencing (27.8× coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades, disclosed nearly 3.45 million single nucleotide polymorphisms (SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.
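The genome-wide correlation between SNP and indel densities can be illustrated with a minimal sketch. This is not the authors' pipeline: the abstract does not describe the window size or the correlation statistic used, so the fixed-width windows, the Pearson coefficient, the function names `window_counts` and `pearson`, and the toy coordinates below are all assumptions made for illustration.

```python
from math import sqrt


def window_counts(positions, chrom_length, window):
    """Count variants in each fixed-size window (0-based positions)."""
    n_windows = (chrom_length + window - 1) // window
    counts = [0] * n_windows
    for pos in positions:
        counts[pos // window] += 1
    return counts


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy coordinates on a 300 bp "chromosome", not AK1 data.
snp_positions = [5, 12, 18, 110, 205, 210, 220, 230]
indel_positions = [7, 215, 225]

snp_density = window_counts(snp_positions, chrom_length=300, window=100)
indel_density = window_counts(indel_positions, chrom_length=300, window=100)
r = pearson(snp_density, indel_density)
```

On real data the window would be far larger (kilobases to megabases), and a rank-based statistic may be preferable if the per-window counts are heavily skewed.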
