Extensive sequencing of seven human genomes to characterize benchmark reference materials

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode™ WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.

Gintaras Deikus | Ali Bashir | Alex Hastie | Keyan Zhao | Yutao Fu | Srinka Ghosh | Alexa B. R. McIntyre | George M. Church | Noah Spies | Chunlin Xiao | Noah Alexander | Marc Salit | Arend Sidow | Natali Gulbahce | Madeleine Ball | Feng Chen | Stephen T. Sherry | Jason Bobe | Grace X. Y. Zheng | Kristina Giorda | Jennifer McDaniel | Justin M. Zook | Mark J. P. Chaisson | Patrick Marks | David Catoe | Michael Saghbini | Mark Chaisson | Jonathan Trow | Robert Sebra | Madeleine P. Ball | Sofia Kyriazopoulou-Panagiotopoulou | Ziming Weng | Jason R. Bobe | Michael Schnall-Levin | Han Cao | Khoa Pham | Yuling Liu | Z. Weng | A. Sidow | A. Zaranek | G. Church | Yutao Fu | S. Sherry | T. Liang | E. Schadt | C. Mason | J. Zook | M. Salit | R. Sebra | F. Hyland | A. Bashir | Michael Schnall-Levin | Srinka Ghosh | Preston W. Estep | C. Xiao | F. Chen | Keyan Zhao | Kristina M. Giorda | A. Hastie | N. Gulbahce | S. Kyriazopoulou-Panagiotopoulou | H. Cao | Noah Spies | Noah Alexander | Elizabeth Hénaff | D. Chandramohan | A. Moshrefi | K. Pham | William Stedman | M. Saghbini | Ž. Džakula | G. Deikus | R. Truty | Christopher C. Chang | Jonathan Trow | H. Ordonez | Ying Sheng | Fiona Hyland | Eric Schadt | Lindsay Vang | Yuling Liu | Chris Mason | Elizabeth Henaff | Erich Jaeger | Ali Moshrefi | William Stedman | Tiffany Liang | Zeljko Dzakula | Rebecca M. Truty | Alexander W. Zaranek | Preston Estep | Grace X.Y. Zheng | Heather S. Ordonez | Patrice A. Mudivarti | Ying Sheng | Karoline Bjarnesdatter Rypdal | Erich Jaeger | Jennifer McDaniel | K. Rypdal | Patrick J. Marks | David Catoe | Lindsay K. Vang

[1]  J. Zook,et al.  Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls , 2013, Nature Biotechnology.

[2]  M. Frith,et al.  Adaptive seeds tame genomic sequence comparison. , 2011, Genome research.

[3]  Jessica C. Ebert,et al.  Computational Techniques for Human Genome Resequencing Using Mated Gapped Reads , 2012, J. Comput. Biol..

[4]  T. E. Gills,et al.  The certification, development and use of standard reference materials , 1991 .

[5]  C Garmendia,et al.  Highly efficient DNA synthesis by the phage phi 29 DNA polymerase , 1989 .

[6]  Jessica C. Ebert,et al.  Accurate whole genome sequencing and haplotyping from10-20 human cells , 2012, Nature.

[7]  A. W. Hartman,et al.  Certification of SRM1960: Nominal 10 μm Diameter Polystyrene Spheres (“Space Beads”) , 1991, Journal of research of the National Institute of Standards and Technology.

[8]  Xun Xu,et al.  Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology , 2014, GigaScience.

[9]  Wolfgang Losert,et al.  svclassify: a method to establish benchmark structural variant calls , 2015, BMC Genomics.

[10]  Zoe Ann Brown,et al.  Certification of NIST Standard Reference Material 1575a Pine Needles and Results of an International Laboratory Comparison , 2004 .

[11]  Robert B. Hartlage,et al.  This PDF file includes: Materials and Methods , 2009 .

[12]  David C. Schwartz,et al.  An algorithm for assembly of ordered restriction maps from single DNA molecules , 2006, Proceedings of the National Academy of Sciences.