The Genome of the Netherlands: design, and project goals

Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent–offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910–1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14–15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.

[1]  H. Ostrer,et al.  Founder mutations among the Dutch , 2004, European Journal of Human Genetics.

[2]  Marian Beekman,et al.  Evidence of genetic enrichment for exceptional survival using a family approach: the Leiden Longevity Study , 2006, European Journal of Human Genetics.

[3]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[4]  D. Postma,et al.  Universal risk factors for multifactorial diseases , 2007, European Journal of Epidemiology.

[5]  K. Shianna,et al.  Long-range LD can confound genome scans in admixed populations. , 2008, American journal of human genetics.

[6]  Monique M. B. Breteler,et al.  The Rotterdam Study: 2016 objectives and design update , 2015, European Journal of Epidemiology.

[7]  Kai Ye,et al.  Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads , 2009, Bioinform..

[8]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[9]  T. Lehner,et al.  The Netherlands Twin Register Biobank: A Resource for Genetic Epidemiological Studies , 2010, Twin Research and Human Genetics.

[10]  W. G. Hill,et al.  Genome partitioning of genetic variation for complex traits using common SNPs , 2011, Nature Genetics.

[11]  Gonçalo R. Abecasis,et al.  Fine Mapping of Five Loci Associated with Low-Density Lipoprotein Cholesterol Detects Variants That Double the Explained Heritability , 2011, PLoS genetics.

[12]  Sarah Edkins,et al.  Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease , 2011, Nature Genetics.

[13]  Si Quang Le,et al.  SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples. , 2011, Genome research.

[14]  M. Arfan Ikram,et al.  The Rotterdam Study: 2012 objectives and design update , 2011, European journal of epidemiology.

[15]  Hon-Cheong So,et al.  Uncovering the total heritability explained by all true susceptibility variants in a genome‐wide association study , 2011, Genetic epidemiology.

[16]  Joshua M. Korn,et al.  Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease , 2011, Nature Genetics.

[17]  P. Sham,et al.  Evaluating the heritability explained by known susceptibility variants: a survey of ten complex diseases , 2011, Genetic epidemiology.

[18]  S. Steinberg,et al.  Rate of de novo mutations and the importance of father’s age to disease risk , 2012, Nature.

[19]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[20]  L. Kiemeney,et al.  How to kickstart a national biobanking infrastructure – experiences and prospects of BBMRI-NL , 2012 .

[21]  Paul Scheet,et al.  A Genome-Wide Association Scan on the Levels of Markers of Inflammation in Sardinians Reveals Associations That Underpin Its Complex Regulation , 2012, PLoS genetics.

[22]  B. Penninx,et al.  Estimating the Genetic Variance of Major Depressive Disorder Due to All Single Nucleotide Polymorphisms , 2012, Biological Psychiatry.

[23]  Claudio J. Verzilli,et al.  An Abundance of Rare Functional Variants in 202 Drug Target Genes Sequenced in 14,002 People , 2012, Science.

[24]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[25]  Tanya M. Teslovich,et al.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes , 2012, Nature Genetics.

[26]  P. Scheet,et al.  The Young Netherlands Twin Register (YNTR): Longitudinal Twin and Family Studies in Over 70,000 Children , 2012, Twin Research and Human Genetics.

[27]  H. M. Draisma,et al.  The Adult Netherlands Twin Register: Twenty-Five Years of Survey and Biological Data Collection , 2013, Twin Research and Human Genetics.

[28]  P. Sullivan,et al.  Population structure, migration, and diversifying selection in the Netherlands , 2013, European Journal of Human Genetics.