Human knockouts in a cohort with a high rate of consanguinity

A major goal of biomedicine is to understand the function of every gene in the human genome.1 Null mutations can disrupt both copies of a given gene in humans and phenotypic analysis of such ‘human knockouts’ can provide insight into gene function. To date, comprehensive analysis of genes knocked out in humans has been limited by the fact that null mutations are infrequent in the general population and so, observing an individual homozygous null for a given gene is exceedingly rare.2,3 However, consanguineous unions are more likely to result in offspring who carry homozygous null mutations. In Pakistan, consanguinity rates are notably high.4 Here, we sequenced the protein-coding regions of 7,078 adult participants living in Pakistan and performed phenotypic analysis to identify homozygous null individuals and to understand consequences of complete gene disruption in humans. We enumerated 36,850 rare (<1 % minor allele frequency) null mutations. These homozygous null mutations led to complete inactivation of 961 genes in at least one participant. Homozygosity for null mutations at APOC3 was associated with absent plasma apolipoprotein C-III levels; at PLAG27, with absent enzymatic activity of soluble lipoprotein-associated phospholipase A2; at CYP2F1, with higher plasma interleukin-8 concentrations; and at either A3GALT2 or NRG4, with markedly reduced plasma insulin C-peptide concentrations. After physiologic challenge with oral fat, APOC3 knockouts displayed marked blunting of the usual post-prandial rise in plasma triglycerides compared to wild-type family members. These observations provide a roadmap to understand the consequences of complete disruption of a large fraction of genes in the human genome.

[1]  H. Stefánsson,et al.  Identification of a large set of rare complete human knockouts , 2015, Nature Genetics.

[2]  David M. Herrington,et al.  Multiple rare alleles at LDLR and APOA5 confer risk for early-onset myocardial infarction , 2014, Nature.

[3]  Christopher S. Poultney,et al.  Synaptic, transcriptional, and chromatin genes disrupted in autism , 2014, Nature.

[4]  R. Frikke-Schmidt,et al.  Loss-of-Function Mutations in APOC3 and Risk of Ischemic Vascular Disease , 2014 .

[5]  Zhimin Chen,et al.  The brown fat-enriched secreted factor Nrg 4 preserves metabolic homeostasis through attenuating hepatic lipogenesis , 2016 .

[6]  Stephan J Sanders,et al.  A framework for the interpretation of de novo mutation in human disease , 2014, Nature Genetics.

[7]  B. Nordestgaard,et al.  Loss-of-function mutations in APOC3 and risk of ischemic vascular disease. , 2014, The New England journal of medicine.

[8]  He Zhang,et al.  Loss-of-function mutations in APOC3, triglycerides, and coronary disease. , 2014, The New England journal of medicine.

[9]  Eric S. Lander,et al.  A polygenic burden of rare disruptive mutations in schizophrenia , 2014, Nature.

[10]  Mauricio O. Carneiro,et al.  From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline , 2013, Current protocols in bioinformatics.

[11]  R. Hegele,et al.  Apolipoprotein C-III: going back to the future for a lipid drug target. , 2013, Circulation research.

[12]  G. Abecasis,et al.  Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. , 2012, American journal of human genetics.

[13]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[14]  Joseph K. Pickrell,et al.  A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes , 2012, Science.

[15]  D. Panagiotakos,et al.  Validity of abbreviated oral fat tolerance tests for assessing postprandial lipemia. , 2011, Clinical nutrition.

[16]  Tien Yin Wong,et al.  Genome-wide association study in individuals of South Asian ancestry identifies six new type 2 diabetes susceptibility loci , 2011, Nature Genetics.

[17]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[18]  Dennis C. Friedrich,et al.  A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries , 2011, Genome Biology.

[19]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[20]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[21]  A. Clark,et al.  Population genetic structure of the people of Qatar. , 2010, American journal of human genetics.

[22]  R. Virmani,et al.  Concept of vulnerable/unstable plaque. , 2010, Arteriosclerosis, thrombosis, and vascular biology.

[23]  Daniel Rios,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[24]  R. Bottino,et al.  Insulin secretion and glucose metabolism in alpha 1,3‐galactosyltransferase knock‐out pigs compared to wild‐type pigs , 2010, Xenotransplantation.

[25]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[26]  Usman Ahmad,et al.  The Pakistan Risk of Myocardial Infarction Study: a resource for the study of genetic, lifestyle and other determinants of myocardial infarction in South Asia , 2009, European Journal of Epidemiology.

[27]  E. Wolf,et al.  The epidermal growth factor receptor ligands at a glance , 2009, Journal of cellular physiology.

[28]  J. O’Connell,et al.  A Null Mutation in Human APOC3 Confers a Favorable Plasma Lipid Profile and Apparent Cardioprotection , 2008, Science.

[29]  D. Pellicci,et al.  Humans Lack iGb3 Due to the Absence of Functional iGb3-Synthase: Implications for NKT Cell Development and Transplantation , 2008, PLoS biology.

[30]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[31]  A. K. Hansen,et al.  Glucose intolerance in a xenotransplantation model: studies in alpha‐gal knockout mice , 2006, APMIS : acta pathologica, microbiologica, et immunologica Scandinavica.

[32]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[33]  David W. Russell,et al.  Purification of nucleic acids by extraction with phenol:chloroform. , 2006, CSH protocols.

[34]  B. Carr,et al.  Characterization of the Human Lung CYP2F1 Gene and Identification of a Novel Lung-specific Binding Motif* , 2003, The Journal of Biological Chemistry.

[35]  Bernadette Modell,et al.  Genetic counselling and customary consanguineous marriage , 2002, Nature Reviews Genetics.

[36]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[37]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.

[38]  A H Bittles,et al.  Reproductive behavior and health in consanguineous marriages , 1991, Science.

[39]  T. Standiford,et al.  Interleukin-8 gene expression by a pulmonary epithelial cell line. A model for cytokine networks in the lung. , 1990, The Journal of clinical investigation.

[40]  Mosteller Rd Simplified Calculation of Body-Surface Area , 1987 .

[41]  E S Lander,et al.  Homozygosity mapping: a way to map human recessive traits with the DNA of inbred children. , 1987, Science.

[42]  R. Mosteller Simplified calculation of body-surface area. , 1987, The New England journal of medicine.

[43]  Sewall Wright,et al.  Coefficients of Inbreeding and Relationship , 1922, The American Naturalist.