Insights into clonal haematopoiesis from 8,342 mosaic chromosomal alterations

The selective pressures that shape clonal evolution in healthy individuals are largely unknown. Here we investigate 8,342 mosaic chromosomal alterations, from 50 kb to 249 Mb long, that we uncovered in blood-derived DNA from 151,202 UK Biobank participants using phase-based computational techniques (estimated false discovery rate, 6–9%). We found six loci at which inherited variants associated strongly with the acquisition of deletions or loss of heterozygosity in cis. At three such loci (MPL, TM2D3–TARSL2, and FRA10B), we identified a likely causal variant that acted with high penetrance (5–50%). Inherited alleles at one locus appeared to affect the probability of somatic mutation, and at three other loci to be objects of positive or negative clonal selection. Several specific mosaic chromosomal alterations were strongly associated with future haematological malignancies. Our results reveal a multitude of paths towards clonal expansions with a wide range of effects on human health.

[1]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[2]  Jianxin Shi,et al.  Developing and evaluating polygenic risk prediction models for stratified disease prevention , 2016, Nature Reviews Genetics.

[3]  Alan M. Kwong,et al.  Next-generation genotype imputation service and methods , 2016, Nature Genetics.

[4]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[5]  Eran Halperin,et al.  Common variation at 6p21.31 (BAK1) influences the risk of chronic lymphocytic leukemia. , 2012, Blood.

[6]  K. Gunderson,et al.  High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. , 2006, Genome research.

[7]  Guy Pratt,et al.  A genome-wide association study identifies six susceptibility loci for chronic lymphocytic leukemia , 2008, Nature Genetics.

[8]  Paul Scheet,et al.  Haplotype-based profiling of subtle allelic imbalance with SNP arrays , 2013, Genome research.

[9]  Ingo Ruczinski,et al.  Detectable clonal mosaicism from birth to old age and its relationship to cancer , 2012, Nature Genetics.

[10]  Alexander Gusev,et al.  Whole population, genome-wide mapping of hidden relatedness. , 2009, Genome research.

[11]  S. Elledge,et al.  Cumulative Haploinsufficiency and Triplosensitivity Drive Aneuploidy Patterns and Shape the Cancer Genome , 2013, Cell.

[12]  Bonnie Berger,et al.  Efficient Bayesian mixed model analysis increases association power in large cohorts , 2014 .

[13]  A. Reymond,et al.  KCTD13 is a major driver of mirrored neuroanatomical phenotypes of the 16p11.2 copy number variant , 2012, Nature.

[14]  Paolo Vineis,et al.  Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia , 2016, Nature Communications.

[15]  P. Visscher,et al.  Estimating missing heritability for disease from genome-wide association studies. , 2011, American journal of human genetics.

[16]  E. Zeggini,et al.  Leukemia-Associated Somatic Mutations Drive Distinct Patterns of Age-Related Clonal Hemopoiesis , 2015, Cell reports.

[17]  A. Tefferi,et al.  Novel mutations and their functional and clinical relevance in myeloproliferative neoplasms: JAK2, MPL, TET2, ASXL1, CBL, IDH and IKZF1 , 2010, Leukemia.

[18]  Martin A. Nowak,et al.  Mutations driving CLL and their evolution in progression and relapse , 2015, Nature.

[19]  H. Ostrer,et al.  Familial colorectal cancer in Ashkenazim due to a hypermutable tract in APC , 1997, Nature Genetics.

[20]  Yunfeng Pan,et al.  A PP4 phosphatase complex dephosphorylates RPA2 to facilitate DNA repair via homologous recombination , 2010, Nature Structural &Molecular Biology.

[21]  Beryl B. Cummings,et al.  Landscape of X chromosome inactivation across human tissues , 2016, Nature.

[22]  R I Richards,et al.  FRA10B structure reveals common elements in repeat expansion and chromosomal fragile site genesis. , 1998, Molecular cell.

[23]  Paul Scheet,et al.  Extensive Hidden Genomic Mosaicism Revealed in Normal Tissue. , 2016, American journal of human genetics.

[24]  A Sigurdsson,et al.  The germline sequence variant rs2736100_C in TERT associates with myeloproliferative neoplasms , 2014, Leukemia.

[25]  K. Shianna,et al.  Long-range LD can confound genome scans in admixed populations. , 2008, American journal of human genetics.

[26]  N. Wray,et al.  Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance components analysis , 2015, Nature Genetics.

[27]  Mitchell J. Machiela,et al.  LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants , 2015, Bioinform..

[28]  Adele Seniori Costantini,et al.  InterLymph hierarchical classification of lymphoid neoplasms for epidemiologic research based on the WHO classification (2008): update and future directions. , 2010, Blood.

[29]  Joshua M. Korn,et al.  Association between microdeletion and microduplication at 16p11.2 and autism. , 2008, The New England journal of medicine.

[30]  Thomas Lengauer,et al.  CpG Island Mapping by Epigenome Prediction , 2007, PLoS Comput. Biol..

[31]  Kari Stefansson,et al.  Clonal hematopoiesis, with and without candidate driver mutations, is common in the elderly. , 2017, Blood.

[32]  T. Druley,et al.  Clonal haematopoiesis harbouring AML-associated mutations is ubiquitous in healthy adults , 2016, Nature Communications.

[33]  Murim Choi,et al.  Mitotic Recombination in Patients with Ichthyosis Causes Reversion of Dominant Mutations in KRT10 , 2010, Science.

[34]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[35]  Aneela Majid,et al.  A genome-wide association study identifies multiple susceptibility loci for chronic lymphocytic leukemia , 2013, Nature Genetics.

[36]  Robert I. Richards,et al.  Dynamic mutations: A new class of mutations causing human disease , 1992, Cell.

[37]  Paolo Vineis,et al.  Genome-wide Association Study Identifies Multiple Risk Loci for Chronic Lymphocytic Leukemia , 2013, Nature Genetics.

[38]  Christian Gilissen,et al.  Ultra-sensitive Sequencing Identifies High Prevalence of Clonal Hematopoiesis-Associated Mutations throughout Adult Life. , 2017, American journal of human genetics.

[39]  Paz Polak,et al.  Genetic Variation in Human DNA Replication Timing , 2014, Cell.

[40]  A. McKenna,et al.  Evolution and Impact of Subclonal Mutations in Chronic Lymphocytic Leukemia , 2012, Cell.

[41]  Hui Wang,et al.  RYBP stabilizes p53 by modulating MDM2 , 2009, EMBO reports.

[42]  P. O’Reilly,et al.  Identification of seven loci affecting mean telomere length and their association with disease , 2013, Nature Genetics.

[43]  Andrew Collins,et al.  JAK2 haplotype is a major risk factor for the development of myeloproliferative neoplasms , 2009, Nature Genetics.

[44]  Paola Guglielmelli,et al.  Genetic variation at MECOM, TERT, JAK2 and HBS1L-MYB predisposes to myeloproliferative neoplasms , 2015, Nature Communications.

[45]  Eric Banks,et al.  Tools and best practices for data processing in allelic expression analysis , 2015, Genome Biology.

[46]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[47]  Kari Stefansson,et al.  Genetic variants associated with mosaic Y chromosome loss highlight cell cycle genes and overlap with cancer susceptibility , 2017, Nature Genetics.

[48]  Giulio Genovese,et al.  Improved IBD detection using incomplete haplotype information , 2010, BMC Genetics.

[49]  William Wheeler,et al.  Detectable clonal mosaicism and its relationship to aging and cancer , 2012, Nature Genetics.

[50]  Juan R. González,et al.  R-Gada: a fast and flexible pipeline for copy number analysis in association studies , 2010, BMC Bioinformatics.

[51]  S. Gabriel,et al.  Clonal Hematopoiesis and Risk of Atherosclerotic Cardiovascular Disease , 2017, The New England journal of medicine.

[52]  Derek Y. Chiang,et al.  The landscape of somatic copy-number alteration across human cancers , 2010, Nature.

[53]  P. Stankiewicz,et al.  Recurrent reciprocal 16p11.2 rearrangements associated with global developmental delay, behavioural problems, dysmorphism, epilepsy, and abnormal head size , 2009, Journal of Medical Genetics.

[54]  A. Valencia,et al.  Non-coding recurrent mutations in chronic lymphocytic leukaemia , 2015, Nature.

[55]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[56]  Jian Gu,et al.  Mosaic loss of chromosome Y is associated with common variation near TCL1A , 2016, Nature Genetics.

[57]  Sharon J. Diskin,et al.  Adjustment of genomic waves in signal intensities from whole-genome SNP genotyping platforms , 2008, Nucleic acids research.

[58]  C. Lord,et al.  The Simons Simplex Collection: A Resource for Identification of Autism Genetic Risk Factors , 2010, Neuron.

[59]  Neil E Caporaso,et al.  B-cell clones as early markers for chronic lymphocytic leukemia. , 2009, The New England journal of medicine.

[60]  G. Sutherland,et al.  Heritable fragile sites on human chromosomes. IX. Population cytogenetics and segregation analysis of the BrdU-requiring fragile site at 10q25. , 1982, American journal of human genetics.

[61]  William Wheeler,et al.  Characterization of large structural genetic mosaicism in human autosomes. , 2015, American journal of human genetics.

[62]  M. McCarthy,et al.  Age-related clonal hematopoiesis associated with adverse outcomes. , 2014, The New England journal of medicine.

[63]  Lars Lind,et al.  Smoking is associated with mosaic loss of chromosome Y , 2015, Science.

[64]  S. Gabriel,et al.  Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. , 2014, The New England journal of medicine.

[65]  William Wheeler,et al.  Female chromosome X mosaicism is age-related and preferentially affects the inactivated X chromosome , 2016, Nature Communications.

[66]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[67]  Kari Stefansson,et al.  A germline variant in the TP53 polyadenylation signal confers cancer susceptibility , 2011, Nature Genetics.

[68]  Tom R. Gaunt,et al.  Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel , 2015, Nature Communications.

[69]  S. Richards,et al.  Monoclonal B-cell lymphocytosis and chronic lymphocytic leukemia. , 2008, The New England journal of medicine.

[70]  Ashot Harutyunyan,et al.  A common JAK2 haplotype confers susceptibility to myeloproliferative neoplasms , 2009, Nature Genetics.

[71]  A. Gurney,et al.  Thrombocytopenia in c-mpl-deficient mice. , 1994, Science.

[72]  Nicholas Eriksson,et al.  Germ line variants predispose to both JAK2 V617F clonal hematopoiesis and myeloproliferative neoplasms. , 2016, Blood.

[73]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[74]  H E White,et al.  Profound parental bias associated with chromosome 14 acquired uniparental disomy indicates targeting of an imprinted locus , 2015, Leukemia.

[75]  Ryan L. Collins,et al.  An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder , 2018, Nature Genetics.

[76]  Michael A McDevitt,et al.  Copy neutral loss of heterozygosity: a novel chromosomal lesion in myeloid malignancies. , 2010, Blood.

[77]  Po-Ru Loh,et al.  Fast and accurate long-range phasing in a UK Biobank cohort , 2015, Nature Genetics.

[78]  E. J. Sinclair,et al.  Trisomy 15 associated with loss of the Y chromosome in bone marrow: a possible new aging effect. , 1998, Cancer genetics and cytogenetics.

[79]  Celine M Vachon,et al.  Genome-wide association study identifies a novel susceptibility locus at 6p21.3 among familial CLL. , 2011, Blood.

[80]  Kenneth Offit,et al.  A germline JAK2 SNP is associated with predisposition to the development of JAK2V617F-positive myeloproliferative neoplasms , 2009, Nature Genetics.

[81]  Shane A. McCarthy,et al.  Reference-based phasing using the Haplotype Reference Consortium panel , 2016, Nature Genetics.

[82]  Mario Cazzola,et al.  The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia. , 2016, Blood.

[83]  A. Børresen-Dale,et al.  The Life History of 21 Breast Cancers , 2012, Cell.

[84]  Gabor T. Marth,et al.  An integrated map of structural variation in 2,504 human genomes , 2015, Nature.

[85]  G. Sutherland,et al.  Heritable fragile sites on human chromosomes. V. A new class of fragile site requiring BrdU for expression. , 1980, American journal of human genetics.

[86]  R Fonseca,et al.  Monoclonal B-cell lymphocytosis is characterized by mutations in CLL putative driver genes and clonal heterogeneity many years before disease progression , 2014, Leukemia.

[87]  Nathaniel Rothman,et al.  Mosaic chromosome 20q deletions are more frequent in the aging population. , 2017, Blood advances.

[88]  Jan P. Dumanski,et al.  Mosaicism in health and disease — clones picking up speed , 2016, Nature Reviews Genetics.