Centuries of genome instability and evolution in soft-shell clam transmissible cancer

Transmissible cancers are infectious parasitic clones of malignant cells that metastasize to new hosts, living past the death of the founder animal in which the cancer initiated. Several lineages of transmissible cancer have recently been identified in bivalves, including one that has spread through the soft-shell clam (Mya arenaria) population along the east coast of North America. To investigate the evolutionary history of this transmissible cancer lineage, we assembled a highly contiguous 1.2 Gb soft-shell clam reference genome and characterized somatic mutations from cancer sequences. We show that all cancer cases observed descend from a single founder and cluster into two geographically distinct sub-lineages. We discover a previously unreported clock-like mutational signature that predicts the cancer lineage to be 344 to 877 years old, indicating that it spread undetected long before it was first observed in the 1970s. We observe high mutation density, widespread copy number gain, structural rearrangement, loss of heterozygosity, variable telomere lengths, mitochondrial genome expansion, and transposable element activity, all indicative of an unstable cancer genome. Our study reveals the ability for an invertebrate cancer lineage to survive for centuries while its genome continues to structurally mutate, likely contributing to the ability of this lineage to adapt as a parasitic cancer. SUMMARY The genome of a contagious cancer in clams reveals structural instability of multiple types throughout the ∼500 years since its origin.

[1]  Z. Ning,et al.  The evolution of two transmissible leukaemias colonizing the coasts of Europe , 2022, bioRxiv.

[2]  Hannes P. Eggertsson,et al.  The evolution of two transmissible cancers in Tasmanian devils , 2022, bioRxiv.

[3]  K. Smolarz,et al.  Horizontal transmission of disseminated neoplasia in the widespread clam Macoma balthica from the Southern Baltic Sea , 2022, Molecular ecology.

[4]  S. Roberts,et al.  Epigenetic and Genetic Population Structure is Coupled in a Marine Invertebrate , 2022, bioRxiv.

[5]  Lishuai Wang,et al.  Pan-Cancer Analyses Identify the CTC1-STN1-TEN1 Complex as a Protective Factor and Predictive Biomarker for Immune Checkpoint Blockade in Cancer , 2022, Frontiers in Genetics.

[6]  J. Tubío,et al.  Mitochondrial genome sequencing of marine leukaemias reveals cancer contagion between clam species in the Seas of Southern Europe , 2022, eLife.

[7]  Rachael M. Giersch,et al.  Survival and Detection of Bivalve Transmissible Neoplasia from the Soft-Shell Clam Mya arenaria (MarBTN) in Seawater , 2021, bioRxiv.

[8]  N. Bierne,et al.  Traits of a mussel transmissible cancer are reminiscent of a parasitic life style , 2021, Scientific Reports.

[9]  M. Stratton,et al.  Increased somatic mutation burdens in normal human cells due to defective DNA polymerases , 2021, Nature Genetics.

[10]  David C. Jones,et al.  Somatic mutation rates scale with lifespan across mammals , 2021, Nature.

[11]  Menna E. Jones,et al.  Evolution and lineage dynamics of a transmissible cancer in Tasmanian devils , 2020, PLoS biology.

[12]  M. Lesser,et al.  The Genome of the Softshell Clam Mya arenaria and the Evolution of Apoptosis , 2020, Genome biology and evolution.

[13]  P. Chinnery,et al.  Recurrent horizontal transfer identifies mitochondrial positive selection in a transmissible cancer , 2020, Nature Communications.

[14]  A. Poetsch The genomics of oxidative DNA damage, repair, and resulting mutagenesis , 2020, Computational and structural biotechnology journal.

[15]  M. Lesser,et al.  Effects of Thermal Stress and Ocean Acidification on the Expression of the Retrotransposon Steamer in the Softshell Mya arenaria , 2019, Journal of Shellfish Research.

[16]  Andrew G. Clark,et al.  RepeatModeler2: automated genomic discovery of transposable element families , 2019, bioRxiv.

[17]  Rachael M. Giersch,et al.  A single clonal lineage of transmissible cancer identified in two marine mussel species in South America and Europe , 2019, eLife.

[18]  H. Jacobs,et al.  Mutating for Good: DNA Damage Responses During Somatic Hypermutation , 2019, Front. Immunol..

[19]  Anthony R. Borneman,et al.  Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies , 2018, BMC Bioinformatics.

[20]  Jun Z. Li,et al.  Helmsman: fast and efficient mutation signature analysis for massive sequencing datasets , 2018, BMC Genomics.

[21]  Amanda E. Jones,et al.  Transposons, p53 and Genome Security. , 2018, Trends in genetics : TIG.

[22]  Adrian Baez-Ortega,et al.  sigfit: flexible Bayesian inference of mutational signatures , 2018, bioRxiv.

[23]  Emmanuel Paradis,et al.  ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R , 2018, Bioinform..

[24]  M. Stratton,et al.  Universal Patterns of Selection in Cancer and Somatic Tissues , 2018, Cell.

[25]  Timothy P. L. Smith,et al.  FALCON-Phase: Integrating PacBio and Hi-C data for phased diploid genomes , 2018, bioRxiv.

[26]  Ville Mustonen,et al.  The repertoire of mutational signatures in human cancer , 2018, Nature.

[27]  Stacey Price,et al.  The Origins and Vulnerabilities of Two Transmissible Cancers in Tasmanian Devils , 2018, Cancer cell.

[28]  R. Verhaak,et al.  The Tandem Duplicator Phenotype Is a Prevalent Genome-Wide Cancer Configuration Driven by Distinct Gene Mutations , 2017, bioRxiv.

[29]  Deanna M. Church,et al.  Resolving the Full Spectrum of Human Genome Variation using Linked-Reads , 2017, bioRxiv.

[30]  Omar Wagih,et al.  ggseqlogo: a versatile R package for drawing sequence logos , 2017, Bioinform..

[31]  Shamil Sunyaev,et al.  Bayesian inference of negative and positive selection in human cancers , 2017, Nature Genetics.

[32]  Nuno A. Fonseca,et al.  Comprehensive molecular characterization of mitochondrial genomes in human cancers , 2017, bioRxiv.

[33]  Hanlee P. Ji,et al.  Genomic Instability in Cancer: Teetering on the Limit of Tolerance. , 2017, Cancer research.

[34]  Steven G. Schroeder,et al.  Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome , 2017, Nature Genetics.

[35]  S. Goff,et al.  A Sixth Modality of Infectious Disease: Contagious Cancer from Devils to Clams and Beyond , 2016, PLoS pathogens.

[36]  Shallee T. Page,et al.  Complete mitochondrial genome of the soft-shell clam Mya arenaria , 2016, Mitochondrial DNA. Part A, DNA mapping, sequencing, and analysis.

[37]  Elizabeth P. Murchison,et al.  Rapid evolutionary response to a transmissible cancer in Tasmanian devils , 2016, Nature Communications.

[38]  J. Goodier Restricting retrotransposons: a review , 2016, Mobile DNA.

[39]  J. Davidson,et al.  Field and laboratory transmission studies of haemic neoplasia in the soft-shell clam, Mya arenaria, from Atlantic Canada. , 2016, Journal of fish diseases.

[40]  Neva C. Durand,et al.  Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. , 2016, Cell systems.

[41]  S. Baldwin,et al.  Widespread transmission of independent cancer lineages within multiple bivalve species , 2016, Nature.

[42]  M. Schatz,et al.  Phased diploid genome assembly with single-molecule real-time sequencing , 2016, Nature Methods.

[43]  Yufeng Wu,et al.  REPdenovo: Inferring De Novo Repeat Motifs from Short Sequence Reads , 2016, PloS one.

[44]  A. B. Lyons,et al.  A second transmissible cancer in Tasmanian devils , 2015, Proceedings of the National Academy of Sciences.

[45]  M. Stratton,et al.  Clock-like mutational processes in human somatic cells , 2015, Nature Genetics.

[46]  J. Trent,et al.  Comparison against 186 canid whole-genome sequences reveals survival strategies of an ancient clonally transmissible canine tumor , 2015, Genome research.

[47]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[48]  A. Villalba,et al.  Neoplastic diseases of marine bivalves. , 2015, Journal of invertebrate pathology.

[49]  S. Goff,et al.  Horizontal Transmission of Clonal Cancer Cells Causes Leukemia in Soft-Shell Clams , 2015, Cell.

[50]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[51]  S. Goff,et al.  Activation of transcription and retrotransposition of a novel retroelement, Steamer, in neoplastic hemocytes of the mollusk Mya arenaria , 2014, Proceedings of the National Academy of Sciences.

[52]  P. Soudant,et al.  Disseminated Neoplasia in the Soft-Shell Clam Mya arenaria: Membrane Lipid Composition and Functional Parameters of Circulating Cells , 2014, Lipids.

[53]  S. Roberts,et al.  A context dependent role for DNA methylation in bivalves. , 2014, Briefings in functional genomics.

[54]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[55]  Ira M. Hall,et al.  SAMBLASTER: fast duplicate marking and structural variant read extraction , 2014, Bioinform..

[56]  Richard Durbin,et al.  Estimating telomere length from whole genome sequence data , 2014, Nucleic acids research.

[57]  Peter J. Campbell,et al.  Transmissible Dog Cancer Genome Reveals the Origin and History of an Ancient Cell Lineage , 2014, Science.

[58]  Andrew C. Adey,et al.  Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions , 2013, Nature Biotechnology.

[59]  S. Gabriel,et al.  Pan-cancer patterns of somatic copy-number alteration , 2013, Nature Genetics.

[60]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.

[61]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[62]  M. Stratton,et al.  Deciphering Signatures of Mutational Processes Operative in Human Cancer , 2013, Cell reports.

[63]  R. Gibbs,et al.  Mind the Gap: Upgrading Genomes with Pacific Biosciences RS Long-Read Sequencing Technology , 2012, PloS one.

[64]  Zhengwei Zhu,et al.  CD-HIT: accelerated for clustering the next-generation sequencing data , 2012, Bioinform..

[65]  V. Beneš,et al.  DELLY: structural variant discovery by integrated paired-end and split-read analysis , 2012, Bioinform..

[66]  A. Børresen-Dale,et al.  Mutational Processes Molding the Genomes of 21 Breast Cancers , 2012, Cell.

[67]  C. Reinisch,et al.  Haemocytic leukemia in Prince Edward Island (PEI) soft shell clam (Mya arenaria): spatial distribution in agriculturally impacted estuaries. , 2012, The Science of the total environment.

[68]  S. Hochreiter,et al.  cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate , 2012, Nucleic acids research.

[69]  N. Friedman,et al.  Trinity : reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2016 .

[70]  F. Berthe,et al.  Induction of transposase and polyprotein RNA levels in disseminated neoplastic hemocytes of soft-shell clams: Mya arenaria. , 2011, Developmental and comparative immunology.

[71]  A. Burt,et al.  Mitochondrial Capture by a Transmissible Cancer , 2011, Science.

[72]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[73]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[74]  F. Berthe,et al.  Reverse transcriptase activity in tissues of the soft shell clam Mya arenaria affected with haemic neoplasia. , 2009, Journal of invertebrate pathology.

[75]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[76]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[77]  C. Walker,et al.  Mass Culture and Characterization of Tumor Cells From a Naturally Occurring Invertebrate Cancer Model: Applications for Human and Animal Disease and Environmental Health , 2009, The Biological Bulletin.

[78]  F. Berthe,et al.  Assessment of haemic neoplasia in different soft shell clam Mya arenaria populations from eastern Canada by flow cytometry. , 2008, Journal of invertebrate pathology.

[79]  F. Berthe,et al.  Patterns of p53, p73 and mortalin gene expression associated with haemocyte polyploidy in the soft-shell clam, Mya arenaria. , 2008, Journal of invertebrate pathology.

[80]  C. Chi,et al.  Tandem duplication/triplication correlated with poly-cytosine stretch variation in human mitochondrial DNA D-loop region. , 2008, Mutagenesis.

[81]  Robin A. Weiss,et al.  Clonal Origin and Evolution of a Transmissible Cancer , 2006, Cell.

[82]  C. Walker,et al.  Mortalin-based cytoplasmic sequestration of p53 in a nonmammalian cancer model. , 2006, The American journal of pathology.

[83]  A. Pearse,et al.  Allograft theory: Transmission of devil facial-tumour disease , 2006, Nature.

[84]  B. Barber Neoplastic diseases of commercially important marine bivalves , 2004 .

[85]  D. Leavitt,et al.  Experimental field studies with Mya arenaria (Bivalvia) on the induction and effect of hematopoietic neoplasia. , 1997, Journal of invertebrate pathology.

[86]  P. Reno,et al.  Flow cytometric and chromosome analysis of softshell clams, Mya arenaria, with disseminated neoplasia , 1994 .

[87]  I. Sunila Respiration of sarcoma cells from the soft-shell clam Mya arenaria L. under various conditions , 1991 .

[88]  R. F. Scott,et al.  Epizootiology and distribution of transmissible sarcoma in Maryland softshell clams, Mya arenaria, 1984-1988. , 1991, Environmental health perspectives.

[89]  C. Reinisch,et al.  Hematopoietic neoplasia inMya arenaria: Prevalence and indices of physiological condition , 1990 .

[90]  C. Reinisch,et al.  Unique antigens on neoplastic cells of the soft shell clam Mya arenaria. , 1983, Developmental and comparative immunology.

[91]  J. J. Oprandy,et al.  5-bromodeoxyuridine induction of hematopoietic neoplasia and retrovirus activation in the soft-shell clam, Mya arenaria. , 1983, Journal of invertebrate pathology.

[92]  K. Cooper,et al.  The course and mortality of a hematopoietic neoplasm in the soft-shell clam, , 1982 .

[93]  P P Yevich,et al.  NEOPLASIA IN SOFT‐SHELL CLAMS (MY A ARENARIA) COLLECTED FROM OIL‐IMPACTED SITES , 1977, Annals of the New York Academy of Sciences.

[94]  S. Saila,et al.  PREVALENCE OF NEOPLASIA IN 10 NEW ENGLAND POPULATIONS OF THE SOFT‐SHELL CLAM (MYA ARENARIA) * , 1977, Annals of the New York Academy of Sciences.

[95]  F. Berthe,et al.  Transcriptome analysis of neoplastic hemocytes in soft-shell clams Mya arenaria: Focus on cell cycle molecular mechanism. , 2013, Results in immunology.

[96]  N. Taraska,et al.  Selective initiation and transmission of disseminated neoplasia in the soft shell clam Mya arenaria dependent on natural disease prevalence and animal size. , 2013, Journal of invertebrate pathology.

[97]  F. Berthe,et al.  Expression of RAS-like family members, c-jun and c-myc mRNA levels in neoplastic hemocytes of soft-shell clams Mya arenaria using microsphere-based 8-plex branched DNA assay. , 2012, Results in immunology.

[98]  F. Berthe,et al.  Lack of detection of a putative retrovirus associated with haemic neoplasia in the soft shell clam Mya arenaria. , 2012, Journal of invertebrate pathology.

[99]  F. Hetrick,et al.  Transmission studies of sarcoma in the soft-shell clam, Mya arenaria. , 1992, In vivo.

[100]  J. Baglivo,et al.  Field and laboratory comparisons of mortality in normal and neoplastic Mya arenaria. , 1991, Journal of invertebrate pathology.

[101]  J. Doyle,et al.  Isolation of plant DNA from fresh tissue , 1990 .

[102]  C. Farley,et al.  Environmental limits for survival of sarcoma cells from the soft-shell clam Mya aremaria , 1989 .