Comparative genome microsynteny illuminates the fast evolution of nuclear mitochondrial segments (NUMTs) in mammals

The escape of DNA from mitochondria into the nuclear genome, which gives rise to nuclear mitochondrial DNA segments (NUMTs), is an ongoing process. Although NUMTs are pervasive in eukaryotic genomes, their evolutionary trajectories in a mammal-wide context are poorly understood. The main challenge lies in assigning orthology to NUMTs across species, which is complicated by their fast evolution and by chromosomal rearrangements over the past ∼200 million years. To address this issue, we systematically investigated the characteristics of NUMT insertions in 45 mammalian genomes and established a novel synteny-based method to accurately predict orthologous NUMTs and ascertain their evolution across mammals. Through a series of comparative analyses across taxa, we found that NUMTs may originate from non-random regions of the mtDNA, tend to be located in transposon-rich and intergenic regions, and are unlikely to code for functional proteins. Using our synteny-based approach, we leveraged 630 pairwise comparisons of genome-wide microsynteny to predict NUMT orthology relationships across 36 mammals. From the phylogenetic patterns of NUMT presence and absence across taxa, we reconstructed the ancestral states of NUMTs on the mammal tree using a coalescent method. We found support for the ancestral node of Fereuungulata within Laurasiatheria, whose subordinal relationships are still controversial. This strongly indicates that NUMT gain and loss over evolutionary time offers insight into mammal evolution. However, we also demonstrate that one should be cautious when using ancestral NUMT trees to infer phylogenetic relationships. This study broadens our knowledge of NUMT insertion and evolution in mammalian genomes and highlights the merit of NUMTs as alternative genetic markers in phylogenetic inference.
