Mitochondrial DNA‐like sequences in the nucleus (NUMTs): Insights into our African origins and the mechanism of foreign DNA integration

Nuclear mitochondrial DNA sequences (NUMTs) are common in eukaryotes. However, the mechanism by which they integrate into the nuclear genome remains a riddle. We analyzed 247 NUMTs in the human nuclear DNA (nDNA), along with their flanking regions. This analysis revealed that some NUMTs have accumulated many changes, and thus have resided in the nucleus a long time, while others are >94% similar to the reference human mitochondrial DNA (mtDNA), and thus must be recent. Among the latter, two NUMTs, encompassing the COI gene, carry a set of transitions characteristic of the extant African‐specific L macrohaplogroup mtDNAs and are more homologous to human mtDNA than to chimp. Screening for one of these NUMTs revealed its presence in all human samples tested, confirming that the African macrohaplogroup L mtDNAs were present in the earliest modern humans and thus were the first human mtDNAs. An analysis of flanking sequences of the NUMTs revealed that 59% were within 150 bp of repetitive elements, with 26% being within 15 bp of and 33% being within 15–150 bp of repetitive elements. Only 14% were integrated into a repetitive element. This association of NUMTs with repetitive elements is highly nonrandom (p<0.001). These data suggest that the vicinity of transposable elements influences the ongoing integration of mtDNA sequences and their subsequent duplication within the nDNA. Finally, NUMTs appear to preferentially integrate into DNA with different GC content than the surrounding chromosomal band. Our results suggest that chromosomal structure might influence integration of NUMTs. Hum Mutat 23:125–133, 2004. © 2003 Wiley‐Liss, Inc.

[1]  M. Wagatsuma,et al.  Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: amplification of viral sequences together with cellular flanking sequences , 1990, Journal of virology.

[2]  V. Sheffield,et al.  Characterization of Alu repeats that are associated with trinucleotide and tetranucleotide repeat microsatellites. , 1997, Genome research.

[3]  N Howell,et al.  Mutations in mitochondrial cytochrome c oxidase genes segregate with late-onset Alzheimer disease. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[4]  B. Kerem,et al.  Common fragile sites: G-band characteristics within an R-band. , 1999, American journal of human genetics.

[5]  A. Smit,et al.  The origin of interspersed repeats in the human genome. , 1996, Current opinion in genetics & development.

[6]  C. Spadafora,et al.  Integration of foreign DNA sequences into mouse sperm genome. , 1997, DNA and cell biology.

[7]  M. Schmid,et al.  Heterochromatin is not an adequate explanation for close proximity of interphase chromosomes 1--Y, 9--Y, and 16--Y in human spermatozoa. , 2001, Experimental cell research.

[8]  B. Baysal,et al.  Germ line insertion of mtDNA at the breakpoint junction of a reciprocal constitutional translocation , 2001, Human Genetics.

[9]  W. Martinez-Lopez,et al.  Chromosome regions enriched in hyperacetylated histone H4 are preferred sites for endonuclease- and radiation-induced breakpoints , 2004, Chromosome Research.

[10]  A. Furano,et al.  Insertion of L1 elements into sites that can form non-B DNA. Interactions of non-B DNA-forming sequences. , 1989, The Journal of biological chemistry.

[11]  M. Gerstein,et al.  Identification and analysis of over 2000 ribosomal protein pseudogenes in the human genome. , 2002, Genome research.

[12]  S. Scherer,et al.  Molecular characterization of a common fragile site (FRA7H) on human chromosome 7 by the cloning of a simian virus 40 integration site. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[13]  B. Kerem,et al.  Mapping of DNAase I sensitive regions on mitotic chromosomes , 1984, Cell.

[14]  T. Tsuzuki,et al.  Presence of mitochondrial-DNA-like sequences in the human nuclear DNA. , 1983, Gene.

[15]  M. V. Filatov,et al.  Camptothecin enhances random integration of transfected DNA into the genome of mammalian cells. , 2000, Biochimica et biophysica acta.

[16]  Marty C. Brandon,et al.  Natural selection shaped regional mtDNA variation in humans , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Philippe Dessen,et al.  Structure and chromosomal distribution of human mitochondrial pseudogenes. , 2002, Genomics.

[18]  J. Blanchard,et al.  Mitochondrial DNA migration events in yeast and humans: integration by a common end-joining mechanism and alternative perspectives on nucleotide substitution patterns. , 1996, Molecular biology and evolution.

[19]  H. Koyama,et al.  DNA topoisomerase II inhibitors enhance random integration of transfected vectors into human chromosomes , 1996, Somatic cell and molecular genetics.

[20]  D. Murdock,et al.  Ancient mtDNA sequences in the human nuclear genome: a potential source of errors in identifying pathogenic mutations. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[21]  D. Turnbull,et al.  Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA , 1999, Nature Genetics.

[22]  D. Wallace,et al.  mtDNA variation in the South African Kung and Khwe-and their genetic relationships to other African populations. , 2000, American journal of human genetics.

[23]  G. Bernardi,et al.  Correlations between isochores and chromosomal bands in the human genome. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[24]  G. Bernardi,et al.  Similar integration but different stability of Alus and LINEs in the human genome. , 2001, Gene.

[25]  D. Hartl,et al.  Mitochondrial pseudogenes: evolution's misplaced witnesses. , 2001, Trends in ecology & evolution.

[26]  Alexander E Vinogradov,et al.  DNA helix: the importance of being AT-rich , 2003, Mammalian Genome.

[27]  F. Sanger,et al.  Sequence and organization of the human mitochondrial genome , 1981, Nature.

[28]  A. Maiti,et al.  Isolation, in silico characterization and chromosomal localization of a group of cDNAs from ciliated epithelial cells after in vitro ciliogenesis , 2001, Genome Biology.

[29]  A. Bodley,et al.  Integration of simian virus 40 into cellular DNA occurs at or near topoisomerase II cleavage hot spots induced by VM-26 (teniposide) , 1993, Molecular and cellular biology.

[30]  J. Slightom,et al.  Mitochondrial D-loop sequences are integrated in the rat nuclear genome. , 1991, Journal of molecular biology.

[31]  P. Ossorio,et al.  Mitochondrial-like DNA sequences flanked by direct and inverted repeats in the nuclear genome of Toxoplasma gondii. , 1991, Journal of molecular biology.

[32]  R. Mahalingam,et al.  Selected nuclear LINE elements with mitochondrial‐DNA‐like inserts are more plentiful and mobile in tumor than in normal tissue of mouse and rat , 1998, Journal of cellular biochemistry.

[33]  D. Roth,et al.  Mechanisms of nonhomologous recombination in mammalian cells , 1985, Molecular and cellular biology.

[34]  S. Pääbo,et al.  Mitochondrial genome variation and the origin of modern humans , 2000, Nature.

[35]  Ronald A. Butow,et al.  Rearranged mitochondrial genes in the yeast nuclear genome , 1983, Nature.

[36]  R. Sinden Biological implications of the DNA structures associated with disease-causing triplet repeats. , 1999, American journal of human genetics.

[37]  J. Rowley,et al.  Genomic DNA breakpoints in AML1/RUNX1 and ETO cluster with topoisomerase II DNA cleavage and DNase I hypersensitive sites in t(8;21) leukemia , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[38]  S. Pääbo,et al.  A nuclear 'fossil' of the mitochondrial D-loop and the origin of modern humans , 1995, Nature.

[39]  M. Woischnik,et al.  Pattern of organization of human mitochondrial pseudogenes in the nuclear genome. , 2002, Genome research.

[40]  Samuel Karlin,et al.  Genes, pseudogenes, and Alu sequence organization across human chromosomes 21 and 22 , 2002, Proceedings of the National Academy of Sciences of the United States of America.