Many Parallel Losses of infA from Chloroplast DNA during Angiosperm Evolution with Multiple Independent Transfers to the Nucleus

We used DNA sequencing and gel blot surveys to assess the integrity of the chloroplast gene infA, which codes for translation initiation factor 1, in >300 diverse angiosperms. Whereas most angiosperms appear to contain an intact chloroplast infA gene, the gene has repeatedly become defunct in ∼24 separate lineages of angiosperms, including almost all rosid species. In four species in which chloroplast infA is defunct, transferred and expressed copies of the gene were found in the nucleus, complete with putative chloroplast transit peptide sequences. The transit peptide sequences of the nuclear infA genes from soybean and Arabidopsis were shown to be functional by their ability to target green fluorescent protein to chloroplasts in vivo. Phylogenetic analysis of infA sequences and assessment of transit peptide homology indicate that the four nuclear infA genes are probably derived from four independent gene transfers from chloroplast to nuclear DNA during angiosperm evolution. Considering this and the many separate losses of infA from chloroplast DNA, the gene has probably been transferred many more times, making infA by far the most mobile chloroplast gene known in plants.

[1]  W. Doolittle You are what you eat: a gene transfer ratchet could account for bacterial genes in eukaryotic nuclear genomes. , 1998, Trends in genetics : TIG.

[2]  M. Sugiura,et al.  Loss of all ndh genes as determined by sequencing the entire chloroplast genome of the black pine Pinus thunbergii. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[3]  J. Palmer,et al.  Multigene analyses identify the three earliest lineages of extant flowering plants , 1999, Current Biology.

[4]  M. Bubunenko,et al.  Protein substitution in chloroplast ribosome evolution. A eukaryotic cytosolic protein has replaced its organelle homologue (L23) in spinach. , 1994, Journal of molecular biology.

[5]  T. Konishi,et al.  Acetyl-CoA carboxylase in higher plants: most plants other than gramineae have both the prokaryotic and the eukaryotic forms of this enzyme. , 1996, Plant & cell physiology.

[6]  M. S. Khan,et al.  Transient expression of green fluorescent protein in various plastid types following microprojectile bombardment , 1998 .

[7]  G. von Heijne Why mitochondria need a genome , 1986, FEBS letters.

[8]  J. Popot,et al.  On the microassembly of integral membrane proteins. , 1990, Annual review of biophysics and biophysical chemistry.

[9]  M. Donoghue,et al.  The root of angiosperm phylogeny inferred from duplicate phytochrome genes. , 1999, Science.

[10]  T. Andrews,et al.  Accelerated Evolution of Cytochrome b in Simian Primates: Adaptive Evolution in Concert with Other Mitochondrial Proteins? , 1998, Journal of Molecular Evolution.

[11]  K. Kousoulas,et al.  Efficient production of single-stranded DNA as long as 2 kb for sequencing of PCR-amplified DNA. , 1992, BioTechniques.

[12]  L. Urbatsch,et al.  PHYLOGENY OF SUBFAMILY EPIDENDROIDEAE (ORCHIDACEAE) INFERRED FROM NDHF CHLOROPLAST GENE SEQUENCES , 1996 .

[13]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[14]  W. Martin,et al.  Why have organelles retained genomes? , 1999, Trends in genetics : TIG.

[15]  Mark W. Chase,et al.  The earliest angiosperms: evidence from mitochondrial, plastid and nuclear genomes , 1999, Nature.

[16]  F. Takaiwa,et al.  The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression , 1986, The EMBO journal.

[17]  J. Palmer,et al.  The Origin and Evolution of Plastids and Their Genomes , 1998 .

[18]  A. Subramanian,et al.  The Plastid Ribosomal Proteins , 2000, The Journal of Biological Chemistry.

[19]  J. Palmer,et al.  Function and evolution of a minimal plastid genome from a nonphotosynthetic parasitic plant. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[20]  S. Brunak,et al.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. , 2000, Journal of molecular biology.

[21]  M. Lynch,et al.  Organellar genes: why do they end up in the nucleus? , 2000, Trends in genetics : TIG.

[22]  J. Laroche,et al.  Molecular evolution of angiosperm mitochondrial introns and exons. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[23]  G. Igloi,et al.  Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. , 1995, Journal of molecular biology.

[24]  J. Palmer,et al.  Transfer of rpl22 to the nucleus greatly preceded its loss from the chloroplast and involved the gain of an intron. , 1991, The EMBO journal.

[25]  L. Spremulli,et al.  Isolation and characterization of cDNA clones for chloroplast translational initiation factor-3 from Euglena gracilis. , 1994, The Journal of biological chemistry.

[26]  L. Spremulli,et al.  Regulation of the Activity of Chloroplast Translational Initiation Factor 3 by NH2- and COOH-Terminal Extensions* , 1998, The Journal of Biological Chemistry.

[27]  M. Boguski,et al.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Douglas E. Soltis,et al.  Molecular Systematics of Plants , 1992, Springer US.

[29]  Brendan D. McKay,et al.  TrExML: a maximum-likelihood approach for extensive tree-space exploration , 2000, Bioinform..

[30]  J. Smith Phylogenetic resolution within the tribe Episcieae (Gesneriaceae): congruence of ITS and NDHF sequences from parsimony and maximum-likelihood analyses. , 2000, American journal of botany.

[31]  M. Grunberg‐Manago,et al.  Posttranscriptional autoregulation of Escherichia coli threonyl tRNA synthetase expression in vivo , 1986, Journal of bacteriology.

[32]  C. Gualerzi,et al.  The structure of the translational initiation factor IF1 from E.coli contains an oligomer‐binding motif , 1997, The EMBO journal.

[33]  U. Gyllensten Direct Sequencing of In Vitro Amplified DNA , 1989 .

[34]  N. Kubo,et al.  Targeting presequence acquisition after mitochondrial gene transfer to the nucleus occurs by duplication of existing targeting signals. , 1996, The EMBO journal.

[35]  C. Jacq,et al.  Limitations to in vivo import of hydrophobic proteins into yeast mitochondria. The case of a cytoplasmically synthesized apocytochrome b. , 1995, European journal of biochemistry.

[36]  Y. Nakamura,et al.  Complete structure of the chloroplast genome of Arabidopsis thaliana. , 1999, DNA research : an international journal for rapid publication of reports on genes and genomes.

[37]  M. Hasegawa,et al.  Gene transfer to the nucleus and the evolution of chloroplasts , 1998, Nature.

[38]  J. Palmer,et al.  Multiple Independent Losses of Two Genes and One Intron from Legume Chloroplast Genomes , 1995 .

[39]  D. Soltis,et al.  Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology , 1999, Nature.

[40]  R. Mache,et al.  Expression of the rpl23, rpl2 and rps19 genes in spinach chloroplasts. , 1988, Nucleic acids research.

[41]  Reinhold G. Herrmann,et al.  Eukaryotism, Towards a New Interpretation , 1997 .

[42]  K. Harada,et al.  A single nuclear transcript encoding mitochondrial RPS14 and SDHB of rice is processed by alternative splicing: common use of the same mitochondrial targeting signal for different proteins. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[43]  W. Martin,et al.  The evolution of the Calvin cycle from prokaryotic to eukaryotic chromosomes: a case study of functional redundancy in ancient pathways through endosymbiosis , 1997, Current Genetics.

[44]  C. Kurland,et al.  Why mitochondrial genes are most often found in nuclei. , 2000, Molecular biology and evolution.

[45]  A. Nilsson,et al.  Photosynthetic control of chloroplast gene expression , 1999, Nature.

[46]  Wen-Hsiung Li,et al.  Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[47]  T. Sasaki,et al.  Transfer of the mitochondrial rps10 gene to the nucleus in rice: acquisition of the 5′ untranslated region followed by gene duplication , 2000, Molecular and General Genetics MGG.

[48]  Jeffrey D. Palmer,et al.  Repeated, recent and diverse transfers of a mitochondrial gene to the nucleus in flowering plants , 2000, Nature.

[49]  J. Allen,et al.  Control of gene expression by redox potential and the requirement for chloroplast and mitochondrial genomes. , 1993, Journal of theoretical biology.

[50]  Yangrae Cho,et al.  The gain of three mitochondrial introns identifies liverworts as the earliest land plants , 1998, Nature.

[51]  P. Figueroa,et al.  Transfer of rps14 from the mitochondrion to the nucleus in maize implied integration within a gene encoding the iron-sulphur subunit of succinate dehydrogenase and expression by alternative splicing. , 1999, The Plant journal : for cell and molecular biology.

[52]  B F Lang,et al.  Mitochondrial genome evolution and the origin of eukaryotes. , 1999, Annual review of genetics.

[53]  M. Claros,et al.  SUBUNIT III OF CYTOCHROME c OXIDASE IS ENCODED IN THE NUCLEUS OF CHLAMYDOMONAD ALGAE , 2000 .

[54]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[55]  J W Hershey,et al.  Translation initiation factor IF1 is essential for cell viability in Escherichia coli , 1994, Journal of bacteriology.

[56]  R. Olmstead,et al.  The phylogeny of the Asteridae sensu lato based on chloroplast ndhF gene sequences. , 2000, Molecular phylogenetics and evolution.

[57]  A. J. Bendich Why do chloroplasts and mitochondria contain so many copies of their genome? , 1987, BioEssays : news and reviews in molecular, cellular and developmental biology.

[58]  Herrmann,et al.  Gene transfer from organelles to the nucleus: how much, what happens, and Why? , 1998, Plant Physiology.

[59]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[60]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Yangrae Cho,et al.  Dynamic evolution of plant mitochondrial genomes: mobile genes and introns and highly variable mutation rates. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[62]  G. Heijne,et al.  ChloroP, a neural network‐based method for predicting chloroplast transit peptides and their cleavage sites , 1999, Protein science : a publication of the Protein Society.

[63]  A. Subramanian,et al.  The plastid ribosomal proteins. Identification of all the proteins in the 50 S subunit of an organelle ribosome (chloroplast). , 2000, The Journal of biological chemistry.

[64]  A. Tsugita,et al.  [Protein synthesis in mitochondria]. , 1965, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme.

[65]  Limitations to in vivo import of hydrophobic proteins into yeast mitochondria. The case of a cytoplasmically synthesized apocytochrome b. , 1995 .

[66]  P. Herendeen,et al.  Phylogenetic pattern, diversity, and diversification of Eudicots , 1999 .

[67]  Jeffrey D. Palmer,et al.  Use of Chloroplast DNA Rearrangements in Reconstructing Plant Phylogeny , 1992 .

[68]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[69]  M. Cotton,et al.  Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana , 1999, Nature.

[70]  R. Herrmann,et al.  Spinach plastid genes coding for initiation factor IF-1, ribosomal protein S11 and RNA polymerase α -subunit , 1986 .

[71]  G. Wagner,et al.  The eIF1A solution structure reveals a large RNA-binding surface important for scanning function. , 2000, Molecular cell.