The role of insertions/deletions in the evolution of the intergenic region betweenpsbA andtrnH in the chloroplast genome

SummaryTrnH and the intergenic region betweentrnH andpsbA of the chloroplast genomes of alfalfa (Medicago sativa), Fabaceae, and petunia (Petunia hybrida), Solanaceae, were sequenced and compared to published sequences of that region from other members of those families. A striking feature of these comparisons is the occurrence of insertions/deletions between short, nearly perfect AT-rich direct repeats. The directionality of these mutations in the petunia, tobacco andNicotiana debneyi lineages within the Solanaceae cannot be discerned. However, we present several alternative hypotheses that are consistent with Goodspeed's 1954 evolutionary treatment of the genusNicotiana and family Solanaceae. Within the Fabaceae, the major size differences in the intergenic region between alfalfa, pea and soybean are due to insertions/ deletions between direct repeats. The alfalfa intergenic region has an inverted repeat stem-loop structure of 210 bases directly 5′ totrnH. This structure is an insert relative to the liverwort.Marchantia polymorpha. Portions of the insert are found also in pea and soybean as well as in published sequences from other dicots representing diverse orders: petunia, tobacco,N. debneyi (Scrophulariales), spinach (Caryophyllales), and Brassica napus (Capparales). Some of the regions of the insert that are missing in these plants appear to have resulted from deletions of sequences between different imperfect direct repeats within, or 5′ to and within the insert. Other deletions are not flanked by repeated sequences. A shrot insert flanked by imperfect direct repeats inB. napus occurs just witin the longer alfalfa insert suggesting that both alfalfa andB. napus have remnants of an even longer insert relative toM. polymorpha. From these analyses we hypothesize the insertion of a stem-loop structure into anM. polymorpha-like ancestral land plant, followd by deletions of sequences, often between different imperfect direct repeats within and upstream of the insert, leading to thepsbA-trnH intergenic sequences represented by the present-day plants examined.

[1]  C. Howe The endpoints of an inversion in wheat chloroplast DNA are associated with short repeated sequences containing homology toatt-lambda , 2004, Current Genetics.

[2]  Minoru I. Kanehisa,et al.  Pattern recognition in nucleic acid sequences. II. An efficient method for finding locally stable secondary structures , 1982, Nucleic Acids Res..

[3]  J. Vieira,et al.  A new pair of M13 vectors for selecting either DNA strand of double-digest restriction fragments. , 1982, Gene.

[4]  J. Palmer,et al.  Comparative organization of chloroplast genomes. , 1985, Annual review of genetics.

[5]  P. Maliga,et al.  Interspecific chloroplast recombination in a Nicotiana somatic hybrid. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[6]  R. Dale,et al.  A rapid single-stranded cloning strategy for producing a sequential series of overlapping clones for use in DNA sequencing: application to sequencing the corn mitochondrial 18 S rDNA. , 1985, Plasmid.

[7]  G. Zurawski,et al.  Junctions of the large single copy region and the inverted repeats in Spinacia oleracea and Nicotiana debneyi chloroplast DNA: sequence of the genes for tRNAHis and the ribosomal proteins S19 and L2. , 1984, Nucleic acids research.

[8]  N. Straus,et al.  Nucleotide sequence of the chloroplast gene responsible for triazine resistance in canola , 2004, Theoretical and Applied Genetics.

[9]  W. Rottmann,et al.  A mitochondrial gene is lost via homologous recombination during reversion of CMS T maize to fertility , 1987, The EMBO journal.

[10]  E. Stutz,et al.  Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats. , 1983, Nucleic acids research.

[11]  K. Tewari,et al.  Nucleotide sequences of transfer RNA genes in the Pisum sativum chloroplast DNA , 2004, Plant Molecular Biology.

[12]  M. Sugiura,et al.  Nucleotide sequence and transcription of the gene for the 32,000 dalton thylakoid membrane protein from Nicotiana tabacum , 1984, Molecular and General Genetics MGG.

[13]  J. Palmer,et al.  Chloroplast DNA evolution among legumes: Loss of a large inverted repeat occurred prior to other sequence rearrangements , 2004, Current Genetics.

[14]  J. Palmer,et al.  Evolution of Chloroplast and Mitochondrial DNA in Plants and Algae , 1985 .

[15]  G. Link,et al.  Structure of the chloroplast gene for the precursor of the Mr 32,000 photosystem II protein from mustard (Sinapis alba L.). , 1984, Nucleic acids research.

[16]  F. Sanger,et al.  DNA sequencing with chain-terminating inhibitors. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[17]  W. Gruissem,et al.  Biosynthesis of chloroplast transfer RNA in a spinach chloroplast transcription system , 1983, Cell.

[18]  W. B. Turrill,et al.  The Genus Nicotiana , 1955 .

[19]  M. Clegg,et al.  The Nature of Nucleotide Sequence Divergence between Barley and Maize Chloroplast DNA. , 1984, Genetics.

[20]  Sequence of the chloroplast-encoded psbA gene for the QB polypeptide of alfalfa. , 1986, Nucleic acids research.

[21]  H. Bohnert,et al.  Points of rearrangements between plastid chromosomes: location of protein coding regions on broad bean chloroplast DNA , 2004, Current Genetics.

[22]  T. Ellis,et al.  Chloroplast DNA deletions associated with wheat plants regenerated from pollen: possible basis for maternal inheritance of chloroplasts , 1984, Cell.

[23]  M. Sugiura,et al.  Nucleotide sequence of the 16S - 23S spacer region in an rRNA gene cluster from tobacco chloroplast DNA. , 1982, Nucleic acids research.

[24]  A. Steinmetz,et al.  Overlapping divergent genes in the maize chloroplast chromosome and in vitro transcription of the gene for tRNA. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[25]  T. Dyer,et al.  The location and possible evolutionary significance of small dispersed repeats in wheat ctDNA , 1986, Current Genetics.

[26]  M. I. Kanehisa,et al.  Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries , 1982, Nucleic Acids Res..

[27]  J. Palmer,et al.  Physical and gene mapping of chloroplast DNA from Atriplex triangularis and Cucumis sativa. , 1982, Nucleic acids research.

[28]  J. Palmer,et al.  Tripartite structure of the Brassica campestris mitochondrial genome , 1984, Nature.

[29]  Insertion/deletion mutations in the Zea chloroplast genome , 2004, Current Genetics.

[30]  T. Kohchi,et al.  Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA , 1986, Nature.

[31]  M. I. Kanehisa,et al.  Los Alamos sequence analysis package for nucleic acids and proteins , 1982, Nucleic Acids Res..