Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts

An EST database has been generated for coffee based on sequences from approximately 47,000 cDNA clones derived from five different stages/tissues, with a special focus on developing seeds. When computationally assembled, these sequences correspond to 13,175 unigenes, which were analyzed with respect to functional annotation, expression profile and evolution. Compared with Arabidopsis, the coffee unigenes encode a higher proportion of proteins related to protein modification/turnover and metabolism, an observation that may explain the high diversity of metabolites found in coffee and related species. Several gene families were found to be either expanded or unique to coffee when compared with Arabidopsis. A high proportion of these families encode proteins assigned to functions related to disease resistance. Such families may have expanded and evolved rapidly under the intense pathogen pressure experienced by a tropical, perennial species like coffee. Finally, the coffee gene repertoire was compared with that of Arabidopsis and Solanaceous species (e.g. tomato). Unlike Arabidopsis, tomato has a nearly perfect gene-for-gene match with coffee. These results are consistent with the facts that coffee and tomato have a similar genome size, chromosome karyotype (tomato, n=12; coffee, n=11) and chromosome architecture. Moreover, both belong to the Asterid I clade of dicot plant families. Thus, the biology of coffee (family Rubiaceae) and tomato (family Solanaceae) may be united into one common network of shared discoveries, resources and information.
