Establishing glucose- and ABA-regulated transcription networks in Arabidopsis by microarray analysis and promoter classification using a Relevance Vector Machine.

Establishing transcriptional regulatory networks by analysis of gene expression data and promoter sequences shows great promise. We developed a novel promoter classification method using a Relevance Vector Machine (RVM) and Bayesian statistical principles to identify discriminatory features in the promoter sequences of genes that can correctly classify transcriptional responses. The method was applied to microarray data obtained from Arabidopsis seedlings treated with glucose or abscisic acid (ABA). Of those genes showing >2.5-fold changes in expression level, approximately 70% were correctly predicted as being up- or down-regulated (under 10-fold cross-validation), based on the presence or absence of a small set of discriminative promoter motifs. Many of these motifs have known regulatory functions in sugar- and ABA-mediated gene expression. One promoter motif that was not known to be involved in glucose-responsive gene expression was identified as the strongest classifier of glucose-up-regulated gene expression. We show it confers glucose-responsive gene expression in conjunction with another promoter motif, thus validating the classification method. We were able to establish a detailed model of glucose and ABA transcriptional regulatory networks and their interactions, which will help us to understand the mechanisms linking metabolism with growth in Arabidopsis. This study shows that machine learning strategies coupled to Bayesian statistical methods hold significant promise for identifying functionally significant promoter sequences.

[1]  Sean R Eddy,et al.  What is Bayesian statistics? , 2004, Nature Biotechnology.

[2]  S. Yanagisawa,et al.  Differential regulation of EIN3 stability by glucose and ethylene signalling in plants , 2003, Nature.

[3]  Roland Eils,et al.  Applying Support Vector Machines for Gene ontology based gene function prediction , 2004, BMC Bioinformatics.

[4]  Q. Shen,et al.  Modular nature of abscisic acid (ABA) response complexes: composite promoter units that are necessary and sufficient for ABA induction of gene expression in barley. , 1996, The Plant cell.

[5]  P. Weisbeek,et al.  The Arabidopsis SUCROSE UNCOUPLED-6 gene is identical to ABSCISIC ACID INSENSITIVE-4: involvement of abscisic acid in sugar responses. , 2000, The Plant journal : for cell and molecular biology.

[6]  D. Mitchell,et al.  Photorepair mutants of Arabidopsis. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[7]  S. Rhee,et al.  AraCyc: A Biochemical Pathway Database for Arabidopsis1 , 2003, Plant Physiology.

[8]  Y. Kamiya,et al.  Identification of cis-Elements That Regulate Gene Expression during Initiation of Axillary Bud Outgrowth in Arabidopsis[w] , 2005, Plant Physiology.

[9]  Filip Rolland,et al.  Role of the Arabidopsis Glucose Sensor HXK1 in Nutrient, Light, and Hormonal Signaling , 2003, Science.

[10]  Ashverya Laxmi,et al.  Global Transcription Profiling Reveals Multiple Sugar Signal Transduction Mechanisms in Arabidopsis , 2004, The Plant Cell Online.

[11]  Gloria M Coruzzi,et al.  Genome-wide investigation of light and carbon signaling interactions in Arabidopsis , 2004, Genome Biology.

[12]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[13]  H. Bussemaker,et al.  Regulatory element detection using correlation with expression , 2001, Nature Genetics.

[14]  C. Büche,et al.  eid1: A New Arabidopsis Mutant Hypersensitive in Phytochrome A–Dependent High-Irradiance Responses , 2000, Plant Cell.

[15]  Yi Li,et al.  Bayesian automatic relevance determination algorithms for classifying gene expression data. , 2002, Bioinformatics.

[16]  S. Rhee,et al.  MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. , 2004, The Plant journal : for cell and molecular biology.

[17]  K. Shinozaki,et al.  OsDREB genes in rice, Oryza sativa L., encode transcription activators that function in drought-, high-salt- and cold-responsive gene expression. , 2003, The Plant journal : for cell and molecular biology.

[18]  M. Pagés,et al.  Maize DRE-binding proteins DBF1 and DBF2 are involved in rab17 regulation through the drought-responsive element in an ABA-dependent pathway. , 2002, The Plant journal : for cell and molecular biology.

[19]  Michael E. Tipping Sparse Bayesian Learning and the Relevance Vector Machine , 2001, J. Mach. Learn. Res..

[20]  Yoshihiro Ugawa,et al.  Plant cis-acting regulatory DNA elements (PLACE) database: 1999 , 1999, Nucleic Acids Res..

[21]  A. Cashmore,et al.  Chimeric Proteins between cry1 and cry2 Arabidopsis Blue Light Photoreceptors Indicate Overlapping Functions and Varying Protein Stability , 1998, Plant Cell.

[22]  J. Mol,et al.  Molecular Analysis of the anthocyanin2 Gene of Petunia and Its Role in the Evolution of Flower Color , 1999, Plant Cell.

[23]  A. J. Gammerman,et al.  Plant promoter prediction with confidence estimation , 2005, Nucleic acids research.

[24]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[25]  J. Görlach,et al.  Growth Stage–Based Phenotypic Analysis of Arabidopsis , 2001, The Plant Cell Online.

[26]  G. Fink,et al.  A Myb homologue, ATR1, activates tryptophan gene expression in Arabidopsis. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Kazuo Shinozaki,et al.  Arabidopsis AtMYC2 (bHLH) and AtMYB2 (MYB) Function as Transcriptional Activators in Abscisic Acid Signaling Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.006130. , 2003, The Plant Cell Online.

[28]  B. Haas,et al.  Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release , 2005, BMC Biology.

[29]  R. Stracke,et al.  The R2R3-MYB gene family in Arabidopsis thaliana. , 2001, Current opinion in plant biology.

[30]  P. León,et al.  Analysis of Arabidopsis glucose insensitive mutants, gin5 and gin6, reveals a central role of the plant hormone ABA in the regulation of plant vegetative development by sugar. , 2000, Genes & development.

[31]  A. V. Van Dijken,et al.  Trehalose 6-phosphate is indispensable for carbohydrate utilization and growth in Arabidopsis thaliana , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[32]  K. Shinozaki,et al.  Identification of a cis-regulatory region of a gene in Arabidopsis thaliana whose induction by dehydration is mediated by abscisic acid and requires protein synthesis , 1995, Molecular and General Genetics MGG.

[33]  K. Harter,et al.  Nuclear localization of the Arabidopsis blue light receptor cryptochrome 2. , 1999, The Plant journal : for cell and molecular biology.

[34]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[35]  B. Frey,et al.  The functional landscape of mouse gene expression , 2004, Journal of biology.

[36]  P. Busk,et al.  Regulatory elements in vivo in the promoter of the abscisic acid responsive gene rab17 from maize. , 1997, The Plant journal : for cell and molecular biology.

[37]  M. Ahmad,et al.  Association of flavin adenine dinucleotide with the Arabidopsis blue light receptor CRY1 , 1995, Science.

[38]  R. Rodriguez,et al.  Three cis-elements required for rice α-amylase Amy3D expression during sugar starvation , 1998, Plant Molecular Biology.

[39]  X. Deng,et al.  Arabidopsis bZIP Protein HY5 Directly Interacts with Light-Responsive Promoters in Mediating Light Control of Gene Expression , 1998, Plant Cell.

[40]  Julian I Schroeder,et al.  Microarray Expression Analyses of Arabidopsis Guard Cells and Isolation of a Recessive Abscisic Acid Hypersensitive Protein Phosphatase 2C Mutant Online version contains Web-only data. , 2004, The Plant Cell Online.

[41]  L. Garnier,et al.  Internal telomeric repeats and 'TCP domain' protein-binding sites co-operate to regulate gene expression in Arabidopsis thaliana cycling cells. , 2003, The Plant journal : for cell and molecular biology.

[42]  D. Tremousaygue,et al.  In synergy with various cis‐acting elements, plant insterstitial telomere motifs regulate gene expression in Arabidopsis root meristems , 2000, FEBS letters.

[43]  N. Wei,et al.  Combinatorial interplay of promoter elements constitutes the minimal determinants for light and developmental control of gene expression in Arabidopsis. , 1996, The EMBO journal.

[44]  Chung-An Lu,et al.  Sugar Response Sequence in the Promoter of a Rice α-Amylase Gene Serves as a Transcriptional Enhancer* , 1998, The Journal of Biological Chemistry.

[45]  Barry K Lavine,et al.  Machine learning based pattern recognition applied to microarray data. , 2004, Combinatorial chemistry & high throughput screening.

[46]  E. Yubero-Serrano,et al.  Identification of a strawberry gene encoding a non-specific lipid transfer protein that responds to ABA, wounding and cold stress. , 2003, Journal of experimental botany.

[47]  P. Springer,et al.  The essential Mcm7 protein PROLIFERA is localized to the nucleus of dividing cells during the G(1) phase and is required maternally for early Arabidopsis development. , 2000, Development.

[48]  Michael Q. Zhang,et al.  Identifying tissue-selective transcription factor binding sites in vertebrate promoters. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[49]  K. Shinozaki,et al.  Role of arabidopsis MYC and MYB homologs in drought- and abscisic acid-regulated gene expression. , 1997, The Plant cell.

[50]  P. Springer,et al.  The Arabidopsis gene PROLIFERA is required for proper cytokinesis during seed development , 2001, Planta.

[51]  M. Gerstein,et al.  Genomic analysis of gene expression relationships in transcriptional regulatory networks. , 2003, Trends in genetics : TIG.

[52]  Richard A. Dixon,et al.  Activation Tagging Identifies a Conserved MYB Regulator of Phenylpropanoid Biosynthesis , 2000, Plant Cell.

[53]  Wei-Min Liu,et al.  Robust estimators for expression analysis , 2002, Bioinform..

[54]  S. Tabata,et al.  Comparative genetic studies on the APRR5 and APRR7 genes belonging to the APRR1/TOC1 quintet implicated in circadian rhythm, control of flowering time, and early photomorphogenesis. , 2003, Plant & cell physiology.

[55]  E. Grill,et al.  Homeodomain protein ATHB6 is a target of the protein phosphatase ABI1 and regulates hormone responses in Arabidopsis , 2002, The EMBO journal.

[56]  R. Quatrano,et al.  Abscisic acid-responsive sequences from the em gene of wheat. , 1989, The Plant cell.

[57]  Akira Watanabe,et al.  Identification of a novel gene HYS1/CPR5 that has a repressive role in the induction of leaf senescence and pathogen-defence responses in Arabidopsis thaliana. , 2002, The Plant journal : for cell and molecular biology.

[58]  R. Gangal,et al.  Human pol II promoter prediction: time series descriptors and machine learning , 2005, Nucleic acids research.

[59]  J. Sheen,et al.  Glucose and ethylene signal transduction crosstalk revealed by an Arabidopsis glucose-insensitive mutant. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[60]  J. Grima-Pettenati,et al.  Characterization of cis-elements required for vascular expression of the cinnamoyl CoA reductase gene and for protein-DNA complex formation. , 2000, The Plant journal : for cell and molecular biology.

[61]  Minoru Kanehisa,et al.  The KEGG database. , 2002, Novartis Foundation symposium.

[62]  M. Chabouté,et al.  Cell Cycle Regulation of the Tobacco Ribonucleotide Reductase Small Subunit Gene Is Mediated by E2F-like Elements , 2000, Plant Cell.

[63]  E. Nambara,et al.  A Unique Short-Chain Dehydrogenase/Reductase in Arabidopsis Glucose Signaling and Abscisic Acid Biosynthesis and Functions , 2002, The Plant Cell Online.

[64]  E. Lam Domain analysis of the plant DNA-binding protein GT1a: requirement of four putative alpha-helices for DNA binding and identification of a novel oligomerization region , 1995, Molecular and cellular biology.

[65]  R. Loppes,et al.  Identification of short promoter regions involved in the transcriptional expression of the nitrate reductase gene in Chlamydomonas reinhardtii , 2004, Plant Molecular Biology.

[66]  Michael A. Beer,et al.  Predicting Gene Expression from Sequence , 2004, Cell.

[67]  S. Gibson,et al.  The Arabidopsis sugar-insensitive mutants sis4 and sis5 are defective in abscisic acid synthesis and response. , 2000, The Plant journal : for cell and molecular biology.

[68]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[69]  Robert Creeley,et al.  St. Martin's , 1971 .

[70]  Wei-Min Liu,et al.  Analysis of high density expression microarrays with signed-rank call algorithms , 2002, Bioinform..

[71]  S. J. Gilmour,et al.  Overexpression of the Arabidopsis CBF3 transcriptional activator mimics multiple biochemical changes associated with cold acclimation. , 2000, Plant physiology.

[72]  S. Ishiguro,et al.  Characterization of a cDNA encoding a novel DNA-binding protein, SPF1, that recognizes SP8 sequences in the 5′ upstream regions of genes coding for sporamin and β-amylase from sweet potato , 1994, Molecular and General Genetics MGG.

[73]  P. León,et al.  Hexokinase as a sugar sensor in higher plants. , 1997, The Plant cell.

[74]  Michael E. Tipping The Relevance Vector Machine , 1999, NIPS.

[75]  J. Yamaguchi,et al.  Promoter elements required for sugar‐repression of the RAmy3D gene for α‐amylase in rice , 1998 .

[76]  T. Ho,et al.  Three novel MYB proteins with one DNA binding repeat mediate sugar and hormone regulation of alpha-amylase gene expression. , 2002, The Plant cell.

[77]  M. Van Montagu,et al.  The trihelix DNA-binding motif in higher plants is not restricted to the transcription factors GT-1 and GT-2. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[78]  J. Chory,et al.  PKS1, a substrate phosphorylated by phytochrome that modulates light signaling in Arabidopsis. , 1999, Science.

[79]  F. Corke,et al.  Impaired sucrose-induction mutants reveal the modulation of sugar-induced starch biosynthetic gene expression by abscisic acid signalling. , 2001, The Plant journal : for cell and molecular biology.

[80]  Geoffrey E. Hinton,et al.  Bayesian Learning for Neural Networks , 1995 .

[81]  J. Strommer,et al.  Myb26: a MYB-like protein of pea flowers with affinity for promoters of phenylpropanoid genes. , 1997, The Plant journal : for cell and molecular biology.

[82]  B Lescure,et al.  Plant interstitial telomere motifs participate in the control of gene expression in root meristems. , 1999, The Plant journal : for cell and molecular biology.

[83]  Y. Zhou,et al.  EID1, an F-box protein involved in phytochrome A-specific light signaling. , 2001, Genes & development.

[84]  P. Busk,et al.  Regulation of abscisic acid-induced transcription , 1998, Plant Molecular Biology.

[85]  D. -. Kim,et al.  A cDNA encoding a putative SPF1-type DNA-binding protein from cucumber. , 1997, Gene.

[86]  J. Mol,et al.  The an11 locus controlling flower pigmentation in petunia encodes a novel WD-repeat protein conserved in yeast, plants, and animals. , 1997, Genes & development.

[87]  Daphne Koller,et al.  Genome-wide discovery of transcriptional modules from DNA sequence and gene expression , 2003, ISMB.