Genome-Wide DNA-Binding Specificity of PIL5, a Arabidopsis Basic Helix-Loop-Helix (bHLH) Transcription Factor

PIL5 is a member of the bHLH transcription factor super family and plays crucial roles in phytochrome mediated seed germination process in Arabidopsis. While our previous study shows that PIL5 binds to the G-box (CACGTG) motif with high affinity, other attributes must be involved in determining regulatory specificity. In this study we performed a ChIP-chip assay to obtain genome-wide PIL5 binding sites and investigated other attributes which would affect to PIL5 DNA-binding specificity. Our results confirmed that the existence of G-box in promoter region was the most significant binding attribute. Additionally we also found that other attributes such as neighboring motif composition, distance from transcription start site, nucleosome density, DNA methylation and average gene expression were also contributing to PIL5 DNA-binding specificity. The Random Forest classifier using these attributes can classify PIL5 binding sites from non-binding sites with accuracy of 93.05%.

[1]  I. Henderson,et al.  Gardening the genome: DNA methylation in Arabidopsis thaliana , 2005, Nature Reviews Genetics.

[2]  Ian Witten,et al.  Data Mining , 2000 .

[3]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[4]  Eunkyoo Oh,et al.  PIL5, a Phytochrome-Interacting bHLH Protein, Regulates Gibberellin Responsiveness by Binding Directly to the GAI and RGA Promoters in Arabidopsis Seeds[W] , 2007, The Plant Cell Online.

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  M. Pellegrini,et al.  Genome-wide High-Resolution Mapping and Functional Analysis of DNA Methylation in Arabidopsis , 2006, Cell.

[7]  Hongyu Zhao,et al.  Pathway analysis using random forests classification and regression , 2006, Bioinform..

[8]  Susan Jones,et al.  An overview of the basic helix-loop-helix proteins , 2004, Genome Biology.

[9]  Kevin Struhl,et al.  Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. , 2005, Molecular cell.

[10]  David Baltimore,et al.  A new DNA binding and dimerization motif in immunoglobulin enhancer binding, daughterless, MyoD, and myc proteins , 1989, Cell.

[11]  Y. Kamiya,et al.  Light activates the degradation of PIL5 protein to promote seed germination through gibberellin in Arabidopsis. , 2006, The Plant journal : for cell and molecular biology.

[12]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[13]  Mark Bieda,et al.  Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome. , 2006, Genome research.

[14]  J. Collado-Vides,et al.  Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. , 1998, Journal of molecular biology.

[15]  W. Atchley,et al.  A natural classification of the basic helix-loop-helix class of transcription factors. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Ralf Zimmer,et al.  BioWeka - extending the Weka framework for bioinformatics , 2007, Bioinform..

[17]  Daiya Takai,et al.  Comprehensive analysis of CpG islands in human chromosomes 21 and 22 , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Ingo Dreyer,et al.  Genome-wide analysis of ABA-responsive elements ABRE and CE3 reveals divergent patterns in Arabidopsis and rice , 2007, BMC Genomics.

[19]  James T Kadonaga,et al.  Regulation of RNA Polymerase II Transcription by Sequence-Specific DNA Binding Factors , 2004, Cell.

[20]  S. Henikoff,et al.  Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription , 2007, Nature Genetics.

[21]  Jun S. Liu,et al.  An algorithm for finding protein–DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments , 2002, Nature Biotechnology.

[22]  Gordon K. Smyth,et al.  limmaGUI: A graphical user interface for linear modeling of microarray data , 2004, Bioinform..

[23]  Wing Hung Wong,et al.  TileMap: create chromosomal map of tiling array hybridizations , 2005, Bioinform..

[24]  J. Bennetzen,et al.  Transposable element contributions to plant gene and genome evolution , 2004, Plant Molecular Biology.

[25]  Doheon Lee,et al.  Genome-Wide Analysis of Genes Targeted by PHYTOCHROME INTERACTING FACTOR 3-LIKE5 during Seed Germination in Arabidopsis[W] , 2009, The Plant Cell Online.

[26]  William R. Atchley,et al.  Positional Dependence, Cliques, and Predictive Motifs in the bHLH Protein Domain , 1999, Journal of Molecular Evolution.

[27]  Matteo Pellegrini,et al.  Whole-Genome Analysis of Histone H3 Lysine 27 Trimethylation in Arabidopsis , 2007, PLoS biology.

[28]  G. Church,et al.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation , 1998, Nature Biotechnology.

[29]  M. Vervoort,et al.  The basic helix-loop-helix protein family: comparative genomics and phylogenetic analysis. , 2001, Genome research.

[30]  Q. Shen,et al.  Modular nature of abscisic acid (ABA) response complexes: composite promoter units that are necessary and sufficient for ABA induction of gene expression in barley. , 1996, The Plant cell.

[31]  A. Bird,et al.  Genomic DNA methylation: the mark and its mediators. , 2006, Trends in biochemical sciences.

[32]  Hong Ma,et al.  Genome-Wide Analysis of Basic/Helix-Loop-Helix Transcription Factor Family in Rice and Arabidopsis1[W] , 2006, Plant Physiology.

[33]  A. Wolffe,et al.  How does DNA methylation repress transcription? , 1997, Trends in genetics : TIG.

[34]  E. Huq,et al.  The Arabidopsis Basic/Helix-Loop-Helix Transcription Factor Family Online version contains Web-only data. Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.013839. , 2003, The Plant Cell Online.

[35]  Y. Kyōgoku,et al.  Crystal structure of PHO4 bHLH domain–DNA complex: flanking base recognition , 1997, The EMBO journal.