DNA sequence and structure properties analysis reveals similarities and differences to promoters of stress responsive genes in Arabidopsis thaliana

Understanding regulatory mechanisms of stress response in plants has important biological and agricultural significances. In this study, we firstly compiled a set of genes responsive to different stresses in Arabidopsis thaliana and then comparatively analysed their promoters at both the DNA sequence and three-dimensional structure levels. Amazingly, the comparison revealed that the profiles of several sequence and structure properties vary distinctly in different regions of promoters. Moreover, the content of nucleotide T and the profile of B-DNA twist are distinct in promoters from different stress groups, suggesting Arabidopsis genes might exploit different regulatory mechanisms in response to various stresses. Finally, we evaluated the performance of two representative promoter predictors including EP3 and PromPred. The evaluation results revealed their strengths and weakness for identifying stress-related promoters, providing valuable guidelines to accelerate the discovery of novel stress-related promoters and genes in plants.

[1]  Shanshan Zheng,et al.  Feature selection for genomic data sets through feature clustering , 2010, Int. J. Data Min. Bioinform..

[2]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[3]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[4]  Manju Bansal,et al.  DNA Free Energy-Based Promoter Prediction and Comparative Analysis of Arabidopsis and Rice Genomes1[C][W][OA] , 2011, Plant Physiology.

[5]  Hao Chen,et al.  KGBassembler: a karyotype-based genome assembler for Brassicaceae species , 2012, Bioinform..

[6]  Aleksandar Milosavljevic,et al.  Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties. , 2008, Genome research.

[7]  Eytan Domany,et al.  Positional distribution of human transcription factor binding sites , 2008, Nucleic acids research.

[8]  Thomas Werner,et al.  The State of the Art of Mammalian Promoter Recognition , 2003, Briefings Bioinform..

[9]  Wilfried Haerty,et al.  Genome-wide evidence for selection acting on single amino acid repeats. , 2010, Genome research.

[10]  A. Krishnamachari,et al.  Computational analysis of plant RNA Pol-II promoters. , 2006, Bio Systems.

[11]  Yvan Saeys,et al.  Large-scale structural analysis of the core promoter in mammalian and plant genomes , 2005, Nucleic acids research.

[12]  Manju Bansal,et al.  Structural properties of promoters: similarities and differences between prokaryotes and eukaryotes , 2005, Nucleic acids research.

[13]  V. Zhurkin,et al.  B-DNA twisting correlates with base-pair morphology. , 1995, Journal of molecular biology.

[14]  Shuigeng Zhou,et al.  A comparison study on feature selection of DNA structural properties for promoter prediction , 2012, BMC Bioinformatics.

[15]  Tomaso Poggio,et al.  Identification and analysis of alternative splicing events conserved in human and mouse. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Kazuo Shinozaki,et al.  Research on plant abiotic stress responses in the post-genome era: past, present and future. , 2010, The Plant journal : for cell and molecular biology.

[17]  H. Bohnert,et al.  Integration of Arabidopsis thaliana stress-related transcript profiles, promoter structures, and cell-specific expression , 2007, Genome Biology.

[18]  Michael P. Snyder,et al.  Discovery of Stress Responsive DNA Regulatory Motifs in Arabidopsis , 2012, PloS one.

[19]  G M Rubin,et al.  Insertion site preferences of the P transposable element in Drosophila melanogaster. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Zheng Kou,et al.  Using amino acid factor scores to predict avian-to-human transmission of avian influenza viruses: a machine learning study. , 2013, Protein and peptide letters.

[21]  E. Bornberg-Bauer,et al.  The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. , 2007, The Plant journal : for cell and molecular biology.

[22]  Shuigeng Zhou,et al.  A pattern-based nearest neighbor search approach for promoter prediction using DNA structural profiles , 2009, Bioinform..

[23]  Jia Wang,et al.  Predicting transmission of avian influenza A viruses from avian to human by using informative physicochemical properties , 2013, Int. J. Data Min. Bioinform..

[24]  Martin C. Frith,et al.  Discovering Sequence Motifs with Arbitrary Insertions and Deletions , 2008, PLoS Comput. Biol..

[25]  M. Thomashow,et al.  Cis-regulatory code of stress-responsive transcription in Arabidopsis thaliana , 2011, Proceedings of the National Academy of Sciences.

[26]  Ivanov Vi,et al.  [The A-form of DNA: in search of the biological role]. , 1994 .

[27]  Yvan Saeys,et al.  Toward a gold standard for promoter prediction evaluation , 2009, Bioinform..

[28]  Desmond G. Higgins,et al.  High DNA melting temperature predicts transcription start site location in human and mouse , 2009, Nucleic acids research.

[29]  Weixiong Zhang,et al.  Structural features based genome-wide characterization and prediction of nucleosome organization , 2012, BMC Bioinformatics.

[30]  Manju Bansal,et al.  A novel method for prokaryotic promoter prediction based on DNA stability , 2005, BMC Bioinformatics.

[31]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[32]  Yvan Saeys,et al.  Generic eukaryotic core promoter prediction using structural features of DNA. , 2008, Genome research.

[33]  M. Eisen,et al.  Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering , 2002, Genome Biology.

[34]  Rakesh Tuli,et al.  The TATA-Box Sequence in the Basal Promoter Contributes to Determining Light-Dependent Gene Expression in Plants1[W] , 2006, Plant Physiology.

[35]  Uwe Ohler,et al.  Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment , 2006, Genome Biology.