BATS: a Bayesian user-friendly software for Analyzing Time Series microarray experiments

BackgroundGene expression levels in a given cell can be influenced by different factors, namely pharmacological or medical treatments. The response to a given stimulus is usually different for different genes and may depend on time. One of the goals of modern molecular biology is the high-throughput identification of genes associated with a particular treatment or a biological process of interest. From methodological and computational point of view, analyzing high-dimensional time course microarray data requires very specific set of tools which are usually not included in standard software packages. Recently, the authors of this paper developed a fully Bayesian approach which allows one to identify differentially expressed genes in a 'one-sample' time-course microarray experiment, to rank them and to estimate their expression profiles. The method is based on explicit expressions for calculations and, hence, very computationally efficient.ResultsThe software package BATS (Bayesian Analysis of Time Series) presented here implements the methodology described above. It allows an user to automatically identify and rank differentially expressed genes and to estimate their expression profiles when at least 5–6 time points are available. The package has a user-friendly interface. BATS successfully manages various technical difficulties which arise in time-course microarray experiments, such as a small number of observations, non-uniform sampling intervals and replicated or missing data.ConclusionBATS is a free user-friendly software for the analysis of both simulated and real microarray time course experiments. The software, the user manual and a brief illustrative example are freely available online at the BATS website: http://www.na.iac.cnr.it/bats

[1]  Jeffrey T. Leek,et al.  Erratum: EDGE: Extraction and analysis of differential gene expression (Bioinformatics (2006) vol. 22 (4) (507-508)) , 2006 .

[2]  John D. Storey,et al.  Significance analysis of time course microarray experiments. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[3]  F Hong,et al.  Functional hierarchical models for identifying genes with different time-course expression profiles. , 2006, Biometrics.

[4]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[5]  T. Speed,et al.  On the gene ranking of replicated microarray time course data , 2007 .

[6]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[7]  Marianna Pensky,et al.  Statistical Applications in Genetics and Molecular Biology A Bayesian Approach to Estimation and Testing in Time-course Microarray Experiments , 2011 .

[8]  Felix Abramovich,et al.  Bayesian Maximum a posteriori Multiple Testing Procedure , 2006 .

[9]  Claudio Cobelli,et al.  A quantization method based on threshold optimization for microarray short time series , 2005, BMC Bioinformatics.

[10]  Lucia Altucci,et al.  A genomic view of estrogen actions in human breast cancer cells by expression profiling of the hormone-responsive transcriptome. , 2004, Journal of molecular endocrinology.

[11]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[12]  Jeffrey T. Leek,et al.  Gene expression EDGE : extraction and analysis of differential gene expression , 2006 .

[13]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[14]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Geoffrey J. McLachlan,et al.  Analyzing Microarray Gene Expression Data , 2004 .

[16]  Satoru Miyano,et al.  Statistical analysis of a small set of time-ordered gene expression data using linear splines , 2002, Bioinform..

[17]  Ana Conesa,et al.  maSigPro: a Method to Identify Significantly Differential Expression Profiles in Time-Course Microarray Experiments , 2006, Spanish Bioinformatics Conference.

[18]  T. Speed,et al.  A multivariate empirical Bayes statistic for replicated microarray time course data , 2006, math/0702685.

[19]  X. Cui,et al.  Transformations for cDNA Microarray Data , 2003, Statistical applications in genetics and molecular biology.

[20]  Taesung Park,et al.  Statistical tests for identifying differentially expressed genes in time-course microarray experiments , 2003, Bioinform..

[21]  Michal Linial,et al.  Novel Unsupervised Feature Filtering of Biological Data , 2006, ISMB.

[22]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[23]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[24]  Claudia Angelini,et al.  Time-course analysis of genome-wide gene expression data from hormone-responsive human breast cancer cells , 2008, BMC Bioinformatics.

[25]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .

[26]  Ernst Wit,et al.  Statistics for Microarrays : Design, Analysis and Inference , 2004 .