Testing for treatment effects on gene ontology

In studies that use DNA arrays to assess changes in gene expression, it is preferable to measure the significance of treatment effects on a group of genes from a pathway or functional category, such as gene ontology terms (GO terms, http://www.geneontology.org), because this facilitates the interpretation of effects and may markedly increase significance. A modified meta-analysis method for combining p-values was developed to measure the significance of an overall treatment effect on such functionally defined groups of genes, taking into account the correlation structure among genes. For hypothesis testing that allows gene expression to change in both directions, p-values are calculated under a null distribution generated by a Monte Carlo method. As a test of this procedure, we attempted to distinguish altered pathways in microarray studies performed with Mitochips, oligonucleotide microarrays specific to mitochondrial DNA-encoded transcripts. We found that our analytic method improves the specificity of selection for altered pathways because it incorporates the inter-gene correlation structure in each pathway. It is thus a practical method for measuring treatment effects on GO groups. In many actual applications, microarray experiments measure treatment effects under complicated design structures and with small sample sizes. For such applications to real data of limited statistical power, and also in computer simulations, we demonstrate that our method gives reasonable test results.
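The general idea of combining per-gene p-values for one gene group and calibrating the result against a Monte Carlo null can be sketched as follows. This is a minimal illustration, not the procedure from the paper: it assumes a simple two-group design, uses Fisher's combined statistic as a stand-in for the modified meta-analysis statistic, uses a normal approximation for the per-gene two-sided p-values, and generates the null by shuffling sample labels (one common Monte Carlo choice that keeps the inter-gene correlation intact, because all genes in a sample are relabeled together). The function names `two_sided_p`, `fisher_statistic`, and `group_p_value` are hypothetical.

```python
import math
import random
from statistics import mean, variance


def two_sided_p(x, y):
    """Two-sided p-value for a difference in means (normal approximation).

    Assumption: a Welch-type z-score is used only for illustration; the
    final group-level p-value is calibrated by the Monte Carlo null below.
    """
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    if se == 0.0:
        return 1.0  # degenerate case: treat the gene as uninformative
    z = (mean(x) - mean(y)) / se
    return math.erfc(abs(z) / math.sqrt(2.0))  # equals 2 * (1 - Phi(|z|))


def fisher_statistic(data, labels):
    """Fisher's combined statistic, -2 * sum(log p_i), over one gene group.

    data:   list of genes, each a list of expression values (one per sample)
    labels: list of 0 (control) or 1 (treated), one per sample
    """
    stat = 0.0
    for gene in data:
        ctrl = [v for v, g in zip(gene, labels) if g == 0]
        trt = [v for v, g in zip(gene, labels) if g == 1]
        p = max(two_sided_p(trt, ctrl), 1e-300)  # guard against log(0)
        stat += -2.0 * math.log(p)
    return stat


def group_p_value(data, labels, n_perm=2000, seed=0):
    """Monte Carlo p-value for the group statistic under label shuffling.

    Whole samples are relabeled, so correlations among the genes in the
    group are carried into the null distribution.
    """
    rng = random.Random(seed)
    observed = fisher_statistic(data, labels)
    shuffled = list(labels)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if fisher_statistic(data, shuffled) >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive.
    return (exceed + 1) / (n_perm + 1)
```

Because the per-gene p-values are two-sided, genes that move in opposite directions within the same group both add to the statistic rather than cancelling. With very small sample sizes the number of distinct relabelings is limited, which bounds how small the resulting p-value can be.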
