Multiplicity adjustment for intersection-union test: detecting overlapping genes from multiple microarray gene lists

Detecting overlapping genes from multiple gene lists is a frequently used procedure in microarray- based studies. However in previous studies, the intersection operations of gene lists, formally known as the Intersection-Union Tests (IUTs), were performed without being aware of the incurred changes in Type 1 error rate. We presented an IUT adjustment procedure, called Relaxed IUT (RIUT), which is proved to be less conservative and more powerful for combining independent tests. We demonstrated the superiority of RIUT for detecting overlapping genes by simulation and two real toxicogenomic gene expression data sets. Notably, the better power of RIUT enables it to identify overlapping gene sets leading to known related pathways where traditional TUT fails to find.

[1]  S. S. Young,et al.  Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[2]  Runtong Zhang,et al.  Fuzzy control of queueing systems with heterogeneous servers , 1999, IEEE Trans. Fuzzy Syst..

[3]  R. Tibshirani,et al.  An introduction to the bootstrap , 1993 .

[4]  Steven H. Low,et al.  Optimization flow control—I: basic algorithm and convergence , 1999, TNET.

[5]  R. Srikant,et al.  Stable, scalable, fair congestion control and AQM schemes that achieve high utilization in the Internet , 2003, IEEE Trans. Autom. Control..

[6]  Hanlee P. Ji,et al.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. , 2006, Nature biotechnology.

[7]  Weida Tong,et al.  Multi-class cancer classification by total principal component regression (TPCR) using microarray gene expression data , 2005, Nucleic acids research.

[8]  Xue Liu,et al.  A feedback control scheme for resource allocation in wireless multi-hop ad hoc networks , 2005, The Second Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services.

[9]  Roger L. Berger,et al.  Multiparameter Hypothesis Testing and Acceptance Sampling , 1982 .

[10]  Azra Bozcaarmutlu,et al.  Effect of mercury, cadmium, nickel, chromium and zinc on kinetic properties of NADPH-cytochrome P450 reductase purified from leaping mullet (Liza saliens). , 2007, Toxicology in vitro : an international journal published in association with BIBRA.

[11]  José Ferreira de Rezende,et al.  QoS provisioning across a DiffServ domain using policy-based management , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[12]  John D. Storey A direct approach to false discovery rates , 2002 .

[13]  Masaharu Komatsu,et al.  Involvement of mitogen-activated protein kinase signaling pathways in microcystin-LR-induced apoptosis after its selective uptake mediated by OATP1B1 and OATP1B3. , 2007, Toxicological sciences : an official journal of the Society of Toxicology.

[14]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[15]  Stephen C. Harris,et al.  Rat toxicogenomic study reveals analytical consistency across microarray platforms , 2006, Nature Biotechnology.

[16]  Antai Wang,et al.  Gene selection for microarray data analysis using principal component analysis , 2005, Statistics in medicine.

[17]  Daniel L Farkas,et al.  Transcriptomic fingerprinting of bone marrow-derived hepatic beta2m-/Thy-1+ stem cells. , 2005, Biochemical and biophysical research communications.

[18]  Janina Moniuszko-Jakoniuk,et al.  Effects of chronic exposure to cadmium on renal cytochrome P450-dependent monooxygenase system in rats , 2004, Archives of Toxicology.

[19]  Klara Nahrstedt,et al.  Distributed quality-of-service routing in ad hoc networks , 1999, IEEE J. Sel. Areas Commun..

[20]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .

[21]  Pierre Baldi,et al.  A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes , 2001, Bioinform..

[22]  L. Hedges,et al.  Statistical Methods for Meta-Analysis , 1987 .

[23]  Charles Wang,et al.  Integrating time-course microarray gene expression profiles with cytotoxicity for identification of biomarkers in primary rat hepatocytes exposed to cadmium , 2006, Bioinform..

[24]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[25]  R. Berger,et al.  Bioequivalence trials, intersection-union tests and equivalence confidence sets , 1996 .

[26]  M. Kanehisa A database for post-genome analysis. , 1997, Trends in genetics : TIG.

[27]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.

[28]  Steven H. Low,et al.  Understanding TCP Vegas: a duality model , 2002 .

[29]  A M Stella,et al.  Lead toxicity in cyanobacterial porphyrin metabolism , 2001, Environmental toxicology.

[30]  Rupert G. Miller Simultaneous Statistical Inference , 1966 .

[31]  Felix Naef,et al.  In vivo transcriptional profile analysis reveals RNA splicing and chromatin remodeling as prominent processes for adult neurogenesis , 2006, Molecular and Cellular Neuroscience.

[32]  Aaron Aragaki,et al.  Significance testing for small microarray experiments , 2005, Statistics in medicine.

[33]  Klara Nahrstedt,et al.  iPass: an incentive compatible auction scheme to enable packet forwarding service in MANET , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[34]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[35]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.