A multiple testing procedure for multi-dimensional pairwise comparisons with application to gene expression studies

BackgroundOften researchers are interested in comparing multiple experimental groups (e.g. tumor size) with a reference group (e.g. normal tissue) on the basis of thousands of features (e.g. genes) and determine if a differentially expressed feature is up or down regulated in a pairwise comparison. There are two sources of false discoveries, one due to multiple testing involving several pairwise comparisons and the second due to falsely declaring a feature to be up (or down) regulated when it is not (known as directional error). Together, the total error rate is called the mixed directional false discovery rate (mdFDR).ResultsWe develop a general powerful mdFDR controlling testing procedure and illustrate the methodology by analyzing uterine fibroid gene expression data (PLoS ONE 8:63909, 2013). We identify several differentially expressed genes (DEGs) and pathways that are specifically enriched according to the size of a uterine fibroid.ConclusionsThe proposed general procedure strongly controls mdFDR. Several specific methodologies can be derived from this general methodology by using appropriate testing procedures at different steps of the general procedure. Thus we are providing a general framework for making multiple pairwise comparisons. Our analysis of the uterine fibroid growth gene expression data suggests that molecular characteristics of a fibroid changes with size. Our powerful methodology allowed us to draw several interesting conclusions regarding the molecular characteristics of uterine fibroids. For example, IL-1 signaling pathway (Sci STKE 2003:3, 2003), associated with inflammation and known to activate prostaglandins that are implicated in the progression of fibroids, is significantly enriched only in small tumors (volume < 5.7 cm3). It appears that the molecular apparatus necessary for fibroid growth and development is established during tumor development. A complete list of all DEGs and the corresponding enriched pathways according to tumor size is provided for researchers to mine these data. Identification of these DEGs and the pathways may potentially have clinical implications.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  V. Kristensen,et al.  Gene Expression Profile Analysis of T1 and T2 Breast Cancer Reveals Different Activation Pathways , 2013, ISRN oncology.

[3]  H. Finner,et al.  Stepwise multiple test procedures and control of directional errors , 1999 .

[4]  Control of directional errors with step-up multiple tests , 1997 .

[5]  H. Jabbour,et al.  Prostaglandin receptors are mediators of vascular function in endometrial pathologies , 2006, Molecular and Cellular Endocrinology.

[6]  Y. Benjamini,et al.  Screening for Partial Conjunction Hypotheses , 2008, Biometrics.

[7]  D. Baird,et al.  Gene Expression in Uterine Leiomyoma from Tumors Likely to Be Growing (from Black Women over 35) and Tumors Likely to Be Non-Growing (from White Women over 35) , 2013, PloS one.

[8]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[9]  T. Kennedy,et al.  Netrins: versatile extracellular cues with diverse functions , 2011, Development.

[10]  Y. Hochberg A sharper Bonferroni procedure for multiple tests of significance , 1988 .

[11]  R. Tibshirani,et al.  Empirical bayes methods and false discovery rates for microarrays , 2002, Genetic epidemiology.

[12]  Y. Benjamini,et al.  False Discovery Rate–Adjusted Multiple Confidence Intervals for Selected Parameters , 2005 .

[13]  D. Baird,et al.  Growth of uterine leiomyomata among premenopausal black and white women , 2008, Proceedings of the National Academy of Sciences.

[14]  I. Pogribny,et al.  Aloe vera Non-Decolorized Whole Leaf Extract-Induced Large Intestinal Tumors in F344 Rats Share Similar Molecular Pathways with Human Sporadic Colorectal Tumors , 2011, Toxicologic pathology.

[15]  C. Dunnett A Multiple Comparison Procedure for Comparing Several Treatments with a Control , 1955 .

[16]  Juliet Popper Shaffer Control of Directional Errors with Stagewise Multiple Test Procedures , 1980 .

[17]  A. Dunne,et al.  The Interleukin-1 Receptor/Toll-Like Receptor Superfamily: Signal Transduction During Inflammation and Host Defense , 2000, Science's STKE.

[18]  K. Kwon,et al.  Growth inhibition and apoptosis induced in human leiomyoma cells by treatment with the PPAR gamma ligand ciglitizone. , 2007, Molecular human reproduction.

[19]  Michael Z. Man,et al.  The influence of tumor size and environment on gene expression in commonly used human tumor lines , 2004, BMC Cancer.

[20]  D. Vitkup,et al.  Heterogeneity of tumor-induced gene expression changes in the human metabolic network , 2013, Nature Biotechnology.

[21]  S. Auerbach,et al.  Differential Transcriptomic Analysis of Spontaneous Lung Tumors in B6C3F1 Mice: Comparison to Human Non–Small Cell Lung Cancer , 2012, Toxicologic pathology.

[22]  Massimo Cristofanilli,et al.  β4 integrin subunit gene expression correlates with tumor size and nuclear grade in early breast cancer , 2005, Modern Pathology.

[23]  I. Manyonda,et al.  Progesterone and progesterone receptor modulators in the management of symptomatic uterine fibroids. , 2012, European journal of obstetrics, gynecology, and reproductive biology.

[24]  S. Dudoit,et al.  Multiple Hypothesis Testing in Microarray Experiments , 2003 .

[25]  S. Peddada,et al.  Hepatocellular Carcinomas in B6C3F1 Mice Treated with Ginkgo biloba Extract for Two Years Differ from Spontaneous Liver Tumors in Cancer Gene Mutations and Genomic Pathways , 2013, Toxicologic pathology.

[26]  BMC Bioinformatics , 2005 .

[27]  Gregory R. Grant,et al.  A flexible two-stage procedure for identifying gene sets that are differentially expressed , 2009, Bioinform..

[28]  H. Ishwaran,et al.  Lung metastasis genes couple breast tumor size and metastatic spread , 2007, Proceedings of the National Academy of Sciences.

[29]  A. Tamhane,et al.  Step-down multiple tests for comparing treatments with a control in unbalanced one-way layouts. , 1991, Statistics in medicine.

[30]  M. Poupon,et al.  Netrin-4 inhibits angiogenesis via binding to neogenin and recruitment of Unc5B , 2008, Proceedings of the National Academy of Sciences.

[31]  S. Auerbach,et al.  Global Gene Profiling of Spontaneous Hepatocellular Carcinoma in B6C3F1 Mice , 2011, Toxicologic pathology.

[32]  Md Soriful Islam,et al.  Growth factors and myometrium: biological effects in uterine fibroid and possible clinical implications. , 2011, Human reproduction update.

[33]  On two results in multiple testing , 2004 .

[34]  S. Jaworski,et al.  [Fibroblast growth factor gene expression in uterine leiomyomas]. , 2008, Ginekologia polska.

[35]  J. Copland,et al.  Inhibition of proliferation and estrogen receptor signaling by peroxisome proliferator-activated receptor gamma ligands in uterine leiomyoma. , 2003, Cancer research.

[36]  Charles W. Dunnett,et al.  A Step-Up Multiple Test Procedure , 1992 .

[37]  Wenge Guo,et al.  Controlling False Discoveries in Multidimensional Directional Decisions, with Applications to Gene Expression Data on Ordered Categories , 2010, Biometrics.

[38]  W. Driever,et al.  The Rac1 Regulator ELMO1 Controls Vascular Morphogenesis in Zebrafish , 2010, Circulation research.

[39]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .