A statistical method for identifying differential gene-gene co-expression patterns

MOTIVATION To understand cancer etiology, it is important to explore molecular changes in cellular processes from normal state to cancerous state. Because genes interact with each other during cellular processes, carcinogenesis related genes may form differential co-expression patterns with other genes in different cell states. In this study, we develop a statistical method for identifying differential gene-gene co-expression patterns in different cell states. RESULTS For efficient pattern recognition, we extend the traditional F-statistic and obtain an Expected Conditional F-statistic (ECF-statistic), which incorporates statistical information of location and correlation. We also propose a statistical method for data transformation. Our approach is applied to a microarray gene expression dataset for prostate cancer study. For a gene of interest, our method can select other genes that have differential gene-gene co-expression patterns with this gene in different cell states. The 10 most frequently selected genes, include hepsin, GSTP1 and AMACR, which have recently been proposed to be associated with prostate carcinogenesis. However, genes GSTP1 and AMACR cannot be identified by studying differential gene expression alone. By using tumor suppressor genes TP53, PTEN and RB1, we identify seven genes that also include hepsin, GSTP1 and AMACR. We show that genes associated with cancer may have differential gene-gene expression patterns with many other genes in different cell states. By discovering such patterns, we may be able to identify carcinogenesis related genes.

[1]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[2]  D. Eisenberg,et al.  A combined algorithm for genome-wide prediction of protein function , 1999, Nature.

[3]  Tapio Visakorpi,et al.  Molecular genetics of prostate cancer , 2001, Urology.

[4]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[5]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[6]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[8]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[9]  B. Mishra,et al.  Shrinkage-based similarity metric for cluster analysis of microarray data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Ker-Chau Li,et al.  Genome-wide coexpression dynamics: Theory and application , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Jean Y. J. Wang,et al.  Coordinated regulation of life and death by RB , 2003, Nature Reviews Cancer.

[12]  Xinbin Chen,et al.  Mutant p53 exerts a dominant negative effect by preventing wild-type p53 from binding to the promoter of its target genes , 2004, Oncogene.

[13]  P. Brown,et al.  Yeast microarrays for genome wide parallel genetic and gene expression analysis. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[14]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[15]  David E. Misek,et al.  Discordant Protein and mRNA Expression in Lung Adenocarcinomas * , 2002, Molecular & Cellular Proteomics.

[16]  W. Hahn,et al.  Modelling the molecular circuitry of cancer , 2002, Nature Reviews Cancer.

[17]  W. Isaacs,et al.  Human prostate cancer precursors and pathobiology. , 2003, Urology.

[18]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[19]  N. Perkins,et al.  Active repression of antiapoptotic gene expression by RelA(p65) NF-kappa B. , 2004, Molecular cell.

[20]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[21]  N. Perkins,et al.  Active Repression of Antiapoptotic Gene Expression by RelA(p65) NF-κB , 2004 .

[22]  E. Winzeler,et al.  Protein pathway and complex clustering of correlated mRNA and protein expression analyses in Saccharomyces cerevisiae , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[24]  J. Royston An Extension of Shapiro and Wilk's W Test for Normality to Large Samples , 1982 .

[25]  Xin Lu,et al.  Live or let die: the cell's response to p53 , 2002, Nature Reviews Cancer.

[26]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[27]  S. Loening,et al.  Hepsin is highly over expressed in and a new candidate for a prognostic indicator in prostate cancer. , 2004, The Journal of urology.

[28]  W. Isaacs,et al.  For Personal Use. Only Reproduce with Permission from the Lancet Publishing Group. Pathological and Molecular Aspects of Prostate Cancer Prostate Cancer Ii , 2022 .