New network topology approaches reveal differential correlation patterns in breast cancer

BackgroundAnalysis of genome-wide data is often carried out using standard methods such as differential expression analysis, clustering analysis and heatmaps. Beyond that, differential correlation analysis was suggested to identify changes in the correlation patterns between disease states. The detection of differential correlation is a demanding task, as the number of entries in the gene-by-gene correlation matrix is large. Currently, there is no gold standard for the detection of differential correlation and statistical validation.ResultsWe developed two untargeted algorithms (DCloc and DCglob) that identify differential correlation patterns by comparing the local or global topology of correlation networks. Construction of networks from correlation structures requires fixing of a correlation threshold. Instead of a single cutoff, the algorithms systematically investigate a series of correlation thresholds and permit to detect different kinds of correlation changes at the same level of significance: strong changes of a few genes and moderate changes of many genes. Comparing the correlation structure of 208 ER- breast carcinomas and 208 ER+ breast carcinomas, DCloc detected 770 differentially correlated genes with a FDR of 12.8%, while DCglob detected 630 differentially correlated genes with a FDR of 12.1%. In two-fold cross-validation, the reproducibility of the list of the top 5% differentially correlated genes in 140 ER- tumors and in 140 ER+ tumors was 49% for DCloc and 33% for DCglob.ConclusionsWe developed two correlation network topology based algorithms for the detection of differential correlations in different disease states. Clusters of differentially correlated genes could be interpreted biologically and included the marker genes hydroxyprostaglandin dehydrogenase (PGDH) and acyl-CoA synthetase medium chain 1 (ACSM1) of invasive apocrine carcinomas that were differentially correlated, but not differentially expressed. Using random subsampling and cross-validation, DCloc and DCglob were shown to identify specific and reproducible lists of differentially correlated genes.

[1]  A. Giuliano,et al.  FOXC1 is a potential prognostic biomarker with functional significance in basal-like breast cancer. , 2010, Cancer research.

[2]  Carsten Denkert,et al.  Genome-wide Gene Expression Profiling of Formalin-fixed Paraffin-Embedded Breast Cancer Core Biopsies Using Microarrays , 2011, The journal of histochemistry and cytochemistry : official journal of the Histochemistry Society.

[3]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[4]  Carsten Denkert,et al.  Quantitative Determination of Estrogen Receptor, Progesterone Receptor, and HER2 mRNA in Formalin-fixed Paraffin-embedded Tissue—A New Option for Predictive Biomarker Assessment in Breast Cancer , 2011, Diagnostic molecular pathology : the American journal of surgical pathology, part B.

[5]  Dong Xu,et al.  Pathway Correlation Profile of Gene-Gene Co-Expression for Identifying Pathway Perturbation , 2012, PloS one.

[6]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[7]  David Warde-Farley,et al.  Dynamic modularity in protein interaction networks predicts breast cancer outcome , 2009, Nature Biotechnology.

[8]  Rainer Breitling,et al.  DiffCoEx: a simple and sensitive method to find differentially coexpressed gene modules , 2010, BMC Bioinformatics.

[9]  Liang Chen,et al.  A statistical method for identifying differential gene-gene co-expression patterns , 2004, Bioinform..

[10]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[11]  S. Bergmann,et al.  Comparative Gene Expression Analysis by a Differential Clustering Approach: Application to the Candida albicans Transcription Program , 2005, PLoS genetics.

[12]  C. Parise,et al.  articleUse of ER / PR / HER 2 subtypes in conjunction with the 2007 St Gallen Consensus Statement for early breast cancer , 2010 .

[13]  M. Ringnér,et al.  High-resolution genomic and expression analyses of copy number alterations in HER2-amplified breast cancer , 2010, Breast Cancer Research.

[14]  Ron Shamir,et al.  Dissection of Regulatory Networks that Are Altered in Disease via Differential Co-expression , 2013, PLoS Comput. Biol..

[15]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[16]  Carsten O. Peterson,et al.  Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns. , 2001, Cancer research.

[17]  David Cameron,et al.  2-year follow-up of trastuzumab after adjuvant chemotherapy in HER2-positive breast cancer: a randomised controlled trial , 2007, The Lancet.

[18]  Ker-Chau Li,et al.  Genome-wide coexpression dynamics: Theory and application , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[19]  W. Gerald,et al.  An estrogen receptor-negative breast cancer subset characterized by a hormonally regulated transcriptional program and response to androgen , 2006, Oncogene.

[20]  Michael Watson,et al.  CoXpress: differential co-expression in gene expression data , 2006, BMC Bioinformatics.

[21]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[22]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Takuji Iwase,et al.  Molecular characterization of apocrine carcinoma of the breast: Validation of an apocrine protein signature in a well‐defined cohort , 2009, Molecular oncology.

[24]  Simone Severini,et al.  Increased entropy of signal transduction in the cancer metastasis phenotype , 2010, BMC Systems Biology.

[25]  Rainer Spang,et al.  Finding disease specific alterations in the co-expression of genes , 2004, ISMB/ECCB.

[26]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.

[27]  Christina Kendziorski,et al.  Statistical methods for gene set co-expression analysis , 2009, Bioinform..

[28]  Robert Clarke,et al.  Differential dependency network analysis to identify condition-specific topological changes in biological networks , 2009, Bioinform..

[29]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[30]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[31]  Karin Jirström,et al.  15-Prostaglandin Dehydrogenase Expression Alone or in Combination with ACSM1 Defines a Subgroup of the Apocrine Molecular Subtype of Breast Carcinoma*S , 2008, Molecular & Cellular Proteomics.

[32]  Michael A. Langston,et al.  Extracting Gene Networks for Low-Dose Radiation Using Graph Theoretical Algorithms , 2006, PLoS Comput. Biol..

[33]  J. H. Steiger Tests for comparing elements of a correlation matrix. , 1980 .

[34]  Z. Szallasi,et al.  An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients , 2010, Breast Cancer Research and Treatment.

[35]  A. G. de la Fuente From 'differential expression' to 'differential networking' - identification of dysfunctional regulatory networks in diseases. , 2010, Trends in genetics : TIG.

[36]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[38]  Lucinda K. Southworth,et al.  Aging Mice Show a Decreasing Correlation of Gene Expression within Genetic Modules , 2009, PLoS genetics.

[39]  Sangsoo Kim,et al.  Gene expression Differential coexpression analysis using microarray data and its application to human cancer , 2005 .

[40]  Carsten Denkert,et al.  Androgen receptor expression in primary breast cancer and its predictive and prognostic value in patients treated with neoadjuvant chemotherapy , 2011, Breast Cancer Research and Treatment.

[41]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Mohammad Asim,et al.  Differential C3NET reveals disease networks of direct physical interactions , 2011, BMC Bioinformatics.

[43]  Mario Medvedovic,et al.  A semi-parametric Bayesian model for unsupervised differential co-expression analysis , 2010, BMC Bioinformatics.