Investigation of association estimators in network inference algorithms on breast cancer proteomic data

In this study, association estimators applied in the network inference methods used to determine disease-related molecular interactions using breast cancer, which is the most common type of cancer in women, proteomic data were examined and hub genes in the gene-gene interaction network related to the disease were identified. Proteomic data of 901 breast cancer patients were generated using reverse phase protein array provided by The Cancer Proteome Atlas (TCPA) as a data set. Correlations and mutual information (MI) based estimators used in the literature were compared in the study, and WGCNA and minet R packages were used. As a result, it is seen that the MI based shrink estimator method has more successful results than the correlation-based adjacency function used in the estimation of biological networks in the WGCNA package. Achievement rates have ranged from 0.67 to 1.00 in the shrink estimation, with adjacency functions ranging from 0.33 to 0.86 for different module counts. In addition, hub genes and inferenced networks of successful results are presented for the review of biologists.

[1]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[2]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Gianluca Bontempi,et al.  minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information , 2008, BMC Bioinformatics.

[4]  S. Horvath,et al.  Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target , 2006, Proceedings of the National Academy of Sciences.

[5]  Chunyan Yu,et al.  SMAD4 expression in breast ductal carcinoma correlates with prognosis. , 2015, Oncology letters.

[6]  P. Gregersen,et al.  Overlapping Probabilities of Top Ranking Gene Lists, Hypergeometric Distribution, and Stringency of Gene Selection Criterion , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[7]  Jingchun Zhu,et al.  Realizing the Promise of Reverse Phase Protein Arrays for Clinical, Translational, and Basic Research: A Workshop Report , 2014, Molecular & Cellular Proteomics.

[8]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Melissa J. Davis,et al.  Gene regulatory network inference: evaluation and application to ovarian cancer allows the prioritization of drug targets , 2012, Genome Medicine.

[10]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[11]  Z. Dong,et al.  PDK1 promotes tumor growth and metastasis in a spontaneous breast cancer model , 2016, Oncogene.

[12]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[13]  Jun Li,et al.  TCPA: a resource for cancer functional proteomics data , 2013, Nature Methods.

[14]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[15]  Núria Queralt-Rosinach,et al.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants , 2016, Nucleic Acids Res..

[17]  S. Horvath,et al.  Weighted gene coexpression network analysis strategies applied to mouse weight , 2007, Mammalian Genome.

[18]  Gianluca Bontempi,et al.  On the Impact of Entropy Estimation on Transcriptional Regulatory Network Inference Based on Mutual Information , 2008, EURASIP J. Bioinform. Syst. Biol..

[19]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[20]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[21]  Yiling Lu,et al.  Use of Reverse Phase Protein Microarrays and Reference Standard Development for Molecular Network Analysis of Metastatic Ovarian Carcinoma* , 2005, Molecular & Cellular Proteomics.

[22]  Piet Demeester,et al.  NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms , 2014, PloS one.