Unveiling network-based functional features through integration of gene expression into protein networks.

Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers.

[1]  Seyed Shahriar Arab,et al.  CentiServer: A Comprehensive Resource, Web-Based Application and R Package for Centrality Analysis , 2015, PloS one.

[2]  SantoniDaniele,et al.  An Integrated Approach (CLuster Analysis Integration Method) to Combine Expression Data and Protein–Protein Interaction Networks in Agrigenomics: Application on Arabidopsis thaliana , 2014 .

[3]  Yves Moreau,et al.  PINTA: a web server for network-based gene prioritization from expression data , 2011, Nucleic Acids Res..

[4]  Lin Gao,et al.  Biological network analysis: insights into structure and functions. , 2012, Briefings in functional genomics.

[5]  Fang-Xiang Wu,et al.  Prioritizing human disease genes by multiple data integration , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[6]  John Quackenbush,et al.  Integrating transcriptional and protein interaction networks to prioritize condition-specific master regulators , 2015, BMC Systems Biology.

[7]  P. Maji,et al.  Significance and Functional Similarity for Identification of Disease Genes , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  Jonathan L. Robinson,et al.  Integrative analysis of human omics data using biomolecular networks. , 2016, Molecular bioSystems.

[9]  Yang Yang,et al.  Integrating Gene Expression and Protein Interaction Data for Signaling Pathway Prediction of Alzheimer's Disease , 2014, Comput. Math. Methods Medicine.

[10]  Peng Li,et al.  Mining protein complexes based on connected affinity clique extension , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[11]  Peng Yang,et al.  Detecting temporal protein complexes from dynamic protein-protein interaction networks , 2014, BMC Bioinformatics.

[12]  Ron Shamir,et al.  Identifying functional modules using expression profiles and confidence-scored protein interactions , 2009, Bioinform..

[13]  Yi Pan,et al.  Identifying dynamic protein complexes based on gene expression profiles and PPI networks , 2013, BIBM.

[14]  Hon Wai Leong,et al.  Temporal dynamics of protein complexes in PPI Networks: a case study using yeast cell cycle dynamics , 2012, BMC Bioinformatics.

[15]  Anastasios Bezerianos,et al.  Growing functional modules from a seed protein via integration of protein interaction and gene expression data , 2007, BMC Bioinformatics.

[16]  Chao Liu,et al.  Complex-based analysis of dysregulated cellular processes in cancer , 2014, BMC Systems Biology.

[17]  Mehmet Koyutürk,et al.  DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization , 2011, BioData Mining.

[18]  Fei Yao-ping Essential protein discovery method based on integration of PPI and gene expression data , 2013 .

[19]  Pradipta Maji,et al.  RelSim: An integrated method to identify disease genes using gene expression profiles and PPIN based similarity measure , 2017, Inf. Sci..

[20]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[21]  Zhenran Jiang,et al.  Identification of Genes Involved in Breast Cancer Metastasis by Integrating Protein-Protein Interaction Information with Expression Data , 2017, J. Comput. Biol..

[22]  Saeed Jalili,et al.  BiCAMWI: A Genetic-Based Biclustering Algorithm for Detecting Dynamic Protein Complexes , 2016, PloS one.

[23]  Tobias Müller,et al.  Bioinformatics Applications Note Systems Biology Bionet: an R-package for the Functional Analysis of Biological Networks , 2022 .

[24]  Y. Moreau,et al.  Finding the targets of a drug by integration of gene expression data with a protein interaction network. , 2013, Molecular bioSystems.

[25]  Y. Moreau,et al.  Computational tools for prioritizing candidate genes: boosting disease gene discovery , 2012, Nature Reviews Genetics.

[26]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[27]  Tao Li,et al.  Combining Gene Expression Profiles and Protein-Protein Interactions for Identifying Functional Modules , 2012, 2012 11th International Conference on Machine Learning and Applications.

[28]  Fang-Xiang Wu,et al.  Identifying disease genes by integrating multiple data sources , 2014, BMC Medical Genomics.

[29]  Saeed Jalili,et al.  PCD-GED: Protein complex detection considering PPI dynamics based on time series gene expression data. , 2015, Journal of theoretical biology.

[30]  Zhang-Zhi Hu,et al.  Omics-based molecular target and biomarker identification. , 2011, Methods in molecular biology.

[31]  Xiaohua Hu,et al.  Neighbor affinity based algorithm for discovering temporal protein complex from dynamic PPI network. , 2016, Methods.

[32]  Jianhua Ruan,et al.  Identification of biomarkers in breast cancer metastasis by integrating protein-protein interaction network and gene expression data , 2011, 2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS).

[33]  Jiajia Chen,et al.  Network Biomarkers Constructed from Gene Expression and Protein-Protein Interaction Data for Accurate Prediction of Leukemia , 2017, Journal of Cancer.

[34]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[35]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2006, Nucleic Acids Res..

[36]  Olaf Wolkenhauer,et al.  Evolution of Centrality Measurements for the Detection of Essential Proteins in Biological Networks , 2016, Front. Physiol..

[37]  Hsiang-Yuan Yeh,et al.  Biomarker Identification for Prostate Cancer and Lymph Node Metastasis from Microarray Data and Protein Interaction Network Using Gene Prioritization Method , 2012, TheScientificWorldJournal.

[38]  Yang Chen,et al.  Identification of responsive gene modules by network-based gene clustering and extending: application to inflammation and angiogenesis , 2010, BMC Systems Biology.

[39]  Yi Pan,et al.  A new essential protein discovery method based on the integration of protein-protein interaction and gene expression data , 2012, BMC Systems Biology.

[40]  A. Lusis,et al.  Considerations for the design of omics studies , 2017 .

[41]  V. S. Rao,et al.  Protein-Protein Interaction Detection: Methods and Analysis , 2014, International journal of proteomics.

[42]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[43]  Fei Luo,et al.  Discovering conditional co-regulated protein complexes by integrating diverse data sources , 2010, BMC Systems Biology.

[44]  Yan Lin,et al.  DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes , 2008, Nucleic Acids Res..

[45]  H. Mewes,et al.  Functional modules by relating protein interaction networks and gene expression. , 2003, Nucleic acids research.

[46]  Giulio Superti-Furga,et al.  Protein complexes and proteome organization from yeast to man. , 2003, Current opinion in chemical biology.

[47]  Yi Pan,et al.  Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data , 2011, BMC Bioinformatics.

[48]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Yves Moreau,et al.  Network Analysis of Differential Expression for the Identification of Disease-Causing Genes , 2009, PloS one.

[50]  Hyunju Lee,et al.  WMAXC: A Weighted Maximum Clique Method for Identifying Condition-Specific Sub-Network , 2014, PloS one.

[51]  Linton C. Freeman,et al.  Going the Wrong Way on a One-Way Street: Centrality in Physics and Biology , 2008, J. Soc. Struct..

[52]  Yi Pan,et al.  Identifying essential proteins from active PPI networks constructed with dynamic gene expression , 2015, BMC Genomics.

[53]  Liwei Qian,et al.  Improving prediction of drug therapy outcome via integration of time series gene expression and Protein Protein Interaction network , 2012, 2012 IEEE 6th International Conference on Systems Biology (ISB).

[54]  P. Bork,et al.  Dynamic Complex Formation During the Yeast Cell Cycle , 2005, Science.

[55]  Gary D. Bader,et al.  Pathguide: a Pathway Resource List , 2005, Nucleic Acids Res..

[56]  Robert Clarke,et al.  Identifying protein interaction subnetworks by a bagging Markov random field-based method , 2012, Nucleic acids research.

[57]  Bart De Moor,et al.  Candidate gene prioritization by network analysis of differential expression using machine learning approaches , 2010, BMC Bioinformatics.

[58]  Yi Pan,et al.  Construction of the spatial and temporal active protein interaction network for identifying protein complexes , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[59]  Wen-Tsao Pan,et al.  A new Fruit Fly Optimization Algorithm: Taking the financial distress model as an example , 2012, Knowl. Based Syst..

[60]  Nazar Zaki,et al.  Detecting Protein Complexes in Protein Interaction Networks Modeled as Gene Expression Biclusters , 2015, PloS one.

[61]  Ping Luo,et al.  Identifying disease genes from PPI networks weighted by gene expression under different conditions , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[62]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[63]  Shekhar C. Mande,et al.  Dynamic Changes in Protein Functional Linkage Networks Revealed by Integration with Gene Expression Data , 2008, PLoS Comput. Biol..

[64]  Yi Pan,et al.  Predicting Essential Proteins Based on Weighted Degree Centrality , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[65]  Wei Li,et al.  Correlating interactions with gene expressions to detect protein complexes in protein interaction networks , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[66]  Jie Zhao,et al.  Identifying protein complexes in dynamic protein-protein interaction networks based on Cuckoo Search algorithm , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[67]  Yi Pan,et al.  Active Protein Interaction Network and Its Application on Protein Complex Detection , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine.

[68]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[69]  Christian V. Forst,et al.  Differential network expression during drug and stress response , 2005, Bioinform..

[70]  Jing Zhao,et al.  Degree-adjusted algorithm for prioritisation of candidate disease genes from gene expression and protein interactome. , 2014, IET systems biology.

[71]  Daniele Santoni,et al.  An integrated approach (CLuster Analysis Integration Method) to combine expression data and protein-protein interaction networks in agrigenomics: application on Arabidopsis thaliana. , 2014, Omics : a journal of integrative biology.

[72]  Ron Shamir,et al.  Identification of functional modules using network topology and high-throughput data , 2007, BMC Systems Biology.

[73]  Maozu Guo,et al.  Mining disease genes using integrated protein–protein interaction and gene–gene co-regulation information , 2015, FEBS open bio.

[74]  Atul J. Butte,et al.  Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks , 2005, BMC Bioinformatics.

[75]  Jin Xu,et al.  A New Method for the Discovery of Essential Proteins , 2013, PloS one.

[76]  Yijia Zhang,et al.  Construction of dynamic probabilistic protein interaction networks for protein complex identification , 2016, BMC Bioinformatics.

[77]  David Correa Martins,et al.  Multiview Clustering on PPI Network for Gene Selection and Enrichment from Microarray Data , 2014, 2014 IEEE International Conference on Bioinformatics and Bioengineering.

[78]  Xianjun Shen,et al.  An Edge-based Protein Complex Identification Algorithm With Gene Co-expression Data (PCIA-GeCo) , 2014, IEEE Transactions on NanoBioscience.

[79]  Judy H. Cho,et al.  Guilt by rewiring: gene prioritization through network rewiring in genome wide association studies. , 2014, Human molecular genetics.

[80]  Jing Zhu,et al.  Edge-based scoring and searching method for identifying condition-responsive protein-protein interaction sub-network , 2007, Bioinform..

[81]  Yi Pan,et al.  A comparison of the functional modules identified from time course and static PPI network data , 2011, BMC Bioinformatics.

[82]  Kang Tu,et al.  Combining gene expression profiles and protein-protein interaction data to infer gene functions. , 2006, Journal of biotechnology.

[83]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[84]  Mark A. Ragan,et al.  Systematic tracking of dysregulated modules identifies novel genes in cancer , 2013, Bioinform..

[85]  David Botstein,et al.  SGD: Saccharomyces Genome Database , 1998, Nucleic Acids Res..

[86]  Hong Qin,et al.  Detection of Changes in Transitive Associations by Shortest-path Analysis of Protein Interaction Networks Integrated with Gene Expression Profiles , 2008, 2008 International Conference on BioMedical Engineering and Informatics.

[87]  S. Kasif,et al.  Network-Based Analysis of Affected Biological Processes in Type 2 Diabetes Models , 2007, PLoS genetics.

[88]  Jianxin Wang,et al.  Identifying protein complexes based on the integration of PPI network and gene expression data , 2015, Int. J. Bioinform. Res. Appl..

[89]  Fang-Xiang Wu,et al.  Discovering biological patterns from short time-series gene expression profiles with integrating PPI data , 2014, Neurocomputing.

[90]  Ben Lehner,et al.  Tissue specificity and the human protein interaction network , 2009, Molecular systems biology.

[91]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[92]  Sanghamitra Bandyopadhyay,et al.  A NMF based approach for integrating multiple data sources to predict HIV-1–human PPIs , 2016, BMC Bioinformatics.

[93]  Chung-Yen Lin,et al.  A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles , 2010, BMC Bioinformatics.

[94]  Yi Pan,et al.  Identifying essential proteins via integration of protein interaction and gene expression data , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[95]  Zhu-Hong You,et al.  Integration of Genomic and Proteomic Data to Predict Synthetic Genetic Interactions Using Semi-supervised Learning , 2009, ICIC.

[96]  Xiufen Zou,et al.  Detecting Essential Proteins Based on Network Topology, Gene Expression Data and Gene Ontology Information. , 2016, IEEE/ACM transactions on computational biology and bioinformatics.

[97]  Xianjun Shen,et al.  Mining Temporal Protein Complex Based on the Dynamic PIN Weighted with Connected Affinity and Gene Co-Expression , 2016, PloS one.

[98]  Pradipta Maji,et al.  Rough Hypercuboid and Modified Kulczynski Coefficient for Disease Gene Identification , 2017, ACIIDS.

[99]  Yi Pan,et al.  Construction and application of dynamic protein interaction network based on time course gene expression data , 2013, Proteomics.

[100]  K. Strimbu,et al.  What are biomarkers? , 2010, Current opinion in HIV and AIDS.

[101]  Yongjin Park,et al.  How networks change with time , 2012, Bioinform..

[102]  David Warde-Farley,et al.  Dynamic modularity in protein interaction networks predicts breast cancer outcome , 2009, Nature Biotechnology.

[103]  Haidong Wang,et al.  Discovering molecular pathways from protein interaction and gene expression data , 2003, ISMB.

[104]  Holger Fröhlich,et al.  Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics , 2013, PloS one.

[105]  Xiujuan Lei,et al.  Identification of dynamic protein complexes based on fruit fly optimization algorithm , 2016, Knowl. Based Syst..

[106]  Petter Holme,et al.  Ranking Candidate Disease Genes from Gene Expression and Protein Interaction: A Katz-Centrality Based Approach , 2011, PloS one.

[107]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[108]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[109]  Rosy Das Sarmah,et al.  Weighted edge based clustering to identify protein complexes in protein-protein interaction networks incorporating gene expression profile , 2016, Comput. Biol. Chem..

[110]  R. Shamir,et al.  Regulatory networks define phenotypic classes of human stem cell lines , 2008, Nature.

[111]  Maricel G. Kann,et al.  Chapter 4: Protein Interactions and Disease , 2012, PLoS Comput. Biol..

[112]  Hongyu Zhao,et al.  COSINE: COndition-SpecIfic sub-NEtwork identification using a global optimization method , 2011, Bioinform..