HOGMMNC: a higher order graph matching with multiple network constraints model for gene‐drug regulatory modules identification

Motivation: The emergence of large amounts of genomic, chemical, and pharmacological data provides new opportunities and challenges. Identifying gene‐drug associations is not only crucial in providing a comprehensive understanding of the molecular mechanisms of drug action, but is also important in the development of effective treatments for patients. However, accurately determining the complex associations among pharmacogenomic data remains challenging. We propose a higher order graph matching with multiple network constraints (HOGMMNC) model to accurately identify gene‐drug modules. The HOGMMNC model aims to capture the inherent structural relations within data drawn from multiple sources by hypergraph matching. The proposed technique seamlessly integrates prior constraints to enhance the accuracy and reliability of the identified relations. An effective numerical solution is combined with a novel sampling strategy to solve the problem efficiently. Results: The superiority and effectiveness of our proposed method are demonstrated through a comparison with four state‐of‐the‐art techniques using synthetic and empirical data. The experiments on synthetic data show that the proposed method clearly outperforms other methods, especially in the presence of noise and irrelevant samples. The HOGMMNC model identifies eighteen gene‐drug modules in the empirical data. The modules are validated to have significant associations via pathway analysis. Significance: The modules identified by HOGMMNC provide new insights into the molecular mechanisms of drug action and provide patients with more effective treatments. Our proposed method can be applied to the study of other biological correlated module identification problems (e.g. miRNA‐gene, gene‐methylation, and gene‐disease). Availability and implementation: A matlab package of HOGMMNC is available at https://github.com/scutbioinformatics/HOGMMNC/. Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Shi-Hua Zhang,et al.  Integrative analysis for identifying joint modular patterns of gene-expression and drug-response data , 2016, Bioinform..

[2]  Roded Sharan,et al.  Global alignment of protein-protein interaction networks. , 2013, Methods in molecular biology.

[3]  Saeed Talebi,et al.  The Effect of Sortilin Silencing on Ovarian Carcinoma Cells , 2014, Avicenna journal of medical biotechnology.

[4]  Yang Wang,et al.  RRM1 gene expression in peripheral blood is predictive of shorter survival in Chinese patients with advanced non-small-cell lung cancer treated by gemcitabine and platinum , 2011, Journal of Zhejiang University SCIENCE B.

[5]  Martial Hebert,et al.  A spectral technique for correspondence problems using pairwise constraints , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[6]  Matthias Hein,et al.  A flexible tensor block coordinate ascent scheme for hypergraph matching , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  X-S Zhang,et al.  Predicting cooperative drug effects through the quantitative cellular profiling of response to individual drugs , 2014, CPT: pharmacometrics & systems pharmacology.

[8]  Wenmin Wang,et al.  Second- and High-Order Graph Matching for Correspondence Problems , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  C Conover Talbot,et al.  Abnormalities of the large ribosomal subunit protein, Rpl35a, in Diamond-Blackfan anemia. , 2008, Blood.

[10]  Xiaobo Zhou,et al.  Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces , 2010, BMC Systems Biology.

[11]  Jian Peng,et al.  A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information , 2017, Nature Communications.

[12]  George Fountzilas,et al.  Expression of DNA repair and replication genes in non-small cell lung cancer (NSCLC): a role for thymidylate synthetase (TYMS) , 2012, BMC Cancer.

[13]  Juan Liu,et al.  A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules , 2011, Bioinform..

[14]  Jean Ponce,et al.  A tensor-based algorithm for high-order graph matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  John Attia,et al.  Sortilin is associated with breast cancer aggressiveness and contributes to tumor cell adhesion and invasion , 2015, Oncotarget.

[16]  Guoqiang Han,et al.  WaveDec: A Wavelet Approach to Identify Both Shared and Individual Patterns of Copy-Number Variations , 2018, IEEE Transactions on Biomedical Engineering.

[17]  P. Laird,et al.  Discovery of multi-dimensional modules by integrative analysis of cancer genomic data , 2012, Nucleic acids research.

[18]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[19]  Ulrich Bodenhofer,et al.  FABIA: factor analysis for bicluster acquisition , 2010 .

[20]  Deng Pan,et al.  DGIdb 2.0: mining clinically relevant drug–gene interactions , 2015, Nucleic Acids Res..

[21]  Minsu Cho,et al.  Reweighted Random Walks for Graph Matching , 2010, ECCV.

[22]  Shannon L. Risacher,et al.  Structured sparse canonical correlation analysis for brain imaging genetics: an improved GraphNet method , 2016, Bioinform..

[23]  Koji Tsuda,et al.  An in silico model for interpreting polypharmacology in drug-target networks. , 2013, Methods in molecular biology.

[24]  Tom E. Bishop,et al.  Blind Image Restoration Using a Block-Stationary Signal Model , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[25]  K. K. Laktionov,et al.  Arf6, RalA, and BIRC5 protein expression in nonsmall cell lung cancer , 2011, Molecular Biology.

[26]  Gema Moreno-Bueno,et al.  Melanoma exosomes educate bone marrow progenitor cells toward a pro-metastatic phenotype through MET , 2012, Nature Medicine.

[27]  E A Davidson,et al.  Canine U2 snRNA gene: nucleotide sequence, characterization and implications in RNA processing and cancer biology. , 1994, Cancer biochemistry biophysics.

[28]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[29]  Phillip A. Regalia,et al.  The higher-order power method revisited: convergence proofs and effective initialization , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[30]  Shuxing Zhang,et al.  Polypharmacology: drug discovery for the future , 2013, Expert review of clinical pharmacology.

[31]  N. Hanna,et al.  RRM1 Modulated In Vitro and In Vivo Efficacy of Gemcitabine and Platinum in Non–Small-Cell Lung Cancer , 2008 .

[32]  Toru Hirota,et al.  Chromosome segregation machinery and cancer , 2009, Cancer science.

[33]  Crispin R Dass,et al.  Doxorubicin: an update on anticancer molecular action, toxicity and novel drug delivery systems , 2013, The Journal of pharmacy and pharmacology.

[34]  Claus Storgaard Sørensen,et al.  DNA replication and cancer: From dysfunctional replication origin activities to therapeutic opportunities. , 2016, Seminars in cancer biology.

[35]  Wei Xu,et al.  The disease and gene annotations (DGA): an annotation resource for human disease , 2012, Nucleic Acids Res..

[36]  Yves Pommier,et al.  DNA topoisomerases and their poisoning by anticancer and antibacterial drugs. , 2010, Chemistry & biology.

[37]  Wolfgang Link,et al.  Protein localization in disease and therapy , 2011, Journal of Cell Science.

[38]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[39]  J. Medina-Franco,et al.  Shifting from the single to the multitarget paradigm in drug discovery. , 2013, Drug discovery today.

[40]  Ke-Feng Wang,et al.  Comparison of gemcitabine, oxaliplatin and L-asparaginase and etoposide, vincristine, doxorubicin, cyclophosphamide and prednisone as first-line chemotherapy in patients with stage IE to IIE extranodal natural killer/T-cell lymphoma: a multicenter retrospective study , 2015, Leukemia & lymphoma.

[41]  Katherina Baranova,et al.  Genomic signatures for paclitaxel and gemcitabine resistance in breast cancer derived by machine learning , 2016, Molecular oncology.

[42]  Alfredo Brusco,et al.  Diamond-Blackfan anemia: genotype-phenotype correlations in Italian patients with RPL5 and RPL11 mutations , 2010, Haematologica.

[43]  Wenwen Min,et al.  A Two-Stage Method to Identify Joint Modules From Matched MicroRNA and mRNA Expression Data , 2016, IEEE Transactions on NanoBioscience.

[44]  Hyun Cheol Chung,et al.  Ribonucleotide reductase M1 (RRM1) 2464G>A polymorphism shows an association with gemcitabine chemosensitivity in cancer cell lines , 2006, Pharmacogenetics and genomics.

[45]  Ulrich Bodenhofer,et al.  FABIA: factor analysis for bicluster acquisition , 2010, Bioinform..

[46]  Barbara McGrogan,et al.  Taxanes, microtubules and chemoresistant breast cancer. , 2008, Biochimica et biophysica acta.

[47]  Shi-Hua Zhang,et al.  Identifying multi-layer gene regulatory modules from multi-dimensional genomic data , 2012, Bioinform..