An Entropy-Based Directed Random Walk for Cancer Classification Using Gene Expression Data Based on Bi-Random Walk on Two Separated Networks

The integration of microarray technologies and machine learning methods has become popular in predicting the pathological condition of diseases and discovering risk genes. Traditional microarray analysis considers pathways as a simple gene set, treating all genes in the pathway identically while ignoring the pathway network’s structure information. This study proposed an entropy-based directed random walk (e-DRW) method to infer pathway activities. Two enhancements from the conventional DRW were conducted, which are (1) to increase the coverage of human pathway information by constructing two inputting networks for pathway activity inference, and (2) to enhance the gene-weighting method in DRW by incorporating correlation coefficient values and t-test statistic scores. To test the objectives, gene expression datasets were used as input datasets while the pathway datasets were used as reference datasets to build two directed graphs. The within-dataset experiments indicated that e-DRW method demonstrated robust and superior performance in terms of classification accuracy and robustness of the predicted risk-active pathways compared to the other methods. In conclusion, the results revealed that e-DRW not only improved the prediction performance, but also effectively extracted topologically important pathways and genes that were specifically related to the corresponding cancer types.

[1]  Xiaojing Zheng,et al.  The Critical Gene Screening to Prevent Chromophobe Cell Renal Carcinoma Metastasis through TCGA and WGCNA , 2022, Journal of oncology.

[2]  D. Newby,et al.  The therapeutic potential of apelin in kidney disease , 2021, Nature Reviews Nephrology.

[3]  Dalwinder Singh,et al.  Investigating the impact of data normalization on classification performance , 2020, Appl. Soft Comput..

[4]  Longlong Wang,et al.  Nucleotide de novo synthesis increases breast cancer stemness and metastasis via cGMP-PKG-MAPK signaling pathway , 2020, PLoS biology.

[5]  Yan Li,et al.  Apelin enhances biological functions in lung cancer A549 cells by downregulating exosomal miR-15a-5p. , 2020, Carcinogenesis.

[6]  T. Phesse,et al.  Targeting Wnt Signaling for the Treatment of Gastric Cancer , 2020, International journal of molecular sciences.

[7]  Wenbin Liu,et al.  Classification of Cancers Based on a Comprehensive Pathway Activity Inferred by Genes and Their Interactions , 2020, IEEE Access.

[8]  J. Saunus,et al.  Calcium signalling and breast cancer. , 2019, Seminars in cell & developmental biology.

[9]  B. Baradaran,et al.  The relation between PI3K/AKT signalling pathway and cancer. , 2019, Gene.

[10]  Kyung-ah Sohn,et al.  Robust pathway-based multi-omics data integration using directed random walks for survival prediction in multiple cancer studies , 2019, Biology Direct.

[11]  Y. Gong,et al.  Bioinformatic analysis and identification of potential prognostic microRNAs and mRNAs in thyroid cancer , 2018, PeerJ.

[12]  Ying Wang,et al.  The role of Hippo signal pathway in breast cancer metastasis , 2018, OncoTargets and therapy.

[13]  S. Steinberg,et al.  Safety in treatment of hepatocellular carcinoma with immune checkpoint inhibitors as compared to melanoma and non-small cell lung cancer , 2017, Journal of Immunotherapy for Cancer.

[14]  Mohd Saberi Mohamad,et al.  An enhanced topologically significant directed random walk in cancer classification using gene expression datasets , 2017, Saudi journal of biological sciences.

[15]  Liwu Fu,et al.  Targeting calcium signaling in cancer therapy , 2016, Acta pharmaceutica Sinica. B.

[16]  Gang Chen,et al.  Human papillomavirus as a potential risk factor for gastric cancer: a meta-analysis of 1,917 cases , 2016, OncoTargets and therapy.

[17]  David W. Johnson,et al.  The role of cGMP and its signaling pathways in kidney disease. , 2016, American journal of physiology. Renal physiology.

[18]  Tae Kyun Kim,et al.  T test as a parametric statistic , 2015, Korean journal of anesthesiology.

[19]  Chris Sander,et al.  PaxtoolsR: pathway analysis in R using Pathway Commons , 2015, bioRxiv.

[20]  Yongtang Shi,et al.  Entropy of Weighted Graphs with Randi'c Weights , 2015, Entropy.

[21]  Xiaoxia Liu,et al.  Identification of Key Genes and Pathways in Renal Cell Carcinoma Through Expression Profiling Data , 2015, Kidney and Blood Pressure Research.

[22]  Hengwei Zhang,et al.  Gene expression profile analyze the molecular mechanism of CXCR7 regulating papillary thyroid carcinoma growth and metastasis , 2015, Journal of experimental & clinical cancer research : CR.

[23]  S. Linnarsson,et al.  Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing , 2014, Nature Neuroscience.

[24]  Hiroshi Mamitsuka,et al.  NetPathMiner: R/Bioconductor package for network path mining through gene expression , 2014, Bioinform..

[25]  Guangxi Zhou,et al.  Effects of the hippo signaling pathway in human gastric cancer. , 2013, Asian Pacific journal of cancer prevention : APJCP.

[26]  Fan Zhang,et al.  Topologically inferring risk-active pathways toward precise cancer classification by directed random walk , 2013, Bioinform..

[27]  Stavros K Archondakis,et al.  Thyroid gland metastasis from small cell lung cancer: an unusual site of metastatic spread. , 2013, Journal of thoracic disease.

[28]  V. Detours,et al.  A general method to derive robust organ-specific gene expression-based differentiation indices: application to thyroid cancer diagnostic , 2012, Oncogene.

[29]  M. Kon,et al.  Pathway-based classification of cancer subtypes , 2012, Biology Direct.

[30]  Xing-Ming Zhao,et al.  Identifying dysregulated pathways in cancers from pathway interaction networks , 2012, BMC Bioinformatics.

[31]  H. Ji,et al.  A network-based gene-weighting approach for pathway analysis , 2011, Cell Research.

[32]  David Galas,et al.  Systems biology of interstitial lung diseases: integration of mRNA and microRNA expression changes , 2011, BMC Medical Genomics.

[33]  Ivan Rusyn,et al.  Gene expression in nontumoral liver tissue and recurrence-free survival in hepatitis C virus-positive hepatocellular carcinoma , 2010, Molecular Cancer.

[34]  E. Dougherty,et al.  Accurate and Reliable Cancer Classification Based on Probabilistic Inference of Pathway Activity , 2009, PloS one.

[35]  Alessandro Giuliani,et al.  Genome-wide expression profile of sporadic gastric cancers with microsatellite instability. , 2009, European journal of cancer.

[36]  Doheon Lee,et al.  Inferring Pathway Activity toward Precise Disease Classification , 2008, PLoS Comput. Biol..

[37]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[38]  Arthur Liberzon,et al.  Using GenePattern for Gene Expression Analysis , 2008, Current protocols in bioinformatics.

[39]  S. Wacholder,et al.  Gene Expression Signature of Cigarette Smoking and Its Role in Lung Adenocarcinoma Development and Survival , 2008, PloS one.

[40]  Emmanuel Barillot,et al.  Classification of microarray data using gene networks , 2007, BMC Bioinformatics.

[41]  Jingchun Chen,et al.  Detecting functional modules in the yeast protein-protein interaction network , 2006, Bioinform..

[42]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[43]  Jun Lu,et al.  Pathway level analysis of gene expression using singular value decomposition , 2005, BMC Bioinformatics.

[44]  P. Hall,et al.  An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Randall T. Moon,et al.  Wnt and calcium signaling: β-Catenin-independent pathways , 2005 .

[46]  Marie Joseph,et al.  Gene Signatures of Progression and Metastasis in Renal Cell Cancer , 2005, Clinical Cancer Research.

[47]  Qing Wang,et al.  Towards precise classification of cancers based on robust gene functional expression profiles , 2005, BMC Bioinformatics.

[48]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[49]  Enrique Casado,et al.  PI3K/Akt signalling pathway and cancer. , 2004, Cancer treatment reviews.

[50]  Louise R Howe,et al.  Wnt Signaling and Breast Cancer , 2004, Cancer biology & therapy.

[51]  Haidong Wang,et al.  Discovering molecular pathways from protein interaction and gene expression data , 2003, ISMB.

[52]  Kazuhiro Yoshida,et al.  Expression of integrin-linked kinase is closely correlated with invasion and metastasis of gastric carcinoma , 2003, Virchows Archiv.

[53]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[54]  D. Rader,et al.  Antioxidant therapy and atherosclerosis: animal and human studies. , 2001, Trends in cardiovascular medicine.

[55]  G. Sheu,et al.  The association of human papillomavirus 16/18 infection with lung cancer among nonsmoking Taiwanese women. , 2001, Cancer research.

[56]  M. Thangaraju,et al.  Effect of tamoxifen on lipids and lipid metabolising marker enzymes in experimental atherosclerosis in Wistar rats , 1997, Molecular and Cellular Biochemistry.

[57]  F. Scinicariello,et al.  Detection of human papillomavirus in primary hepatocellular carcinoma. , 1992, Anticancer Research.

[58]  Yihua Zhu,et al.  gwSPIA: Improved Signaling Pathway Impact Analysis With Gene Weights , 2019, IEEE Access.

[59]  M. Franco,et al.  Integrin-Linked Kinase (ILK) Expression Correlates with Tumor Severity in Clear Cell Renal Carcinoma , 2012, Pathology & Oncology Research.

[60]  Max Kuhn,et al.  The caret Package , 2007 .

[61]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[62]  Supplemental Information 2: Kyoto Encyclopedia of genes and genomes. , 2022 .