Seed-weighted random walk ranking for cancer biomarker prioritisation: a case study in leukaemia

A central focus of clinical proteomics for cancer is to identify protein biomarkers with diagnostic and therapeutic application potential. Network-based analyses have been used in computational disease-related gene prioritisation for several years. The Random Walk Ranking (RWR) algorithm has been successfully applied to prioritising disease-related gene candidates by exploiting global network topology in a Protein-Protein Interaction (PPI) network. Increasing the specificity and sensitivity ofbiomarkers may require consideration of similar or closely-related disease phenotypes and molecular pathological mechanisms shared across different disease phenotypes. In this paper, we propose a method called Seed-Weighted Random Walk Ranking (SW-RWR) for prioritizing cancer biomarker candidates. This method uses the information of cancer phenotype association to assign to each gene a disease-specific, weighted value to guide the RWR algorithm in a global human PPI network. In a case study of prioritizing leukaemia biomarkers, SW-RWR outperformed a typical local network-based analysis in coverage and also showed better accuracy and sensitivity than the original RWR method (global network-based analysis). Our results suggest that the tight correlation among different cancer phenotypes could play an important role in cancer biomarker discovery.

[1]  A. Barabasi,et al.  Human disease classification in the postgenomic era: A complex systems approach to human pathobiology , 2007, Molecular systems biology.

[2]  Peter R. Hoskins,et al.  25th Annual Conference of the IEEE Engineering in Medicine and Biology Society , 2003 .

[3]  Sudipto Saha,et al.  Dissecting the human plasma proteome and inflammatory response biomarkers , 2009, Proteomics.

[4]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[5]  Yongjin Li,et al.  Discovering disease-genes by topological features in human protein-protein interaction network , 2006, Bioinform..

[6]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[7]  C. Sawyers The cancer biomarker problem , 2008, Nature.

[8]  Ragini Pandey,et al.  Network topological reordering revealing systemic patterns in yeast protein interaction networks , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[9]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[10]  Xiaoyan Zhu,et al.  Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts , 2009, PLoS Comput. Biol..

[11]  Jake Yue Chen,et al.  Reordering based integrative expression profiling for microarray classification , 2012, BMC Bioinformatics.

[12]  Joel Dudley,et al.  Identification of Discriminating Biomarkers for Human Disease Using Integrative Network Biology , 2008, Pacific Symposium on Biocomputing.

[13]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[14]  B. Snel,et al.  Predicting disease genes using protein–protein interactions , 2006, Journal of Medical Genetics.

[15]  Liya Gu,et al.  Evidence That a Mutation in the MLH1 3′-Untranslated Region Confers a Mutator Phenotype and Mismatch Repair Deficiency in Patients with Relapsed Leukemia* , 2008, Journal of Biological Chemistry.

[16]  Jing Chen,et al.  Disease candidate gene identification and prioritization using protein interaction networks , 2009, BMC Bioinformatics.

[17]  Manfred Schmitt,et al.  Tumor Markers , 2003, Molecular & Cellular Proteomics.

[18]  J. Chen,et al.  Disease gene-fishing in molecular interaction networks: A case study in colorectal cancer , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19]  Fan Zhang,et al.  Discovery of pathway biomarkers from coupled proteomics and systems biology methods , 2010, BMC Genomics.

[20]  Changyu Shen,et al.  Mining Alzheimer Disease Relevant Proteins from Integrated Protein Interactome Data , 2005, Pacific Symposium on Biocomputing.

[21]  N. Anderson,et al.  A List of Candidate Cancer Biomarkers for Targeted Proteomics , 2006, Biomarker insights.

[22]  Ambuj K. Singh,et al.  Analysis of protein-protein interaction networks using random walks , 2005, BIOKDD.

[23]  Jake Yue Chen,et al.  Initial large-scale exploration of protein-protein interactions in human brain , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[24]  A. Sivachenko,et al.  Data mining in protein interactomics , 2005, IEEE Engineering in Medicine and Biology Magazine.

[25]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[26]  Martin Klabusay,et al.  Detection and treatment of molecular relapse in acute myeloid leukemia with RUNX1 (AML1), CBFB, or MLL gene translocations: frequent quantitative monitoring of molecular markers in different compartments and correlation with WT1 gene expression. , 2009, Experimental hematology.

[27]  Jake Yue Chen,et al.  ProteoLens: a visual analytic tool for multi-scale database-driven biological network data mining , 2008, BMC Bioinformatics.

[28]  Kyungja Han,et al.  Acute myeloid leukemia with MYC amplification in the homogeneous staining regions and double minutes. , 2009, Cancer genetics and cytogenetics.

[29]  N Asou,et al.  Fusion of TEL/ETV6 to a novel ACS2 in myelodysplastic syndrome and acute myelogenous leukemia with t(5;12)(q31;p13) , 1999, Genes, chromosomes & cancer.

[30]  J. Chen,et al.  HAPPI: an online database of comprehensive human annotated and predicted protein interactions , 2009, BMC Genomics.

[31]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..