论文信息 - Seed-weighted random walk ranking for cancer biomarker prioritisation: a case study in leukaemia

Seed-weighted random walk ranking for cancer biomarker prioritisation: a case study in leukaemia

A central focus of clinical proteomics for cancer is to identify protein biomarkers with diagnostic and therapeutic application potential. Network-based analyses have been used in computational disease-related gene prioritisation for several years. The Random Walk Ranking (RWR) algorithm has been successfully applied to prioritising disease-related gene candidates by exploiting global network topology in a Protein-Protein Interaction (PPI) network. Increasing the specificity and sensitivity ofbiomarkers may require consideration of similar or closely-related disease phenotypes and molecular pathological mechanisms shared across different disease phenotypes. In this paper, we propose a method called Seed-Weighted Random Walk Ranking (SW-RWR) for prioritizing cancer biomarker candidates. This method uses the information of cancer phenotype association to assign to each gene a disease-specific, weighted value to guide the RWR algorithm in a global human PPI network. In a case study of prioritizing leukaemia biomarkers, SW-RWR outperformed a typical local network-based analysis in coverage and also showed better accuracy and sensitivity than the original RWR method (global network-based analysis). Our results suggest that the tight correlation among different cancer phenotypes could play an important role in cancer biomarker discovery.

[1] A. Barabasi,et al. Human disease classification in the postgenomic era: A complex systems approach to human pathobiology , 2007, Molecular systems biology.

[2] Peter R. Hoskins,et al. 25th Annual Conference of the IEEE Engineering in Medicine and Biology Society , 2003 .

[3] Sudipto Saha,et al. Dissecting the human plasma proteome and inflammatory response biomarkers , 2009, Proteomics.

[4] Jon Kleinberg,et al. Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[5] Yongjin Li,et al. Discovering disease-genes by topological features in human protein-protein interaction network , 2006, Bioinform..

[6] A. Barabasi,et al. The human disease network , 2007, Proceedings of the National Academy of Sciences.

[7] C. Sawyers. The cancer biomarker problem , 2008, Nature.

[8] Ragini Pandey,et al. Network topological reordering revealing systemic patterns in yeast protein interaction networks , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[9] P. Robinson,et al. Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[10] Xiaoyan Zhu,et al. Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts , 2009, PLoS Comput. Biol..

[11] Jake Yue Chen,et al. Reordering based integrative expression profiling for microarray classification , 2012, BMC Bioinformatics.

[12] Joel Dudley,et al. Identification of Discriminating Biomarkers for Human Disease Using Integrative Network Biology , 2008, Pacific Symposium on Biocomputing.

[13] M. DePamphilis,et al. HUMAN DISEASE , 1957, The Ulster Medical Journal.

[14] B. Snel,et al. Predicting disease genes using protein–protein interactions , 2006, Journal of Medical Genetics.

[15] Liya Gu,et al. Evidence That a Mutation in the MLH1 3′-Untranslated Region Confers a Mutator Phenotype and Mismatch Repair Deficiency in Patients with Relapsed Leukemia* , 2008, Journal of Biological Chemistry.

[16] Jing Chen,et al. Disease candidate gene identification and prioritization using protein interaction networks , 2009, BMC Bioinformatics.

[17] Manfred Schmitt,et al. Tumor Markers , 2003, Molecular & Cellular Proteomics.

[18] J. Chen,et al. Disease gene-fishing in molecular interaction networks: A case study in colorectal cancer , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19] Fan Zhang,et al. Discovery of pathway biomarkers from coupled proteomics and systems biology methods , 2010, BMC Genomics.

[20] Changyu Shen,et al. Mining Alzheimer Disease Relevant Proteins from Integrated Protein Interactome Data , 2005, Pacific Symposium on Biocomputing.

[21] N. Anderson,et al. A List of Candidate Cancer Biomarkers for Targeted Proteomics , 2006, Biomarker insights.

[22] Ambuj K. Singh,et al. Analysis of protein-protein interaction networks using random walks , 2005, BIOKDD.

[23] Jake Yue Chen,et al. Initial large-scale exploration of protein-protein interactions in human brain , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[24] A. Sivachenko,et al. Data mining in protein interactomics , 2005, IEEE Engineering in Medicine and Biology Magazine.

[25] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[26] Martin Klabusay,et al. Detection and treatment of molecular relapse in acute myeloid leukemia with RUNX1 (AML1), CBFB, or MLL gene translocations: frequent quantitative monitoring of molecular markers in different compartments and correlation with WT1 gene expression. , 2009, Experimental hematology.

[27] Jake Yue Chen,et al. ProteoLens: a visual analytic tool for multi-scale database-driven biological network data mining , 2008, BMC Bioinformatics.

[28] Kyungja Han,et al. Acute myeloid leukemia with MYC amplification in the homogeneous staining regions and double minutes. , 2009, Cancer genetics and cytogenetics.

[29] N Asou,et al. Fusion of TEL/ETV6 to a novel ACS2 in myelodysplastic syndrome and acute myelogenous leukemia with t(5;12)(q31;p13) , 1999, Genes, chromosomes & cancer.

[30] J. Chen,et al. HAPPI: an online database of comprehensive human annotated and predicted protein interactions , 2009, BMC Genomics.

[31] Alan F. Scott,et al. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..