A Greedy Algorithm for Detecting Mutually Exclusive Patterns in Cancer Mutation Data

Some somatic mutations are reported to present mutually exclusive patterns. It is a basic computational problem to efficiently extracting mutually exclusive patterns from cancer mutation data. In this article, we focus on the inter-set mutual exclusion problem, which is to group the genes into at least two sets, with the mutations in the different sets mutually exclusive. The proposed algorithm improves the calculation of the score of mutual exclusion. The improved measurement considers the percentage of supporting cases, the approximate exclusivity degree and the pair-wise similarities of two genes. Moreover, the proposed algorithm adopts a greedy strategy to generate the sets of genes. Different from the existing approaches, the greedy strategy considers the scores of mutual exclusion between both the genes and virtual genes, which benefits the selection with the size restrictions. We conducted a series of experiments to verify the performance on simulation datasets and TCGA dataset consisting of 477 real cases with more than 10 million mutations within 28507 genes. According to the results, our algorithm demonstrated good performance under different simulation configurations. In addition, it outperformed CoMEt, a widely-accepted algorithm, in recall rates and accuracies on simulation datasets. Moreover, some of the exclusive patterns detected from TCGA dataset were supported by published literatures.

[1]  Christopher A. Miller,et al.  Discovering functional modules by identifying recurrent and mutually exclusive mutational patterns in tumors , 2011, BMC Medical Genomics.

[2]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[3]  Payam Shahi,et al.  The Transcriptional Repressor ZNF503/Zeppo2 Promotes Mammary Epithelial Cell Proliferation and Enhances Cell Invasion* , 2014, The Journal of Biological Chemistry.

[4]  Li Ding,et al.  Patterns and functional implications of rare germline variants across 12 cancer types , 2015, Nature Communications.

[5]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[6]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[7]  Richard P Lifton,et al.  Mutations in KDSR Cause Recessive Progressive Symmetric Erythrokeratoderma. , 2017, American journal of human genetics.

[8]  Wei Cheng,et al.  A new approach for persistent cloaca: Laparoscopically assisted anorectoplasty and modified repair of urogenital sinus. , 2015, Journal of pediatric surgery.

[9]  Ping Li,et al.  Decreased LIPF expression is correlated with DGKA and predicts poor outcome of gastric cancer , 2016, Oncology reports.

[10]  Steven J. M. Jones,et al.  Pathogenic Germline Variants in 10,389 Adult Cancers. , 2018, Cell.

[11]  Jie Chen,et al.  LncRNA NR2F1‐AS1 regulates hepatocellular carcinoma oxaliplatin resistance by targeting ABCC1 via miR‐363 , 2018, Journal of cellular and molecular medicine.

[12]  Marie-Paule Strub,et al.  The preliminary solution structure of human p13MTCP1, an oncogenic protein encoded by the MTCP1 gene, using 2D homonuclear NMR , 1998 .

[13]  L. Poliseno Pseudogenes: Newly Discovered Players in Human Cancer , 2012, Science Signaling.

[14]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[15]  Benjamin J. Raphael,et al.  CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer , 2015, Genome Biology.

[16]  Benjamin J. Raphael,et al.  De novo discovery of mutated driver pathways in cancer , 2011 .

[17]  Mohammad Reza Abbaszadegan,et al.  Familial Esophageal Squamous Cell Carcinoma with damaging rare/germline mutations in KCNJ12/KCNJ18 and GPRIN2 genes. , 2017, Cancer genetics.

[18]  Po-Ming Chen,et al.  IFI44L is a novel tumor suppressor in human hepatocellular carcinoma affecting cancer stemness, metastasis, and drug resistance via regulating met/Src signaling pathway , 2018, BMC Cancer.

[19]  Benjamin J. Raphael,et al.  A weighted exact test for mutually exclusive mutations in cancer , 2016, Bioinform..

[20]  H. Immervoll,et al.  Expression of DSG1 and DSC1 are prognostic markers in anal carcinoma patients , 2012, British Journal of Cancer.

[21]  Ling Lin,et al.  PathScan: a tool for discerning mutational significance in groups of putative cancer genes , 2011, Bioinform..