Identifying driver mutations from sequencing data of heterogeneous tumors in the era of personalized genome sequencing

Distinguishing driver mutations from passenger mutations is critical to the understanding of the molecular mechanisms of carcinogenesis and for identifying prognostic and diagnostic markers as well as therapeutic targets. We reviewed the current approaches and software for identifying driver mutations from passenger mutations including both biology-based approaches and machine-learning-based approaches. We also reviewed approaches to identify driver mutations in the context of pathways or gene sets. Finally, we discussed the challenges of predicting driver mutations considering the complexities of inter- and intra-tumor heterogeneity as well as the evolution and progression of tumors.

[1]  A. Sparks,et al.  The Genomic Landscapes of Human Breast and Colorectal Cancers , 2007, Science.

[2]  R. Tibshirani,et al.  Comment on "The Consensus Coding Sequences of Human Breast and Colorectal Cancers" , 2007, Science.

[3]  Thomas J. Hardcastle,et al.  Genomic analysis of genetic heterogeneity and evolution in high-grade serous ovarian carcinoma , 2010, Oncogene.

[4]  A. McKenna,et al.  The Mutational Landscape of Head and Neck Squamous Cell Carcinoma , 2011, Science.

[5]  Shamil R. Sunyaev,et al.  Impact of deleterious passenger mutations on cancer progression , 2012, Proceedings of the National Academy of Sciences.

[6]  R. Gibbs,et al.  Exome Sequencing of Head and Neck Squamous Cell Carcinoma Reveals Inactivating Mutations in NOTCH1 , 2011, Science.

[7]  David Tamborero,et al.  Oncodrive-CIS: A Method to Reveal Likely Driver Genes Based on the Impact of Their Copy Number Changes on Expression , 2013, PloS one.

[8]  Ling Lin,et al.  PathScan: a tool for discerning mutational significance in groups of putative cancer genes , 2011, Bioinform..

[9]  Steven J. M. Jones,et al.  Frequent mutation of histone modifying genes in non-Hodgkin lymphoma , 2011, Nature.

[10]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[11]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[12]  Eric S. Lander,et al.  Discovery and prioritization of somatic mutations in diffuse large B-cell lymphoma (DLBCL) by whole-exome sequencing , 2012, Proceedings of the National Academy of Sciences.

[13]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[14]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[15]  A. Gonzalez-Perez,et al.  Improving the prediction of the functional impact of cancer mutations by baseline tolerance transformation , 2012, Genome Medicine.

[16]  Xiaobo Zhou,et al.  A novel missense-mutation-related feature extraction scheme for 'driver' mutation identification , 2012, Bioinform..

[17]  M. Stratton,et al.  The cancer genome , 2009, Nature.

[18]  A. Gonzalez-Perez,et al.  Functional impact bias reveals cancer drivers , 2012, Nucleic acids research.

[19]  C. Sander,et al.  Predicting the functional impact of protein mutations: application to cancer genomics , 2011, Nucleic acids research.

[20]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[21]  Raul Rabadan,et al.  Analysis of the Coding Genome of Diffuse Large B-Cell Lymphoma , 2011, Nature Genetics.

[22]  S. Henikoff,et al.  Predicting deleterious amino acid substitutions. , 2001, Genome research.

[23]  Leyla Isik,et al.  Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. , 2009, Cancer research.

[24]  David Dunson,et al.  Genetic heterogeneity of diffuse large B-cell lymphoma , 2013, Proceedings of the National Academy of Sciences.

[25]  Yan Zhang,et al.  CanPredict: a computational tool for predicting cancer-associated missense mutations , 2007, Nucleic Acids Res..

[26]  Peng Yue,et al.  SNPs3D: Candidate gene and SNP selection for association studies , 2006, BMC Bioinformatics.

[27]  Hannah Carter,et al.  CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer , 2011, Bioinform..

[28]  James R. Downing,et al.  Genomic Analysis of the Clonal Origins of Relapsed Acute Lymphoblastic Leukemia , 2008, Science.

[29]  G. Parmigiani,et al.  The Consensus Coding Sequences of Human Breast and Colorectal Cancers , 2006, Science.

[30]  A. Gonzalez-Perez,et al.  Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. , 2011, American journal of human genetics.

[31]  C. Sander,et al.  Determinants of protein function revealed by combinatorial entropy optimization , 2007, Genome Biology.

[32]  Modesto Orozco,et al.  PMUT: a web-based tool for the annotation of pathological mutations on proteins , 2005, Bioinform..

[33]  Giovanni Parmigiani,et al.  Patient-oriented gene set analysis for cancer mutation data , 2010, Genome Biology.

[34]  Kenneth H. Buetow,et al.  Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms , 2004, Bioinform..

[35]  J. Uhm An Integrated Genomic Analysis of Human Glioblastoma Multiforme , 2009 .

[36]  A. Sidow,et al.  Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. , 2005, Genome research.

[37]  B. Rost,et al.  SNAP: predict effect of non-synonymous polymorphisms on function , 2007, Nucleic acids research.

[38]  M. Stratton,et al.  The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website , 2004, British Journal of Cancer.

[39]  Gregory A Petsko,et al.  No stone unturned , 2010, Genome Biology.

[40]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[41]  Jonathan W. Pillow,et al.  POSTER PRESENTATION Open Access , 2013 .

[42]  B. Peters,et al.  Distinguishing cancer-associated missense mutations from common polymorphisms. , 2007, Cancer research.

[43]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[44]  J. Miller,et al.  Predicting the Functional Effect of Amino Acid Substitutions and Indels , 2012, PloS one.