Towards automation of germline variant curation in clinical cancer genetics

Cancer care professionals are confronted with interpreting results from multiplexed gene sequencing of patients at hereditary risk for cancer. Assessments for variant classification now require orthogonal data searches, requiring aggregation of multiple lines of evidence from diverse resources. The burden of evidence for each variant to meet thresholds for pathogenicity or actionability now poses a growing challenge for those seeking to counsel patients and families following germline genetic testing. A computational algorithm that automates, provides uniformity and significantly accelerates this interpretive process is needed. The tool described here, Pathogenicity of Mutation Analyzer (PathoMAN) automates germline genomic variant curation from clinical sequencing based on ACMG guidelines. PathoMAN aggregates multiple tracks of genomic, protein and disease specific information from public sources. We compared expert manually curated variant data from studies on (i) prostate cancer (ii) breast cancer and (iii) ClinVar to assess performance. PathoMAN achieves high concordance (83.1% pathogenic, 75.5% benign) and negligible discordance (0.04% pathogenic, 0.9% benign) when contrasted against expert curation. Some loss of resolution (8.6% pathogenic, 23.64% benign) and gain of resolution (6.6% pathogenic, 1.6% benign) was also observed. We highlight the advantages and weaknesses related to the programmable automation of variant classification. We also propose a new nosology for the five ACMG classes to facilitate more accurate reporting to ClinVar. The proposed refinements will enhance utility of ClinVar to allow further automation in cancer genetics. PathoMAN will reduce the manual workload of domain level experts. It provides a substantial advance in rapid classification of genetic variants by generating robust models using a knowledge-base of diverse genetic data https://pathoman.mskcc.org.

[1]  Yongwook Choi,et al.  PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels , 2015, Bioinform..

[2]  Sue Healey,et al.  ENIGMA—Evidence‐based network for the interpretation of germline mutant alleles: An international initiative to evaluate risk and clinical significance associated with sequence variation in BRCA1 and BRCA2 genes , 2012, Human mutation.

[3]  A. Børresen-Dale,et al.  Cancer in patients with ataxia-telangiectasia and in their relatives in the nordic countries. , 2001, Journal of the National Cancer Institute.

[4]  Risha Govind,et al.  CardioClassifier – demonstrating the power of disease- and gene-specific computational decision support for clinical genome interpretation , 2017, bioRxiv.

[5]  Donavan T. Cheng,et al.  Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A Hybridization Capture-Based Next-Generation Sequencing Clinical Assay for Solid Tumor Molecular Oncology. , 2015, The Journal of molecular diagnostics : JMD.

[6]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[7]  Tom R. Gaunt,et al.  Predicting the functional consequences of cancer-associated amino acid substitutions , 2013, Bioinform..

[8]  D. Cooper,et al.  Human Gene Mutation Database , 1996, Human Genetics.

[9]  N. Rahner,et al.  Hereditary cancer syndromes. , 2008, Deutsches Arzteblatt international.

[10]  R. W. Davis,et al.  Global analysis of ATM polymorphism reveals significant functional constraint. , 2001, American journal of human genetics.

[11]  Amit Kumar Sharma,et al.  The Curation of Genetic Variants: Difficulties and Possible Solutions , 2012, Genom. Proteom. Bioinform..

[12]  Gill Bejerano,et al.  M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity , 2016, Nature Genetics.

[13]  G. Jarvik,et al.  Consideration of Cosegregation in the Pathogenicity Classification of Genomic Variants. , 2016, American journal of human genetics.

[14]  P. Stenson,et al.  The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies , 2017, Human Genetics.

[15]  Heidi L Rehm,et al.  ClinGen--the Clinical Genome Resource. , 2015, The New England journal of medicine.

[16]  C. Sander,et al.  Predicting the functional impact of protein mutations: application to cancer genomics , 2011, Nucleic acids research.

[17]  H. Carter,et al.  Identifying Mendelian disease genes with the Variant Effect Scoring Tool , 2013, BMC Genomics.

[18]  Ricardo Villamarín-Salomón,et al.  ClinVar: public archive of interpretations of clinically relevant variants , 2015, Nucleic Acids Res..

[19]  F. Couch,et al.  Functional Assays for Analysis of Variants of Uncertain Significance in BRCA2 , 2014, Human mutation.

[20]  Jay Shendure,et al.  Saturation Editing of Genomic Regions by Multiplex Homology-Directed Repair , 2014, Nature.

[21]  Yuya Kobayashi,et al.  Clinical Actionability of Multigene Panel Testing for Hereditary Breast and Ovarian Cancer Risk Assessment. , 2015, JAMA oncology.

[22]  Ahmet Zehir,et al.  Comprehensive detection of germline variants by MSK-IMPACT, a clinical diagnostic platform for solid tumor molecular oncology and concurrent cancer predisposition testing , 2017, BMC Medical Genomics.

[23]  Jay Shendure,et al.  A multiplexed homology-directed DNA repair assay reveals the impact of ~1,700 BRCA1 variants on protein function , 2018, bioRxiv.

[24]  Shawn Rynearson,et al.  ClinGen Pathogenicity Calculator: a configurable system for assessing pathogenicity of genetic variants , 2017, Genome Medicine.

[25]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[26]  E. Boerwinkle,et al.  dbNSFP v3.0: A One‐Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice‐Site SNVs , 2016, Human mutation.

[27]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[28]  Soma Das,et al.  Clinical Laboratories Collaborate to Resolve Differences in Variant Interpretations Submitted to ClinVar , 2017, Genetics in Medicine.

[29]  Christophe Béroud,et al.  BRCA Share: A Collection of Clinical BRCA Gene Variants , 2016, Human mutation.

[30]  Xiaoming Liu,et al.  In Silico Prediction of Deleteriousness for Nonsynonymous and Splice-Altering Single Nucleotide Variants in the Human Genome. , 2017, Methods in molecular biology.

[31]  Donavan T. Cheng,et al.  Mutational Landscape of Metastatic Cancer Revealed from Prospective Clinical Sequencing of 10,000 Patients , 2017, Nature Medicine.

[32]  Maitreya J. Dunham,et al.  Variant Interpretation: Functional Assays to the Rescue. , 2017, American journal of human genetics.

[33]  D. Karolchik,et al.  The UCSC Genome Browser database: 2016 update , 2015, bioRxiv.

[34]  Jana Marie Schwarz,et al.  MutationTaster evaluates disease-causing potential of sequence alterations , 2010, Nature Methods.

[35]  Birgit Funke,et al.  Adaptation and validation of the ACMG/AMP variant classification framework for MYH7-associated inherited cardiomyopathies: recommendations by ClinGen’s Inherited Cardiomyopathy Expert Panel , 2018, Genetics in Medicine.

[36]  Robert C. Green,et al.  Erratum: Performance of ACMG-AMP Variant-Interpretation Guidelines among Nine Laboratories in the Clinical Sequencing Exploratory Research Consortium (American Journal of Human Genetics (2016) 98(6) (1067–1076) (S0002929716300593) (10.1016/j.ajhg.2016.03.024)) , 2016 .

[37]  Dmitriy Sonkin,et al.  TP53 Variations in Human Cancers: New Lessons from the IARC TP53 Database and Genomics Data , 2016, Human mutation.

[38]  Nansheng Chen,et al.  Using RepeatMasker to Identify Repetitive Elements in Genomic Sequences , 2009, Current protocols in bioinformatics.

[39]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[40]  Kosuke M. Teshima,et al.  Natural Selection on Genes that Underlie Human Disease Susceptibility , 2008, Current Biology.

[41]  Peter Kraft,et al.  COMPLEXO: identifying the missing heritability of breast cancer via next generation collaboration , 2013, Breast Cancer Research.

[42]  David L. Young,et al.  Massively Parallel Functional Analysis of BRCA1 RING Domain Variants , 2015, Genetics.

[43]  Nazneen Rahman,et al.  CSN and CAVA: variant annotation tools for rapid, robust next-generation sequencing analysis in the clinical setting , 2015, Genome Medicine.

[44]  C. Bonaïti‐pellié,et al.  Drastic Effect of Germline TP53 Missense Mutations in Li–Fraumeni Patients , 2013, Human mutation.

[45]  M. Daly,et al.  A Recurrent ERCC3 Truncating Mutation Confers Moderate Risk for Breast Cancer. , 2016, Cancer discovery.

[46]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[47]  C. Eng,et al.  Protean PTEN: form and function. , 2002, American journal of human genetics.

[48]  Raymond M. Moore,et al.  Evaluation of ACMG-Guideline-Based Variant Classification of Cancer Susceptibility and Non-Cancer-Associated Genes in Families Affected by Breast Cancer. , 2016, American journal of human genetics.

[49]  P. Kantoff,et al.  Mutation Detection in Patients With Advanced Cancer by Universal Sequencing of Cancer-Related Genes in Tumor and Normal DNA vs Guideline-Based Germline Testing , 2017, JAMA.

[50]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[51]  Quan Li,et al.  InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines. , 2017, American journal of human genetics.

[52]  David Haussler,et al.  The UCSC Genome Browser database: 2018 update , 2017, Nucleic Acids Res..

[53]  James P Evans,et al.  An informatics approach to analyzing the incidentalome , 2012, Genetics in Medicine.

[54]  Genica,et al.  Commonly studied single-nucleotide polymorphisms and breast cancer: Results from the Breast Cancer Association Consortium , 2006 .

[55]  Sanjeena Subedi,et al.  Genome-wide expression profiling of maize in response to individual and combined water and nitrogen stresses , 2013, BMC Genomics.

[56]  K. Offit Multigene Testing for Hereditary Cancer: When, Why, and How. , 2017, The Journal of the National Comprehensive Cancer Network.

[57]  Keith Nykamp,et al.  Sherloc: a comprehensive refinement of the ACMG–AMP variant classification criteria , 2017, Genetics in Medicine.

[58]  Justin C. Fay,et al.  Identification of deleterious mutations within three human genomes. , 2009, Genome research.

[59]  R. Gibbs,et al.  Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. , 2015, Human molecular genetics.

[60]  M. Daly,et al.  A Recurrent ERCC 3 Truncating Mutation Confers Moderate Risk for Breast Cancer , 2016 .

[61]  E. Boerwinkle,et al.  dbNSFP v2.0: A Database of Human Non‐synonymous SNVs and Their Functional Predictions and Annotations , 2013, Human mutation.

[62]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[63]  Dong Liang,et al.  PALB2,CHEK2 and ATM rare variants and cancer risk: Data from COGS , 2016 .

[64]  Sharon E. Plon,et al.  Evaluation of in silico algorithms for use with ACMG/AMP clinical variant interpretation guidelines , 2017, Genome Biology.

[65]  Matthew S. Lebo,et al.  Performance of ACMG-AMP Variant-Interpretation Guidelines among Nine Laboratories in the Clinical Sequencing Exploratory Research Consortium. , 2016, American journal of human genetics.

[66]  Xiaolei Zhang,et al.  ClinVar data parsing , 2017, Wellcome open research.

[67]  Vijai Joseph,et al.  Conflicting Interpretation of Genetic Variants and Cancer Risk by Commercial Laboratories as Assessed by the Prospective Registry of Multiplex Testing , 2016, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[68]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[69]  Ahmet Zehir,et al.  Inherited DNA-Repair Gene Mutations in Men with Metastatic Prostate Cancer. , 2016, The New England journal of medicine.

[70]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[71]  Steven N. Hart,et al.  Collaborative science in the next-generation sequencing era: a viewpoint on how to combine exome sequencing data across sites to identify novel disease susceptibility genes , 2016, Briefings Bioinform..

[72]  A. Knudson Mutation and cancer: statistical study of retinoblastoma. , 1971, Proceedings of the National Academy of Sciences of the United States of America.

[73]  Georgia Chenevix-Trench,et al.  An international initiative to identify genetic modifiers of cancer risk in BRCA1 and BRCA2 mutation carriers: the Consortium of Investigators of Modifiers of BRCA1 and BRCA2 (CIMBA) , 2007, Breast Cancer Research.

[74]  Paul D.P. Pharoah,et al.  Commonly studied single-nucleotide polymorphisms and breast cancer: results from the Breast Cancer Association Consortium. , 2007, Journal of the National Cancer Institute.

[75]  P. Dundr,et al.  The PALB2 Gene Is a Strong Candidate for Clinical Testing in BRCA1- and BRCA2-Negative Hereditary Breast Cancer , 2013, Cancer Epidemiology, Biomarkers & Prevention.

[76]  E. Boerwinkle,et al.  dbNSFP: A Lightweight Database of Human Nonsynonymous SNPs and Their Functional Predictions , 2011, Human mutation.

[77]  Eric Boerwinkle,et al.  In silico prediction of splice-altering single nucleotide variants in the human genome , 2014, Nucleic acids research.