Data-driven unbiased curation of the TP53 tumor suppressor gene mutation database and validation by ultradeep sequencing of human tumors

Cancer mutation databases are expected to play central roles in personalized medicine by providing targets for drug development and biomarkers to tailor treatments to each patient. The accuracy of reported mutations is a critical issue that is commonly overlooked, which leads to mutation databases that include a sizable number of spurious mutations, either sequencing errors or passenger mutations. Here we report an analysis of the latest version of the TP53 mutation database, including 34,453 mutations. By using several data-driven methods on multiple independent quality criteria, we obtained a quality score for each report contributing to the database. This score can now be used to filter for high-confidence mutations and reports within the database. Sequencing the entire TP53 gene from various types of cancer using next-generation sequencing with ultradeep coverage validated our approach for curation. In summary, 9.7% of all collected studies, mostly comprising numerous tumors with multiple infrequent TP53 mutations, should be excluded when analyzing TP53 mutations. Thus, by combining statistical and experimental analyses, we provide a curated mutation database for TP53 mutations and a framework for mutation database analysis.

[1]  C. Harris,et al.  Clinical outcomes and correlates of TP53 mutations and cancer. , 2010, Cold Spring Harbor perspectives in biology.

[2]  G. Giaccone,et al.  EGFR and K-ras Mutation Analysis in Non-Small Cell Lung Cancer: Comparison of Paraffin Embedded versus Frozen Specimens , 2007, Cellular oncology : the official journal of the International Society for Cellular Oncology.

[3]  S. Kern,et al.  Multiple-criterion evaluation of reported mutations: A proposed scoring system for the intragenic somatic mutation literature , 2006, Cancer biology & therapy.

[4]  A. Marchetti,et al.  Assessing EGFR mutations. , 2006, The New England journal of medicine.

[5]  P. Platzer,et al.  Breast-cancer stromal cells with TP53 mutations and nodal metastases. , 2007, The New England journal of medicine.

[6]  P. Nederlof,et al.  High incidence of protein-truncating TP53 mutations in BRCA1-related breast cancer. , 2009, Cancer research.

[7]  S. Kato,et al.  The UMD TP53 database and website: update and revisions , 2006, Human mutation.

[8]  I. Haviv,et al.  Breast-Cancer Stromal Cells with TP 53 Mutations , 2008 .

[9]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[10]  C. Ishioka,et al.  Locus-specific mutation databases: pitfalls and good practice based on the p53 experience , 2006, Nature Reviews Cancer.

[11]  S. Kern,et al.  Elegance, silence and nonsense in the mutations literature for solid tumors , 2006, Cancer biology & therapy.

[12]  Magali Olivier,et al.  TP53 mutations in human cancers: origins, consequences, and clinical use. , 2010, Cold Spring Harbor perspectives in biology.

[13]  I. Haviv,et al.  Breast-cancer stromal cells with TP53 mutations. , 2008, The New England journal of medicine.

[14]  S. Kato,et al.  Reassessment of the TP53 mutation database in human disease by data mining with a library of TP53 missense mutations , 2005, Human mutation.

[15]  S. Gabriel,et al.  Advances in understanding cancer genomes through second-generation sequencing , 2010, Nature Reviews Genetics.

[16]  R J DUBOS,et al.  Health and disease. , 1960, JAMA.

[17]  S. Chanock,et al.  The devil is in the DNA , 2007, Nature Genetics.

[18]  T. Frebourg,et al.  Metastatic colorectal cancer KRAS genotyping in routine practice: results and pitfalls , 2011, Modern Pathology.

[19]  Thierry Soussi,et al.  TP53 tumor suppressor gene: A model for investigating human mutagenesis , 1992, Genes, chromosomes & cancer.

[20]  A. Inga,et al.  Functional mutants of the sequence-specific transcription factor p53 and implications for master genes of diversity , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[21]  T. Soussi,et al.  TP53 Mutations in Human Cancer: Database Reassessment and Prospects for the Next Decade , 2014, Human mutation.

[22]  S. Kato,et al.  Meta-analysis of the p53 Mutation Database for Mutant p53 Biological Activity Reveals a Methodologic Bias in Mutation Detection , 2006, Clinical Cancer Research.

[23]  A. Fersht,et al.  The tumor suppressor p53: from structures to drug discovery. , 2010, Cold Spring Harbor perspectives in biology.

[24]  Karen H. Vousden,et al.  p53 in health and disease , 2007, Nature Reviews Molecular Cell Biology.

[25]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[26]  F. Pontén,et al.  A high frequency of sequence alterations is due to formalin fixation of archival specimens. , 1999, The American journal of pathology.

[27]  S. Kato,et al.  Understanding the function–structure and function–mutation relationships of p53 tumor suppressor protein by high-resolution missense mutation analysis , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  B. Vogelstein,et al.  p53 mutations in human cancers. , 1991, Science.

[29]  Thierry Soussi,et al.  Analysis of p53 mutation status in human cancer cell lines: a paradigm for cell line cross-contamination , 2008, Cancer biology & therapy.

[30]  J. Halgunset,et al.  Low copy number DNA template can render polymerase chain reaction error prone in a sequence-dependent manner. , 2005, The Journal of molecular diagnostics : JMD.

[31]  M. Stratton,et al.  The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website , 2004, British Journal of Cancer.

[32]  T. Soussi Advances in carcinogenesis: a historical perspective from observational studies to tumor genome sequencing and TP53 mutation spectrum analysis. , 2011, Biochimica et biophysica acta.

[33]  A. Fersht,et al.  Structure–function–rescue: the diverse nature of common p53 cancer mutants , 2007, Oncogene.

[34]  Kylie L. Gorringe,et al.  No evidence of clonal somatic genetic alterations in cancer-associated fibroblasts from human breast and ovarian carcinomas , 2008, Nature Genetics.