Integrative annotation and knowledge discovery of kinase post-translational modifications and cancer-associated mutations through federated protein ontologies and resources

Many bioinformatics resources with unique perspectives on the protein landscape are currently available. However, generating new knowledge from these resources requires interoperable workflows that support cross-resource queries. In this study, we employ federated queries linking information from the Protein Kinase Ontology, iPTMnet, Protein Ontology, neXtProt, and the Mouse Genome Informatics to identify key knowledge gaps in the functional coverage of the human kinome and prioritize understudied kinases, cancer variants and post-translational modifications (PTMs) for functional studies. We identify 32 functional domains enriched in cancer variants and PTMs and generate mechanistic hypotheses on overlapping variant and PTM sites by aggregating information at the residue, protein, pathway and species level from these resources. We experimentally test the hypothesis that S768 phosphorylation in the C-helix of EGFR is inhibitory by showing that oncogenic variants altering S768 phosphorylation increase basal EGFR activity. In contrast, oncogenic variants altering conserved phosphorylation sites in the ‘hydrophobic motif’ of PKCβII (S660F and S660C) are loss-of-function in that they reduce kinase activity and enhance membrane translocation. Our studies provide a framework for integrative, consistent, and reproducible annotation of the cancer kinomes.

[1]  Krys J. Kochut,et al.  SPARQLing: A Graphical Interface for SPARQL , 2015, International Semantic Web Conference.

[2]  A. Newton,et al.  Protein kinase C beta II suppresses colorectal cancer by regulating IGF-1 mediated cell survival , 2016, Oncotarget.

[3]  Yili Hong,et al.  On computing the distribution function for the Poisson binomial distribution , 2013, Comput. Stat. Data Anal..

[4]  S. Gabriel,et al.  Pan-cancer patterns of somatic copy-number alteration , 2013, Nature Genetics.

[5]  U. Kikkawa,et al.  Protein kinase C in melanoma , 2005, Cancer and Metastasis Reviews.

[6]  M. Moran,et al.  The human phosphotyrosine signaling network: Evolution and hotspots of hijacking in cancer , 2012, Genome research.

[7]  John Kuriyan,et al.  An Allosteric Mechanism for Activation of the Kinase Domain of Epidermal Growth Factor Receptor , 2006, Cell.

[8]  J. Schlessinger,et al.  Signaling by Receptor Tyrosine Kinases , 1993 .

[9]  Jian Zhang,et al.  Protein Ontology (PRO): enhancing and scaling up the representation of protein entities , 2016, Nucleic Acids Res..

[10]  Andrew F Neuwald,et al.  The hallmark of AGC kinase functional divergence is its C-terminal tail, a cis-acting regulatory module , 2007, Proceedings of the National Academy of Sciences.

[11]  Dietrich Rebholz-Schuhmann,et al.  BioFed: federated query processing over life sciences linked open data , 2017, J. Biomed. Semant..

[12]  Shima Dastgheib,et al.  KinView: a visual comparative sequence analysis tool for integrated kinome research. , 2016, Molecular bioSystems.

[13]  N. Kannan,et al.  Mechanistic Insights into R776H Mediated Activation of Epidermal Growth Factor Receptor Kinase. , 2015, Biochemistry.

[14]  E. Wang,et al.  Predictive genomics: a cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data. , 2014, Seminars in cancer biology.

[15]  L. L. Cam,et al.  An approximation theorem for the Poisson binomial distribution. , 1960 .

[16]  N. Kannan,et al.  Computational and Experimental Characterization of Patient Derived Mutations Reveal an Unusual Mode of Regulatory Spine Assembly and Drug Sensitivity in EGFR Kinase. , 2017, Biochemistry.

[17]  Ricardo Villamarín-Salomón,et al.  ClinVar: public archive of interpretations of clinically relevant variants , 2015, Nucleic Acids Res..

[18]  Zhiyong Lu,et al.  PubTator: a web-based text mining tool for assisting biocuration , 2013, Nucleic Acids Res..

[19]  Runjun D. Kumar,et al.  Analysis of somatic mutations across the kinome reveals loss-of-function mutations in multiple cancer types , 2017, Scientific Reports.

[20]  Martin Ringwald,et al.  Mouse Genome Informatics (MGI): Resources for Mining Mouse Genetic, Genomic, and Biological Data in Support of Primary and Translational Research. , 2017, Methods in molecular biology.

[21]  A. Bardelli,et al.  Genetic analysis of the kinome and phosphatome in cancer , 2005, Cellular and Molecular Life Sciences CMLS.

[22]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[23]  C. Perou,et al.  Allele-specific copy number analysis of tumors , 2010, Proceedings of the National Academy of Sciences.

[24]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[25]  Akhilesh Pandey,et al.  Identifying novel targets of oncogenic EGF receptor signaling in lung cancer through global phosphoproteomics , 2015, Proteomics.

[26]  Roger Y. Tsien,et al.  A genetically encoded fluorescent reporter reveals oscillatory phosphorylation by protein kinase C , 2003, The Journal of cell biology.

[27]  Eric T. Kim,et al.  Oncogenic Mutations Counteract Intrinsic Disorder in the EGFR Kinase and Promote Receptor Dimerization , 2012, Cell.

[28]  Emily Kang,et al.  Cancer-Associated Protein Kinase C Mutations Reveal Kinase’s Role as Tumor Suppressor , 2015, Cell.

[29]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..

[30]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[31]  Susan S. Taylor,et al.  ProKinO: A Unified Resource for Mining the Cancer Kinome , 2014, Human mutation.

[32]  J. Violin,et al.  Intramolecular conformational changes optimize protein kinase C signaling. , 2014, Chemistry & biology.

[33]  Rajarshi Guha,et al.  Pharos: Collating protein information to shed light on the druggable genome , 2016, Nucleic Acids Res..

[34]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[35]  Amos Bairoch,et al.  The neXtProt knowledgebase on human proteins: 2017 update , 2016, Nucleic Acids Res..

[36]  A. Newton,et al.  Targeting Protein Kinase C Activity Reporter to Discrete Intracellular Regions Reveals Spatiotemporal Differences in Agonist-dependent Signaling* , 2006, Journal of Biological Chemistry.

[37]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[38]  J. Schlessinger Cell Signaling by Receptor Tyrosine Kinases , 2000, Cell.

[39]  S. Swamy,et al.  PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data , 2009, Biostatistics.

[40]  HER2 Overexpression and amplification is present in a subset of ovarian mucinous carcinomas and can be targeted with trastuzumab therapy , 2010 .

[41]  N. Schork,et al.  Kinase mutations in human disease: interpreting genotype–phenotype relationships , 2010, Nature Reviews Genetics.

[42]  M. Clausen,et al.  FDA-approved small-molecule kinase inhibitors. , 2015, Trends in pharmacological sciences.

[43]  C. Lindskog,et al.  A pathology atlas of the human cancer transcriptome , 2017, Science.

[44]  Susan S. Taylor,et al.  Regulation of protein kinases; controlling activity through activation segment conformation. , 2004, Molecular cell.

[45]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[46]  Philippe Broët,et al.  HER2 Status in Ovarian Carcinomas: A Multicenter GINECO Study of 320 Patients , 2007, PloS one.

[47]  T. Fukagawa,et al.  CENP‐R acts bilaterally as a tumor suppressor and as an oncogene in the two‐stage skin carcinogenesis model , 2017, Cancer science.

[48]  Michael Q. Zhang,et al.  Identification of Tumor Suppressors and Oncogenes from Genomic and Epigenetic Features in Ovarian Cancer , 2011, PloS one.

[49]  Friedrich Rippmann,et al.  KinMap: a web-based tool for interactive navigation through human kinome data , 2017, BMC Bioinformatics.

[50]  Nicole Tourigny,et al.  Bio2RDF: Towards a mashup to build bioinformatics knowledge systems , 2008, J. Biomed. Informatics.

[51]  Cathy H. Wu,et al.  iPTMnet: Integrative Bioinformatics for Studying PTM Networks. , 2017, Methods in molecular biology.