Leveraging a Joint of Phenotypic and Genetic Features on Cancer Patient Subgrouping

Background: Cancer is responsible for millions of deaths worldwide every year. Although significant progress has been achieved in cancer medicine, many issues remain to be addressed for improving cancer therapy. Appropriate cancer patient stratification is the prerequisite for selecting appropriate treatment plan, as cancer patients are of known heterogeneous genetic make-ups and phenotypic differences. In this study, built upon deep phenotypic characterizations extractable from Mayo Clinic electronic health records (EHRs) and genetic test reports for a collection of cancer patients, we developed a system leveraging a joint of phenotypic and genetic features for cancer patient subgrouping.

[1]  Bruce M Altevogt,et al.  Improving and Accelerating Therapeutic Development for Nervous System Disorders: Workshop Summary , 2014 .

[2]  Joshua L. Deignan,et al.  ACMG clinical laboratory standards for next-generation sequencing , 2013, Genetics in Medicine.

[3]  Feichen Shen,et al.  HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology , 2019, J. Biomed. Informatics.

[4]  Funda Meric-Bernstam,et al.  Differential Response to Neoadjuvant Chemotherapy Among 7 Triple-Negative Breast Cancer Molecular Subtypes , 2013, Clinical Cancer Research.

[5]  S. Lenk,et al.  TP53 gene mutations in prostate cancer progression. , 2010, Anticancer research.

[6]  X. Chen,et al.  Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. , 2011, The Journal of clinical investigation.

[7]  S. Soddu,et al.  Variants of uncertain significance in the era of high-throughput genome sequencing: a lesson from breast and ovary cancers , 2020, Journal of Experimental & Clinical Cancer Research.

[8]  Timothy A. Miller,et al.  DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records. , 2017, Cancer research.

[9]  B. Piperdi,et al.  KRAS Mutation in Colon Cancer: A Marker of Resistance to EGFR-I Therapy , 2009, Annals of Surgical Oncology.

[10]  C. Sawyers,et al.  Targeted cancer therapy , 2004, Nature.

[11]  Gilad Mishne,et al.  LEAP: Using machine learning to support variant classification in a clinical setting , 2020, Human mutation.

[12]  Yi Shi,et al.  DeepGene: an advanced cancer type classifier based on deep learning and somatic point mutations , 2016, BMC Bioinformatics.

[13]  Deborah Schrag,et al.  Precision Oncology: Who, How, What, When, and When Not? , 2017, American Society of Clinical Oncology educational book. American Society of Clinical Oncology. Annual Meeting.

[14]  Harry Hochheiser,et al.  Abstract 5114: Deep phenotype extraction to facilitate cancer research: Extending DeepPhe to ovarian cancer , 2019, Bioinformatics, Convergence Science, and Systems Biology.

[15]  Riccardo Bellazzi,et al.  Patient similarity for precision medicine: A systematic review , 2018, J. Biomed. Informatics.

[16]  Frank McCormick,et al.  KRAS as a Therapeutic Target , 2015, Clinical Cancer Research.

[17]  Hongfang Liu,et al.  Constructing Node Embeddings for Human Phenotype Ontology to Assist Phenotypic Similarity Measurement , 2018, 2018 IEEE International Conference on Healthcare Informatics Workshop (ICHI-W).

[18]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[19]  N. Nonomura,et al.  Role of Androgen Receptor in Prostate Cancer: A Review , 2018, The world journal of men's health.

[20]  P. Robinson,et al.  The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. , 2008, American journal of human genetics.

[21]  Deborah Schrag,et al.  Precision Oncology: Who, How, What, When, and When Not? , 2017, American Society of Clinical Oncology educational book. American Society of Clinical Oncology. Annual Meeting.

[22]  Sijia Liu,et al.  Desiderata for delivering NLP to accelerate healthcare AI advancement and a Mayo Clinic NLP-as-a-service implementation , 2019, npj Digital Medicine.

[23]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[24]  Gil Alterovitz,et al.  Recommendations for patient similarity classes: results of the AMIA 2019 workshop on defining patient similarity , 2020, J. Am. Medical Informatics Assoc..

[25]  Michael Feuerstein,et al.  Defining cancer survivorship , 2007, Journal of cancer survivorship : research and practice.

[26]  Sanda M Harabagiu,et al.  Adversarial Learning of Knowledge Embeddings for the Unified Medical Language System. , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[27]  Jaewoo Kang,et al.  Mut2Vec: distributed representation of cancerous mutations , 2018, BMC Medical Genomics.

[28]  Kenji Mase,et al.  How to initialize the CNN for small datasets: Extracting discriminative filters from pre-trained model , 2015, 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).

[29]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[30]  Hongfang Liu,et al.  Phenotypic Analysis of Clinical Narratives Using Human Phenotype Ontology , 2020, MedInfo.

[31]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[32]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.