Subtyping: What It is and Its Role in Precision Medicine

Precision medicine is an emerging approach that considers variability in genes, environment, and lifestyle in order to better treat individuals. This article gives an overview of the diverse approaches to subtyping, from early accounts based on clinical practice to more recent approaches that focus on computationally derived subtypes based on molecular and electronic health record (EHR) data. This field is expansive and growing rapidly; the authors juxtapose approaches taken by different communities and highlight examples of significant open computational problems.

[1]  J. Pathak,et al.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[2]  Jennifer Jie Xu,et al.  Knowledge Discovery and Data Mining , 2014, Computing Handbook, 3rd ed..

[3]  George Hripcsak,et al.  Next-generation phenotyping of electronic health records , 2012, J. Am. Medical Informatics Assoc..

[4]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Suchi Saria,et al.  A $3 Trillion Challenge to Computational Scientists: Transforming Healthcare Delivery , 2014, IEEE Intelligent Systems.

[6]  Adnan Custovic,et al.  Asthma endotypes: a new approach to classification of disease entities within the asthma syndrome. , 2011, The Journal of allergy and clinical immunology.

[7]  D. Mould Models for Disease Progression: New Approaches and Uses , 2012, Clinical pharmacology and therapeutics.

[8]  J. Ewing Diffuse endothelioma of bone , 1972 .

[9]  Melissa A. Basford,et al.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data , 2013, Nature Biotechnology.

[10]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[11]  T. Robbins,et al.  Heterogeneity of Parkinson’s disease in the early clinical stages using a data driven approach , 2005, Journal of Neurology, Neurosurgery & Psychiatry.

[12]  C. Chute,et al.  Electronic Medical Records for Genetic Research: Results of the eMERGE Consortium , 2011, Science Translational Medicine.

[13]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[14]  Yuan Qi,et al.  Integrated Genomic Analysis Identifies Clinically Relevant Subtypes of Glioblastoma Characterized by Abnormalities in PDGFRA , IDH 1 , EGFR , and NF 1 Citation Verhaak , 2010 .

[15]  David T. W. Jones,et al.  Hotspot mutations in H3F3A and IDH1 define distinct epigenetic and biological subgroups of glioblastoma. , 2012, Cancer cell.

[16]  W. Chung,et al.  Defining a comprehensive verotype using electronic health records for personalized medicine. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[17]  I. Kohane Using electronic health records to drive discovery in disease genomics , 2011, Nature Reviews Genetics.

[18]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[19]  Atul J. Butte,et al.  Clinical Arrays of Laboratory Measures, or "Clinarrays", Built from an Electronic Health Record Enable Disease Subtyping by Severity , 2007, AMIA.

[20]  Zoubin Ghahramani,et al.  Bayesian correlated clustering to integrate multiple datasets , 2012, Bioinform..

[21]  L. Hood,et al.  Predictive, personalized, preventive, participatory (P4) cancer medicine , 2011, Nature Reviews Clinical Oncology.

[22]  Finale Doshi-Velez,et al.  Comorbidity Clusters in Autism Spectrum Disorders: An Electronic Health Record Time-Series Analysis , 2014, Pediatrics.

[23]  Ara Darzi,et al.  Preparing for precision medicine. , 2012, The New England journal of medicine.

[24]  Jimeng Sun,et al.  Marble: high-throughput phenotyping from electronic health records via sparse nonnegative tensor factorization , 2014, KDD.

[25]  Suchi Saria,et al.  Combining Structured and Free-text Data for Automatic Coding of Patient Outcomes. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[26]  M. Ritchie,et al.  Methods of integrating data to uncover genotype–phenotype interactions , 2015, Nature Reviews Genetics.

[27]  A. Frigessi,et al.  Principles and methods of integrative genomic analyses in cancer , 2014, Nature Reviews Cancer.

[28]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[29]  Suchi Saria,et al.  Clustering Longitudinal Clinical Marker Trajectories from Electronic Health Data: Applications to Phenotyping and Endotype Discovery , 2015, AAAI.

[30]  K. Aldape,et al.  Integrated array-comparative genomic hybridization and expression array profiles identify clinically relevant molecular subtypes of glioblastoma. , 2005, Cancer research.