Subgrouping Rare Disease Patients Leveraging the Human Phenotype Ontology Embeddings

In the USA, rare diseases are defined as those affecting fewer than 200,000 patients at any given time. It usually takes substantial time and long journey for rare disease patients to seek care before receiving a correct diagnosis. Making the right phenotypic characterization is the initial step to speed up such differential diagnosis at early time and the Human Phenotype Ontology (HPO) is a comprehensive knowledgebase supporting this goal. Previously, we have constructed various node embeddings for the HPO incorporating heterogeneous biomedical knowledge repositories. In this study, we applied unsupervised learning strategies over different HPO embeddings, aiming to further subgroup rare disease patients based on phenotypic characterizations.

[1]  Peng Liu,et al.  VDBSCAN: Varied Density Based Spatial Clustering of Applications with Noise , 2007, 2007 International Conference on Service Systems and Service Management.

[2]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[3]  P. Robinson,et al.  The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. , 2008, American journal of human genetics.

[4]  Ashish Sharma,et al.  An Enhanced Density Based Spatial Clustering of Applications with Noise , 2009, 2009 IEEE International Advance Computing Conference.

[5]  M. Field,et al.  Rare Diseases and Orphan Products , 2010 .

[6]  M. Field,et al.  Rare Diseases and Orphan Products: Accelerating Research and Development , 2010 .

[7]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[8]  Hongfang Liu,et al.  Phenotypic Analysis of Clinical Narratives Using Human Phenotype Ontology , 2020, MedInfo.

[9]  Hongfang Liu,et al.  Constructing Node Embeddings for Human Phenotype Ontology to Assist Phenotypic Similarity Measurement , 2018, 2018 IEEE International Conference on Healthcare Informatics Workshop (ICHI-W).

[10]  Feichen Shen,et al.  HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology , 2019, J. Biomed. Informatics.

[11]  Feichen Shen,et al.  Rare disease knowledge enrichment through a data-driven approach , 2019, BMC Medical Informatics and Decision Making.