GeneYenta: A Phenotype­Based Rare Disease Case Matching Tool Based on Online Dating Algorithms for the Acceleration of Exome Interpretation

Advances in next‐generation sequencing (NGS) technologies have helped reveal causal variants for genetic diseases. In order to establish causality, it is often necessary to compare genomes of unrelated individuals with similar disease phenotypes to identify common disrupted genes. When working with cases of rare genetic disorders, finding similar individuals can be extremely difficult. We introduce a web tool, GeneYenta, which facilitates the matchmaking process, allowing clinicians to coordinate detailed comparisons for phenotypically similar cases. Importantly, the system is focused on phenotype annotation, with explicit limitations on highly confidential data that create barriers to participation. The procedure for matching of patient phenotypes, inspired by online dating services, uses an ontologybased semantic case matching algorithm with attribute weighting. We evaluate the capacity of the system using a curated reference data set and 19 clinician entered cases comparing four matching algorithms. We find that the inclusion of clinician weights can augment phenotype matching.

[1]  Dan Ariely,et al.  What makes you click?—Mate preferences in online dating , 2010 .

[2]  A. Adeyemo,et al.  Ethical and legal implications of whole genome and whole exome sequencing in African populations , 2013, BMC medical ethics.

[3]  P. Shannon,et al.  Exome sequencing identifies the cause of a Mendelian disorder , 2009, Nature Genetics.

[4]  Bradley P. Coe,et al.  Copy number variation detection and genotyping from exome sequence data , 2012, Genome research.

[5]  Morris A. Swertz,et al.  OntoCAT -- simple ontology search and integration in Java, R and REST/JavaScript , 2011, BMC Bioinformatics.

[6]  Christian Gilissen,et al.  A Post‐Hoc Comparison of the Utility of Sanger Sequencing and Exome Sequencing for the Diagnosis of Heterogeneous Diseases , 2013, Human mutation.

[7]  Christian Gilissen,et al.  Disease gene identification strategies for exome sequencing , 2012, European Journal of Human Genetics.

[8]  Alison M. Meynert,et al.  Variant detection sensitivity and biases in whole genome and exome sequencing , 2014, BMC Bioinformatics.

[9]  Judy Kay,et al.  RECON: a reciprocal recommender for online dating , 2010, RecSys '10.

[10]  Damian Smedley,et al.  Improved exome prioritization of disease genes through cross-species phenotype comparison , 2014, Genome research.

[11]  S. Mundlos,et al.  The Human Phenotype Ontology , 2010, Clinical genetics.

[12]  Michael Brudno,et al.  PhenoTips: Patient Phenotyping Software for Clinical and Research Use , 2013, Human mutation.

[13]  Emily H Turner,et al.  Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome , 2010, Nature Genetics.

[14]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[15]  Sihem Amer-Yahia,et al.  Relevance and ranking in online dating systems , 2010, SIGIR.

[16]  Marcel H. Schulz,et al.  Clinical diagnostics in human genetics with semantic similarity searches in ontologies. , 2009, American journal of human genetics.

[17]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[18]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[19]  Emily H Turner,et al.  Targeted Capture and Massively Parallel Sequencing of Twelve Human Exomes , 2009, Nature.