Diagnosis of genetic diseases in seriously ill children by rapid whole-genome sequencing and automated phenotyping and interpretation

Automated phenotyping and interpretation of rapid whole-genome sequencing improve time to diagnosis of genetic diseases in hospitalized children. A streamlined genetic diagnosis pipeline When treating seriously ill children, time is of the essence. Clark et al. built an automated pipeline to analyze EHR data and genome sequencing data from dried blood spots to deliver a potential diagnosis for hospitalized, often critically ill, children with suspected genetic diseases. Their pipeline required minimal user intervention, increasing usability and shortening time to diagnosis, delivering a provisional finding in a median time of less than 24 hours. Although this pipeline would need to be adapted for use at different hospital systems, such an automated tool could aid clinicians to expedite an accurate genetic disease diagnosis, potentially hastening lifesaving changes to patient care. By informing timely targeted treatments, rapid whole-genome sequencing can improve the outcomes of seriously ill children with genetic diseases, particularly infants in neonatal and pediatric intensive care units (ICUs). The need for highly qualified professionals to decipher results, however, precludes widespread implementation. We describe a platform for population-scale, provisional diagnosis of genetic diseases with automated phenotyping and interpretation. Genome sequencing was expedited by bead-based genome library preparation directly from blood samples and sequencing of paired 100-nt reads in 15.5 hours. Clinical natural language processing (CNLP) automatically extracted children’s deep phenomes from electronic health records with 80% precision and 93% recall. In 101 children with 105 genetic diseases, a mean of 4.3 CNLP-extracted phenotypic features matched the expected phenotypic features of those diseases, compared with a match of 0.9 phenotypic features used in manual interpretation. We automated provisional diagnosis by combining the ranking of the similarity of a patient’s CNLP phenome with respect to the expected phenotypic features of all genetic diseases, together with the ranking of the pathogenicity of all of the patient’s genomic variants. Automated, retrospective diagnoses concurred well with expert manual interpretation (97% recall and 99% precision in 95 children with 97 genetic diseases). Prospectively, our platform correctly diagnosed three of seven seriously ill ICU infants (100% precision and recall) with a mean time saving of 22:19 hours. In each case, the diagnosis affected treatment. Genome sequencing with automated phenotyping and interpretation in a median of 20:10 hours may increase adoption in ICUs and, thereby, timely implementation of precise treatments.

John Reynders | Martin G. Reese | Patrick Mulrooney | Matthew N. Bainbridge | Shimul Chowdhury | Stephen F. Kingsmore | Julie Ryu | Paul D. McDonagh | Christina Clarke | Peter Schols | Calum Yacoubian | Richard Gain | Alison Frith | Thomas Defay | S. Kingsmore | K. Hall | M. G. Reese | M. Bainbridge | P. McDonagh | Julie A Cakici | Serge Batalov | N. Veeraraghavan | L. Farnaes | S. Chowdhury | D. Dimmock | Kristen Wigby | Michelle M Clark | Amber Hildreth | Yan Ding | K. Watkins | Katarzyna A Ellsworth | Brandon Camp | C. Kint | C. Yacoubian | Curtis Beebe | Joshua J. A. Braun | Margaret Bray | Jeanne Carroll | Sara A Caylor | C. Clarke | Mitchell P. Creed | J. Friedman | A. Frith | Richard Gain | Mary Gaughran | Shauna George | S. Gilmer | J. Gleeson | Jeremy Gore | H. Grunenwald | Raymond L Hovey | Marie L Janes | Kejia Lin | K. McBride | Patrick Mulrooney | S. Nahas | D. Oh | A. Oriol | L. Puckett | Z. Rady | J. Ryu | Lisa Salz | E. Sanford | Lawrence Stewart | Nathaly M Sweeney | M. Tokita | Luca Van Der Kraan | Sarah White | Brett Williams | Terence Wong | M. Wright | Catherine Yamada | P. Schols | J. Reynders | Thomas Defay | Meredith S. Wright | Narayanan Veeraraghavan | Albert Oriol | Haiying Grunenwald | David Dimmock | Jennifer Friedman | Mari Tokita | Lauge Farnaes | Shareef Nahas | Sarah White | Yan Ding | Jeanne Carroll | Erica Sanford | Joseph Gleeson | Kevin Hall | Michelle M. Clark | Amber Hildreth | Sergey Batalov | Kelly Watkins | Katarzyna Ellsworth | Brandon Camp | Cyrielle I. Kint | Curtis Beebe | Margaret Bray | Julie A. Cakici | Sara A. Caylor | Mary Gaughran | Shauna George | Sheldon Gilmer | Jeremy Gore | Raymond L. Hovey | Marie L. Janes | Kejia Lin | Kyle McBride | Daeheon Oh | Laura Puckett | Zia Rady | Lisa Salz | Lawrence Stewart | Nathaly Sweeney | Luca Van Der Kraan | Kristen Wigby | Brett Williams | Terence Wong | Catherine Yamada | L. Van Der Kraan | Nathaly M. Sweeney | K. Wigby | Terence C Wong | S. Chowdhury | Haiying Grunenwald | S. White

[1]  Ernest Turro,et al.  ontologyX: a suite of R packages for working with ontological data , 2016, Bioinform..

[2]  E. Palmaer,et al.  Clinical diagnostic exome evaluation for an infant with a lethal disorder: genetic diagnosis of TARP syndrome and expansion of the phenotype in a patient with a newly reported RBM10 alteration , 2017, BMC Medical Genetics.

[3]  Gunnar E. Carlsson,et al.  Topology and data , 2009 .

[4]  The NSIGHT1 Randomized Controlled Trial: Rapid Whole Genome Sequencing for Accelerated Etiologic Diagnosis in Critically Ill Infants , 2017 .

[5]  Anita Burgun-Parenthoine,et al.  A clinician friendly data warehouse oriented toward narrative reports: Dr. Warehouse , 2018, J. Biomed. Informatics.

[6]  S. Kingsmore,et al.  Rapid whole-genome sequencing identifies a novel GABRA1 variant associated with West syndrome , 2017, Cold Spring Harbor molecular case studies.

[7]  Quan Li,et al.  InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines. , 2017, American journal of human genetics.

[8]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[9]  Laurie D. Smith,et al.  The NSIGHT1-randomized controlled trial: rapid whole-genome sequencing for accelerated etiologic diagnosis in critically ill infants , 2017, npj Genomic Medicine.

[10]  Xiaoyu Chen,et al.  Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications , 2016, Bioinform..

[11]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[12]  A. Burgun,et al.  Next generation phenotyping using narrative reports in a rare disease clinical data warehouse , 2018, Orphanet Journal of Rare Diseases.

[13]  K. Boycott,et al.  Benchmarking outcomes in the Neonatal Intensive Care Unit: Cytogenetic and molecular diagnostic rates in a retrospective cohort , 2017, American journal of medical genetics. Part A.

[14]  M J Khoury,et al.  Contribution of birth defects and genetic diseases to pediatric hospitalizations. A population-based study. , 1997, Archives of pediatrics & adolescent medicine.

[15]  Shimul Chowdhury,et al.  Novel Factor XIII variant identified through whole-genome sequencing in a child with intracranial hemorrhage , 2018, Cold Spring Harbor molecular case studies.

[16]  Hannah Mandel Performance evaluation of a natural language processing tool to extract infectious disease problems , 2013 .

[17]  George Hripcsak,et al.  High-fidelity phenotyping: richness and freedom from bias , 2017, J. Am. Medical Informatics Assoc..

[18]  Allyn McConkie-Rosell,et al.  The utility of the traditional medical genetics diagnostic evaluation in the context of next-generation sequencing for undiagnosed genetic disorders , 2013, Genetics in Medicine.

[19]  M. Gerstein,et al.  CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. , 2011, Genome research.

[20]  Mark Yandell,et al.  VAAST 2.0: Improved Variant Classification and Disease-Gene Identification Using a Conservation-Controlled Amino Acid Substitution Matrix , 2013, Genetic epidemiology.

[21]  Laurie D. Smith,et al.  Rapid whole genome sequencing and precision neonatology. , 2015, Seminars in perinatology.

[22]  S. Kingsmore,et al.  Rapid whole-genome sequencing decreases infant morbidity and cost of hospitalization , 2018, npj Genomic Medicine.

[23]  Zornitza Stark,et al.  Meta-analysis of the diagnostic and clinical utility of genome and exome sequencing and chromosomal microarray in children with suspected genetic diseases , 2018, npj Genomic Medicine.

[24]  O. P. Kurganova,et al.  Dr. , 2019, D37. TOPICS IN GLOBAL HEALTH SERVICES RESEARCH.

[25]  Stephen F. Kingsmore,et al.  Rapid whole-genome sequencing identifies a novel AIRE variant associated with autoimmune polyendocrine syndrome type 1 , 2018, Cold Spring Harbor molecular case studies.

[26]  Xin Sun,et al.  Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence , 2019, Nature Medicine.

[27]  Rolf Schröder,et al.  Clinical exome sequencing: results from 2819 samples reflecting 1000 families , 2016, European Journal of Human Genetics.

[28]  Melbourne Genomics Health Alliance,et al.  Meeting the challenges of implementing rapid genomic testing in acute pediatric care , 2018, Genetics in Medicine.

[29]  B. Knoppers,et al.  Genome-wide sequencing in acutely ill infants: genomic medicine’s critical application? , 2018, Genetics in Medicine.

[30]  Boris Campillo-Gimenez,et al.  Full-text Automated Detection of Surgical Site Infections Secondary to Neurosurgery in Rennes, France , 2013, MedInfo.

[31]  Fan Xia,et al.  Use of Exome Sequencing for Infants in Intensive Care Units: Ascertainment of Severe Single-Gene Disorders and Effect on Medical Management , 2017, JAMA pediatrics.

[32]  S. Scherer,et al.  Periodic reanalysis of whole-genome sequencing data enhances the diagnostic advantage over standard clinical genetic testing , 2018, European Journal of Human Genetics.

[33]  Pedro M. Valero-Mora,et al.  ggplot2: Elegant Graphics for Data Analysis , 2010 .

[34]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[35]  Martin G Reese,et al.  Clinical analysis of genome next-generation sequencing data using the Omicia platform , 2013, Expert review of molecular diagnostics.

[36]  M. Bitner-Glindzicz,et al.  Rapid Paediatric Sequencing (RaPS): comprehensive real-life workflow for rapid diagnosis of critically ill children , 2018, Journal of Medical Genetics.

[37]  Matthew S. Lebo,et al.  Performance of ACMG-AMP Variant-Interpretation Guidelines among Nine Laboratories in the Clinical Sequencing Exploratory Research Consortium. , 2016, American journal of human genetics.

[38]  George Hripcsak,et al.  Deep Phenotyping on Electronic Health Records Facilitates Genetic Diagnosis by Clinical Exomes. , 2018, American journal of human genetics.

[39]  S. Kingsmore,et al.  Rapid whole-genome sequencing identifies a novel homozygous NPC1 variant associated with Niemann–Pick type C1 disease in a 7-week-old male with cholestasis , 2017, Cold Spring Harbor molecular case studies.

[40]  Gustavo Glusman,et al.  A unified test of linkage analysis and rare-variant association for analysis of pedigree sequence data , 2014, Nature Biotechnology.

[41]  S. Grosse,et al.  Inpatient Hospitalization Costs Associated with Birth Defects Among Persons of All Ages — United States, 2013 , 2017, MMWR. Morbidity and mortality weekly report.

[42]  E. Arias,et al.  Mortality in the United States, 2017. , 2018, NCHS data brief.

[43]  Jeffrey Braithwaite,et al.  Integrating Genomics into Healthcare: A Global Responsibility. , 2019, American journal of human genetics.

[44]  Laurie D. Smith,et al.  A 26-hour system of highly sensitive whole genome sequencing for emergency management of genetic diseases , 2015, Genome Medicine.

[45]  S. Mundlos,et al.  The Human Phenotype Ontology , 2010, Clinical genetics.

[46]  Laurie D. Smith,et al.  Whole-genome sequencing for identification of Mendelian disorders in critically ill infants: a retrospective analysis of diagnostic and clinical findings. , 2015, The Lancet. Respiratory medicine.

[47]  J. Hellmann,et al.  Predictors of mortality and length of stay for neonates admitted to children's hospital neonatal intensive care units , 2008, Journal of Perinatology.

[48]  M. Gubler,et al.  Autosomal dominant Alport's syndrome: study of a large Tunisian family. , 2006, Saudi Journal of Kidney Diseases and Transplantation.

[49]  Patrick Callier,et al.  Clinical whole-exome sequencing for the diagnosis of rare disorders with congenital anomalies and/or intellectual disability: substantial interest of prospective annual reanalysis , 2017, Genetics in Medicine.

[50]  A. Renieri,et al.  Autosomal-dominant Alport syndrome: natural history of a disease due to COL4A3 or COL4A4 gene. , 2004, Kidney international.

[51]  Cristina Has,et al.  Faculty of 1000 evaluation for Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. , 2018 .

[52]  Johan Larsson,et al.  eulerr: Area-Proportional Euler Diagrams with Ellipses , 2018 .

[53]  Matthew S. Lebo,et al.  Performance of ACMG-AMP Variant-Interpretation Guidelines among Nine Laboratories in the Clinical Sequencing Exploratory Research Consortium. , 2016, American journal of human genetics.

[54]  Jacek Majewski,et al.  Next-generation sequencing for diagnosis of rare diseases in the neonatal intensive care unit , 2016, Canadian Medical Association Journal.

[55]  M. Genuardi,et al.  Intersociety policy statement on the use of whole-exome sequencing in the critically ill newborn infant , 2017, Italian Journal of Pediatrics.

[56]  Olivier Bodenreider,et al.  Interoperability between phenotypes in research and healthcare terminologies—Investigating partial mappings between HPO and SNOMED CT , 2016, J. Biomed. Semant..

[57]  H. Peeters,et al.  A comprehensive clinical and genetic study in 127 patients with ID in Kinshasa, DR Congo , 2018, American journal of medical genetics. Part A.

[58]  Tomas W. Fitzgerald,et al.  Making new genetic diagnoses with old data: iterative reanalysis and reporting from genome-wide data in 1133 families with developmental disorders , 2017, Genetics in Medicine.

[59]  Anita Burgun-Parenthoine,et al.  Finding patients using similarity measures in a rare diseases-oriented clinical data warehouse: Dr. Warehouse and the needle in the needle stack , 2017, J. Biomed. Informatics.

[60]  J. Denny,et al.  Extracting research-quality phenotypes from electronic health records to support precision medicine , 2015, Genome Medicine.

[61]  Serge Batalov,et al.  Concomitant diagnosis of immune deficiency and Pseudomonas sepsis in a 19 month old with ecthyma gangrenosum by host whole-genome sequencing , 2018, Cold Spring Harbor molecular case studies.

[62]  J. Wallingford,et al.  White paper on the study of birth defects. , 2017, Birth Defects Research.

[63]  Dillon Y. Chen,et al.  Rapid Diagnosis of KCNQ2-Associated Early Infantile Epileptic Encephalopathy Improved Outcome. , 2018, Pediatric neurology.

[64]  Xuan Yuan,et al.  Effectiveness of exome and genome sequencing guided by acuity of illness for diagnosis of neurodevelopmental disorders , 2014, Science Translational Medicine.

[65]  J. Lantos,et al.  How infants die in the neonatal intensive care unit: trends from 1999 through 2008. , 2011, Archives of pediatrics & adolescent medicine.

[66]  G. Bejerano,et al.  Systematic reanalysis of clinical exome data yields additional diagnoses: implications for providers , 2016, Genetics in Medicine.

[67]  Y Hu,et al.  Global Responsibility , 2020, The International Encyclopedia of Higher Education Systems and Institutions.

[68]  Daniel G. MacArthur,et al.  The ExAC browser: displaying reference data information from over 60 000 exomes , 2016, bioRxiv.

[69]  Peter Szolovits,et al.  Genetic Misdiagnoses and the Potential for Health Disparities. , 2016, The New England journal of medicine.

[70]  Marcel H. Schulz,et al.  Clinical diagnostics in human genetics with semantic similarity searches in ontologies. , 2009, American journal of human genetics.

[71]  Laurie D. Smith,et al.  Whole-Exome Sequencing and Whole-Genome Sequencing in Critically Ill Neonates Suspected to Have Single-Gene Disorders. , 2016, Cold Spring Harbor perspectives in medicine.

[72]  Tudor Groza,et al.  The Human Phenotype Ontology in 2017 , 2016, Nucleic Acids Res..

[73]  Peter Saffrey,et al.  Rapid Whole-Genome Sequencing for Genetic Disease Diagnosis in Neonatal Intensive Care Units , 2012, Science Translational Medicine.

[74]  Brett J. Kennedy,et al.  Phevor combines multiple biomedical ontologies for accurate identification of disease-causing alleles in single individuals and small nuclear families. , 2014, American journal of human genetics.