Understanding the Bioinformatics Challenges of Integrating Genomics Into Healthcare

Genomic data are paving the way toward personalized healthcare. By unveiling genetic disease-contributing factors, genomic data can aid in the detection, diagnosis, and treatment of a wide range of complex diseases. Integrating genomic data into healthcare is riddled with a wide range of challenges spanning social, ethical, legal, educational, economic, and technical aspects. Bioinformatics is a core integration aspect presenting an overwhelming number of unaddressed challenges. In this paper, we tackle the fundamental bioinformatics integration concerns including: genomic data generation, storage, representation, and utilization in conjunction with clinical data. We divide the bioinformatics challenges into a series of seven intertwined integration aspects spanning the areas of informatics, knowledge management, and communication. For each aspect, we provide a detailed discussion of the current research directions, outstanding challenges, and possible resolutions. This paper seeks to help narrow the gap between the genomic applications, which are being predominantly utilized in research settings, and the clinical adoption of these applications.

[1]  Stanley C. Ahalt,et al.  A New Framework and Prototype Solution for Clinical Decision Support and Research in Genomics and Other Data-intensive Fields of Medicine , 2016, EGEMS.

[2]  M. Ladanyi,et al.  Integration of Molecular Profiling into the Lung Cancer Clinic , 2009, Clinical Cancer Research.

[3]  A. Ashworth,et al.  The DNA damage response and cancer therapy , 2012, Nature.

[4]  Leslie G. Biesecker,et al.  Opportunities and challenges for the integration of massively parallel genomic sequencing into clinical practice: lessons from the ClinSeq project , 2012, Genetics in Medicine.

[5]  Abel N. Kho,et al.  Practical challenges in integrating genomic data into the electronic health record , 2013, Genetics in Medicine.

[6]  Diane Hauser,et al.  The IGNITE network: a model for genomic medicine implementation and research , 2015, BMC Medical Genomics.

[7]  J. Harrow,et al.  Assessment of transcript reconstruction methods for RNA-seq , 2013, Nature Methods.

[8]  Sivakumar Gowrisankar,et al.  The landscape of genetic variation in dilated cardiomyopathy as surveyed by clinical DNA sequencing , 2014, Genetics in Medicine.

[9]  Alison B. Hamilton,et al.  Factors influencing organizational adoption and implementation of clinical genetic services , 2013, Genetics in Medicine.

[10]  Dan M. Roden,et al.  Leveraging the electronic health record to implement genomic medicine , 2012, Genetics in Medicine.

[11]  Teruyoshi Hishiki,et al.  Extraction of Gene-Disease Relations from Medline Using Domain Dictionaries and Machine Learning , 2005, Pacific Symposium on Biocomputing.

[12]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[13]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[14]  Barbara A. Eckman,et al.  A Practitioner's Guide to Data Management and Data Integration in Bioinformatics , 2003, Bioinformatics.

[15]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[16]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[17]  G. Van Camp,et al.  Genetic diagnostics of early childhood hearing loss: better testing with next-generation DNA sequencing. , 2013, B-ENT.

[18]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[19]  Paul G Shekelle,et al.  Delivery of genomic medicine for common chronic adult diseases: a systematic review. , 2008, JAMA.

[20]  Helen E. Parkinson,et al.  The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog) , 2016, Nucleic Acids Res..

[21]  Arthur W. Toga,et al.  Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows , 2012, Genes.

[22]  M. Levy,et al.  Integrating cancer genomic data into electronic health records , 2016, Genome Medicine.

[23]  Joshua C. Denny,et al.  Chapter 13: Mining Electronic Health Records in the Genomics Era , 2012, PLoS Comput. Biol..

[24]  Aniruddha Datta,et al.  A Survey of Software and Hardware Approaches to Performing Read Alignment in Next Generation Sequencing , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[25]  Ann E. Loraine,et al.  Genoviz Software Development Kit: Java tool kit for building genomics visualization applications , 2009, BMC Bioinformatics.

[26]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[27]  I. Kohane Using electronic health records to drive discovery in disease genomics , 2011, Nature Reviews Genetics.

[28]  Muin J Khoury,et al.  Genetics and genomics in practice: The continuum from genetic disease to genetic information in health and disease , 2003, Genetics in Medicine.

[29]  Susan M Wolf,et al.  Patient Autonomy and Incidental Findings in Clinical Genomics , 2013, Science.

[30]  Christopher G. Chute,et al.  CSER and eMERGE: current and potential state of the display of genetic information in the electronic health record , 2015, J. Am. Medical Informatics Assoc..

[31]  Heidi L. Rehm,et al.  Disease-targeted sequencing: a cornerstone in the clinic , 2013, Nature Reviews Genetics.

[32]  David G. Knowles,et al.  The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression , 2012, Genome research.

[33]  Philip Lijnzaad,et al.  The Ensembl genome database project , 2002, Nucleic Acids Res..

[34]  K. Itakura,et al.  Detection of sickle cell beta S-globin allele by hybridization with synthetic oligonucleotides. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[36]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[37]  Munir Pirmohamed,et al.  Personalized medicine: decades away? , 2006, Pharmacogenomics.

[38]  Henry T. Greely,et al.  Clinical genomics, big data, and electronic medical records: reconciling patient rights with research when privacy and science collide , 2017, Journal of law and the biosciences.

[39]  N. Carter Methods and strategies for analyzing copy number variation using DNA microarrays , 2007, Nature Genetics.

[40]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[41]  R. Tibshirani,et al.  Gene expression profiling identifies clinically relevant subtypes of prostate cancer. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[42]  E. Clayton,et al.  Operational Implementation of Prospective Genotyping for Personalized Medicine: The Design of the Vanderbilt PREDICT Project , 2012, Clinical pharmacology and therapeutics.

[43]  Michael Krauthammer,et al.  GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data , 2004, J. Biomed. Informatics.

[44]  A Chakravarti,et al.  High-throughput variation detection and genotyping using microarrays. , 2001, Genome research.

[45]  Jana Marie Schwarz,et al.  MutationTaster evaluates disease-causing potential of sequence alterations , 2010, Nature Methods.

[46]  M. Guyer,et al.  Charting a course for genomic medicine from base pairs to bedside , 2011, Nature.

[47]  Isaac S. Kohane,et al.  Technical desiderata for the integration of genomic data into Electronic Health Records , 2012, J. Biomed. Informatics.

[48]  Kensaku Kawamoto,et al.  Bmc Medical Informatics and Decision Making a National Clinical Decision Support Infrastructure to Enable the Widespread and Consistent Practice of Genomic and Personalized Medicine , 2009 .

[49]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[50]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[51]  Mary Goldman,et al.  The UCSC Genome Browser database: extensions and updates 2013 , 2012, Nucleic Acids Res..

[52]  Russ B. Altman,et al.  PharmGKB: the Pharmacogenetics Knowledge Base , 2002, Nucleic Acids Res..

[53]  Marc S. Williams,et al.  ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing , 2013, Genetics in Medicine.

[54]  Alon Y. Halevy,et al.  Data integration and genomic medicine , 2007, J. Biomed. Informatics.

[55]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[56]  Stylianos E. Antonarakis,et al.  Inversions disrupting the factor VIII gene are a common cause of severe haemophilia A , 1993, Nature Genetics.

[57]  K. Sirotkin,et al.  The NCBI dbGaP database of genotypes and phenotypes , 2007, Nature Genetics.

[58]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[59]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[60]  Elisa Rossi,et al.  Epidermal growth factor receptor gene and protein and gefitinib sensitivity in non-small-cell lung cancer. , 2005, Journal of the National Cancer Institute.

[61]  U. Meyer Pharmacogenetics and adverse drug reactions , 2000, The Lancet.

[62]  Felix W Frueh,et al.  From pharmacogenetics to personalized medicine: a vital need for educating health professionals and the community. , 2004, Pharmacogenomics.

[63]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[64]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[65]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[66]  George Hripcsak,et al.  Opportunities for genomic clinical decision support interventions , 2013, Genetics in Medicine.

[67]  V. Beneš,et al.  Integrative genomic analyses reveal an androgen-driven somatic alteration landscape in early-onset prostate cancer. , 2013, Cancer cell.

[68]  Jeffrey C. Hall,et al.  The CLIPMERGE PGx Program: Clinical Implementation of Personalized Medicine Through Electronic Health Records and Genomics–Pharmacogenomics , 2013, Clinical pharmacology and therapeutics.

[69]  Guilherme Del Fiol,et al.  Technical desiderata for the integration of genomic data with clinical decision support , 2014, J. Biomed. Informatics.

[70]  Jack A. Taylor,et al.  SNPinfo: integrating GWAS and candidate gene information into functional SNP selection for genetic association studies , 2009, Nucleic Acids Res..

[71]  A. McGuire,et al.  Research ethics and the challenge of whole-genome sequencing , 2008, Nature Reviews Genetics.

[72]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[73]  M. Khoury,et al.  The continuum of translation research in genomic medicine: how can we accelerate the appropriate integration of human genome discoveries into health care and disease prevention? , 2007, Genetics in Medicine.

[74]  Richard O. Sinnott,et al.  Investigating reproducibility and tracking provenance – A genomic workflow case study , 2017, BMC Bioinformatics.

[75]  K. Phillips,et al.  Challenges to the translation of genomic information into clinical practice and health policy: Utilization, preferences and economic value. , 2008, Current opinion in molecular therapeutics.

[76]  Daniel Rios,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[77]  J. Fackenthal,et al.  Aberrant RNA splicing and its functional consequences in cancer cells , 2008, Disease Models & Mechanisms.

[78]  W J Kleijer,et al.  Inversion of the IDS gene resulting from recombination with IDS-related sequences is a common cause of the Hunter syndrome. , 1995, Human molecular genetics.

[79]  S. Drăghici,et al.  Analysis of microarray experiments of gene expression profiling. , 2006, American journal of obstetrics and gynecology.

[80]  D. Jaffe,et al.  Molecular Diagnosis of Infantile Mitochondrial Disease with Targeted Next-Generation Sequencing , 2012, Science Translational Medicine.

[81]  Brandon M. Welch,et al.  The Need for Clinical Decision Support Integrated with the Electronic Health Record for the Clinical Application of Whole Genome Sequencing Information , 2013, Journal of Personalized Medicine.