Challenges of Identifying Clinically Actionable Genetic Variants for Precision Medicine

Advances in genomic medicine have the potential to change the way we treat human disease, but translating these advances into reality for improving healthcare outcomes depends essentially on our ability to discover disease- and/or drug-associated clinically actionable genetic mutations. Integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a big data infrastructure can provide an efficient and effective way to identify clinically actionable genetic variants for personalized treatments and reduce healthcare costs. We review bioinformatics processing of next-generation sequencing (NGS) data, bioinformatics infrastructures for implementing precision medicine, and bioinformatics approaches for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs.

[1]  Ginny Allain,et al.  Personalized medicine. , 2012, MLO: medical laboratory observer.

[2]  F. Sanger,et al.  DNA sequencing with chain-terminating inhibitors. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Robert C. Green,et al.  How to know when physicians are ready for genomic medicine , 2015, Science Translational Medicine.

[4]  Stephen C. J. Parker,et al.  Accurate and comprehensive sequencing of personal genomes. , 2011, Genome research.

[5]  Christopher G. Chute,et al.  CSER and eMERGE: current and potential state of the display of genetic information in the electronic health record , 2015, J. Am. Medical Informatics Assoc..

[6]  Kenneth D. Doig,et al.  Assessing the clinical value of targeted massively parallel sequencing in a longitudinal, prospective population-based study of cancer patients , 2015, British Journal of Cancer.

[7]  Roland Eils,et al.  Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies , 2013, PloS one.

[8]  C. Sander,et al.  Predicting the functional impact of protein mutations: application to cancer genomics , 2011, Nucleic acids research.

[9]  Russ B. Altman,et al.  PATH-SCAN: A Reporting Tool for Identifying Clinically Actionable Variants , 2013, Pacific Symposium on Biocomputing.

[10]  Peggy L Peissig,et al.  SeqHBase: a big data toolset for family based sequencing data analysis , 2015, Journal of Medical Genetics.

[11]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[12]  Matthew S. Lebo,et al.  A survey of informatics approaches to whole-exome and whole-genome clinical reporting in the electronic health record , 2013, Genetics in Medicine.

[13]  Lin Liu,et al.  Comparison of Next-Generation Sequencing Systems , 2012, Journal of biomedicine & biotechnology.

[14]  L. Vissers,et al.  Genome sequencing identifies major causes of severe intellectual disability , 2014, Nature.

[15]  Heidi L Rehm,et al.  ClinGen--the Clinical Genome Resource. , 2015, The New England journal of medicine.

[16]  Abel N. Kho,et al.  Practical challenges in integrating genomic data into the electronic health record , 2013, Genetics in Medicine.

[17]  W. Miller,et al.  Comparison of Sequencing Platforms for Single Nucleotide Variant Calls in a Human Sample , 2013, PloS one.

[18]  Suzette J. Bielinski,et al.  Design and Anticipated Outcomes of the eMERGE-PGx Project: A Multi-Center Pilot for Pre-Emptive Pharmacogenomics in Electronic Health Record Systems , 2014, Clinical pharmacology and therapeutics.

[19]  Kensaku Kawamoto,et al.  A Proposed Clinical Decision Support Architecture Capable of Supporting Whole Genome Sequence Information , 2014, Journal of personalized medicine.

[20]  Timothy B. Stockwell,et al.  Evaluation of next generation sequencing platforms for population targeted sequencing studies , 2009, Genome Biology.

[21]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[22]  Miguel López-Coronado,et al.  Analysis of the Security and Privacy Requirements of Cloud-Based Electronic Health Records Systems , 2013, Journal of medical Internet research.

[23]  Olivier Elemento,et al.  A primer on precision medicine informatics , 2016, Briefings Bioinform..

[24]  Jing Hu,et al.  SIFT web server: predicting effects of amino acid substitutions on proteins , 2012, Nucleic Acids Res..

[25]  Eija Korpelainen,et al.  Hadoop-BAM: directly manipulating next generation sequencing data in the cloud , 2012, Bioinform..

[26]  Daniel Rios,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[27]  Quinten Waisfisz,et al.  Best Practice Guidelines for the Use of Next‐Generation Sequencing Applications in Genome Diagnostics: A National Collaborative Study of Dutch Genome Diagnostic Laboratories , 2013, Human mutation.

[28]  Inge Jonassen,et al.  Characteristics of 454 pyrosequencing data—enabling realistic simulation with flowsim , 2010, Bioinform..

[29]  Yan Guo,et al.  Three-stage quality control strategies for DNA re-sequencing data , 2014, Briefings Bioinform..

[30]  S. Riazuddin,et al.  Investigating the Molecular Basis of Retinal Degeneration in a Familial Cohort of Pakistani Decent by Exome Sequencing , 2015, PloS one.

[31]  Melissa A. Basford,et al.  The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future , 2013, Genetics in Medicine.

[32]  Marco Mora,et al.  Next-Generation Sequencing Workflow for NSCLC Critical Samples Using a Targeted Sequencing Approach by Ion Torrent PGM™ Platform , 2015, International journal of molecular sciences.

[33]  M. Schatz,et al.  Reducing INDEL calling errors in whole genome and exome sequencing data , 2014, Genome Medicine.

[34]  Mingyao Li,et al.  Evaluating the Impact of Sequencing Depth on Transcriptome Profiling in Human Adipose , 2013, PloS one.

[35]  Kevin B. Johnson,et al.  Electronic health record design and implementation for pharmacogenomics: a local perspective , 2013, Genetics in Medicine.

[36]  Gil Alterovitz,et al.  All the World's a Stage: Facilitating Discovery Science and Improved Cancer Care through the Global Alliance for Genomics and Health. , 2015, Cancer discovery.

[37]  J. Shendure,et al.  Exome sequencing as a tool for Mendelian disease gene discovery , 2011, Nature Reviews Genetics.

[38]  Bernard P. Puc,et al.  An integrated semiconductor device enabling non-optical genome sequencing , 2011, Nature.

[39]  S. Ellard,et al.  Using SIFT and PolyPhen to predict loss-of-function and gain-of-function mutations. , 2010, Genetic testing and molecular biomarkers.

[40]  E. Lander,et al.  Genomic mapping by fingerprinting random clones: a mathematical analysis. , 1988, Genomics.

[41]  Steven M. Johnson,et al.  A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. , 2008, Genome research.

[42]  Jie Zhou,et al.  RNA-seq differential expression studies: more sequence or more replication? , 2014, Bioinform..

[43]  Boris Yamrom,et al.  The contribution of de novo coding mutations to autism spectrum disorder , 2014, Nature.

[44]  Joshua L. Deignan,et al.  ACMG clinical laboratory standards for next-generation sequencing , 2013, Genetics in Medicine.

[45]  Yasunobu Terabayashi,et al.  First Complete Genome Sequences of Staphylococcus aureus subsp. aureus Rosenbach 1884 (DSM 20231T), Determined by PacBio Single-Molecule Real-Time Technology , 2015, Genome Announcements.

[46]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[47]  Tommaso Mazza,et al.  Congruency in the prediction of pathogenic missense mutations: state-of-the-art web-based tools , 2013, Briefings Bioinform..

[48]  J. Kelsoe,et al.  RNA sequencing of transcriptomes in human brain regions: protein-coding and non-coding RNAs, isoforms and alleles , 2015, BMC Genomics.

[49]  Régis Beuscart,et al.  Toward a Literature-Driven Definition of Big Data in Healthcare , 2015, BioMed research international.

[50]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[51]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[52]  Marc S. Williams,et al.  ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing , 2013, Genetics in Medicine.

[53]  Ian M. Carr,et al.  OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization , 2015, Bioinform..

[54]  Hugo Y. K. Lam,et al.  Performance comparison of exome DNA sequencing technologies , 2011, Nature Biotechnology.

[55]  H. Swerdlow,et al.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers , 2012, BMC Genomics.

[56]  S. Teutsch,et al.  The Evaluation of Genomic Applications in Practice and Prevention (EGAPP) initiative: methods of the EGAPP Working Group , 2009, Genetics in Medicine.

[57]  Michael R. Hayden,et al.  Hunting human disease genes: lessons from the past, challenges for the future , 2013, Human Genetics.

[58]  Roy D. Sleator,et al.  'Big data', Hadoop and cloud computing in genomics , 2013, J. Biomed. Informatics.

[59]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[60]  Shi Jing,et al.  Innovative Testing and Measurement Solutions for Smart Grid: Huang/Innovative Testing and Measurement Solutions for Smart Grid , 2014 .

[61]  Sylvia Stracke,et al.  Cohort profile: Greifswald approach to individualized medicine (GANI_MED) , 2014, Journal of Translational Medicine.

[62]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[63]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[64]  Joyce A. Mitchell,et al.  Utility of gene-specific algorithms for predicting pathogenicity of uncertain gene variants , 2012, J. Am. Medical Informatics Assoc..

[65]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[66]  Christian Brueffer,et al.  The Sweden Cancerome Analysis Network - Breast (SCAN-B) Initiative: a large-scale multicenter infrastructure towards implementation of breast cancer genomic analyses in the clinical routine , 2015, Genome Medicine.

[67]  Kensaku Kawamoto,et al.  Characterizing genetic variants for clinical action , 2014, American journal of medical genetics. Part C, Seminars in medical genetics.

[68]  S. Turner,et al.  Real-time DNA sequencing from single polymerase molecules. , 2010, Methods in enzymology.

[69]  De novo mutations in epileptic encephalopathies , 2013 .

[70]  F. Gnad,et al.  Assessment of computational methods for predicting the effects of missense mutations in human cancers , 2013, BMC Genomics.

[71]  Ken Chen,et al.  ClinSeK: a targeted variant characterization framework for clinical sequencing , 2015, Genome Medicine.

[72]  R. Bennett,et al.  A practice guideline from the American College of Medical Genetics and Genomics and the National Society of Genetic Counselors: referral indications for cancer predisposition assessment , 2014, Genetics in Medicine.

[73]  M. Christman,et al.  The Coriell personalized medicine collaborative pharmacogenomics appraisal, evidence scoring and interpretation system , 2013, Genome Medicine.

[74]  P. Baybayan,et al.  Multiple Genome Sequences of Helicobacter pylori Strains of Diverse Disease and Antibiotic Resistance Backgrounds from Malaysia , 2013, Genome Announcements.

[75]  M J Becich,et al.  Clinical integration of next-generation sequencing technology. , 2012, Clinics in laboratory medicine.

[76]  Birgit Funke,et al.  College of American Pathologists' laboratory standards for next-generation sequencing clinical tests. , 2015, Archives of pathology & laboratory medicine.

[77]  F. Hock Drug Discovery and Evaluation: Pharmacological Assays , 2016, Springer International Publishing.

[78]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[79]  M. Schatz,et al.  Searching for SNPs with cloud computing , 2009, Genome Biology.

[80]  Damian Smedley,et al.  Improved exome prioritization of disease genes through cross-species phenotype comparison , 2014, Genome research.

[81]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[82]  Paolo Rocco,et al.  Good laboratory practice for clinical next-generation sequencing informatics pipelines , 2015 .

[83]  Avni Santani,et al.  Actionable exomic incidental findings in 6503 participants: challenges of variant classification , 2015, Genome research.

[84]  Iftikhar J. Kullo,et al.  Ethical, legal, and social implications of incorporating genomic information into electronic health records , 2013, Genetics in Medicine.

[85]  Timothy D. Harris,et al.  The challenges of sequencing by synthesis , 2009, Nature Biotechnology.

[86]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[87]  T. Klein,et al.  CPIC: Clinical Pharmacogenetics Implementation Consortium of the Pharmacogenomics Research Network , 2011, Clinical pharmacology and therapeutics.

[88]  C. Ponting,et al.  Sequencing depth and coverage: key considerations in genomic analyses , 2014, Nature Reviews Genetics.

[89]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[90]  Richard Durbin,et al.  Extending reference assembly models , 2015, Genome Biology.

[91]  Y. Teo,et al.  Nature and Extent of Genetic Diversity of Dengue Viruses Determined by 454 Pyrosequencing , 2015, PloS one.

[92]  Lior Pachter,et al.  The NIH BD2K center for big data in translational genomics , 2015, J. Am. Medical Informatics Assoc..

[93]  H. Guchelaar,et al.  Pharmacogenetics: From Bench to Byte— An Update of Guidelines , 2011, Clinical pharmacology and therapeutics.

[94]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[95]  E. Koay,et al.  Targeted next-generation sequencing of the ATP7B gene for molecular diagnosis of Wilson disease. , 2016, Clinical biochemistry.