Supporting precision medicine by data mining across multi‐disciplines: an integrative approach for generating comprehensive linkages between single nucleotide variants (SNVs) and drug‐binding sites

Motivation: Genetic variants in drug targets and metabolizing enzymes often have important functional implications, including altering the efficacy and toxicity of drugs. Identifying single nucleotide variants (SNVs) that contribute to differences in drug response and understanding their underlying mechanisms are fundamental to successful implementation of the precision medicine model. This work reports an effort to collect, classify and analyze SNVs that may affect the optimal response to currently approved drugs. Results: An integrated approach was taken involving data mining across multiple information resources including databases containing drugs, drug targets, chemical structures, protein‐ligand structure complexes, genetic and clinical variations as well as protein sequence alignment tools. We obtained 2640 SNVs of interest, most of which occur rarely in populations (minor allele frequency < 0.01). Clinical significance of only 9.56% of the SNVs is known in ClinVar, although 79.02% are predicted as deleterious. The examples here demonstrate that even if the mapped SNVs predicted as deleterious may not result in significant structural modifications, they can plausibly modify the protein‐drug interactions, affecting selectivity and drug‐binding affinity. Our analysis identifies potentially deleterious SNVs present on drug‐binding residues that are relevant for further studies in the context of precision medicine. Availability and Implementation: Data are available from Supplementary information file. Contact: yanli.wang@nih.gov Supplementary information: Supplementary Tables S1‐S5 are available at Bioinformatics online.

[1]  Y. J. Kim,et al.  Meta-analysis of genome-wide association studies in East Asian-ancestry populations identifies four new loci for body mass index. , 2014, Human molecular genetics.

[2]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[3]  C. Geula,et al.  Neurobiology of butyrylcholinesterase , 2003, Nature Reviews Neuroscience.

[4]  C. Bartels,et al.  Characterization of 12 silent alleles of the human butyrylcholinesterase (BCHE) gene. , 1996, American journal of human genetics.

[5]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[6]  G. Jagadeesh,et al.  Structural determinants for binding, activation, and functional selectivity of the angiotensin AT1 receptor. , 2014, Journal of molecular endocrinology.

[7]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[8]  Sarah Ciccone,et al.  The anti-cancer drug chlorambucil as a substrate for the human polymorphic enzyme glutathione transferase P1-1: kinetic properties and crystallographic characterisation of allelic variants. , 2008, Journal of molecular biology.

[9]  Y. J. Kim,et al.  Meta-analysis identifies multiple loci associated with kidney function–related traits in east Asian populations , 2012, Nature Genetics.

[10]  B. Heckman-Stoddard,et al.  Precision medicine clinical trials: defining new treatment strategies. , 2014, Seminars in oncology nursing.

[11]  Richard Simon,et al.  Genomic AlterationDriven Clinical Trial Designs in Oncology , 2016, Annals of Internal Medicine.

[12]  E. Génin,et al.  How important are rare variants in common disease? , 2014, Briefings in functional genomics.

[13]  Ricardo Villamarín-Salomón,et al.  ClinVar: public archive of interpretations of clinically relevant variants , 2015, Nucleic Acids Res..

[14]  M. Field,et al.  Molecular dynamics simulations of human butyrylcholinesterase , 2005, Proteins.

[15]  G. Aithal,et al.  Association of polymorphisms in the cytochrome P450 CYP2C9 with warfarin dose requirement and risk of bleeding complications , 1999, The Lancet.

[16]  Elspeth A. Bruford,et al.  Genenames.org: the HGNC resources in 2015 , 2014, Nucleic Acids Res..

[17]  C. P. Nogueira,et al.  Identification of the structural mutation responsible for the dibucaine-resistant (atypical) variant form of human serum cholinesterase. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[18]  D. Schaid,et al.  Citalopram and escitalopram plasma drug and metabolite concentrations: genome-wide associations. , 2014, British Journal of Clinical Pharmacology.

[19]  J. Rossjohn,et al.  Multifunctional role of Tyr 108 in the catalytic mechanism of human glutathione transferase P1-1. Crystallographic and kinetic studies on the Y108F mutant enzyme. , 1997, Biochemistry.

[20]  K. Iwasaki,et al.  Genetic analysis of a Japanese patient with butyrylcholinesterase deficiency , 1997, Annals of human genetics.

[21]  Yanli Wang,et al.  MMDB: 3D structures and macromolecular interactions , 2011, Nucleic Acids Res..

[22]  John Moult,et al.  GWAS and drug targets , 2014, BMC Genomics.

[23]  Gayatry Mohapatra,et al.  Inherited susceptibility to lung cancer may be associated with the T790M drug resistance mutation in EGFR , 2005, Nature Genetics.

[24]  E. Boerwinkle,et al.  dbNSFP v3.0: A One‐Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice‐Site SNVs , 2016, Human mutation.

[25]  M. Denis,et al.  EGFR T790M resistance mutation in non small-cell lung carcinoma. , 2015, Clinica chimica acta; international journal of clinical chemistry.

[26]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[27]  T. Ogihara,et al.  Genome-wide association study of coronary artery disease in the Japanese , 2011, European Journal of Human Genetics.

[28]  Julie A. Johnson,et al.  Influence of coagulation factor, vitamin K epoxide reductase complex subunit 1, and cytochrome P450 2C9 gene polymorphisms on warfarin dose requirements , 2006, Clinical pharmacology and therapeutics.

[29]  C. Tyler-Smith,et al.  Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing. , 2012, American journal of human genetics.

[30]  Hui-Yong Sun,et al.  Finding chemical drugs for genetic diseases. , 2014, Drug discovery today.

[31]  Yvain Nicolet,et al.  Crystal Structure of Human Butyrylcholinesterase and of Its Complexes with Substrate and Products* , 2003, Journal of Biological Chemistry.

[32]  Hong-Wen Deng,et al.  ALDH2 is associated to alcohol dependence and is the major genetic determinant of “daily maximum drinks” in a GWAS study of an isolated rural chinese sample , 2014, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[33]  Y. Kamatani,et al.  Functional variants in ADH1B and ALDH2 coupled with alcohol and smoking synergistically enhance esophageal cancer risk. , 2009, Gastroenterology.

[34]  D. Townsend,et al.  The role of glutathione-S-transferase in anti-cancer drug resistance , 2003, Oncogene.

[35]  R. Leduc,et al.  Analysis of Transmembrane Domains 1 and 4 of the Human Angiotensin II AT1 Receptor by Cysteine-scanning Mutagenesis* , 2009, The Journal of Biological Chemistry.

[36]  Chien-Jen Chen,et al.  Association of GSTP1 Polymorphism and Survival for Esophageal Cancer , 2005, Clinical Cancer Research.

[37]  R. Deberardinis,et al.  Spectrum of mutations in the renin–angiotensin system genes in autosomal recessive renal tubular dysgenesis , 2012, Human mutation.

[38]  J. Buolamwini,et al.  Molecular Cloning, Characterization, and Expression in Escherichia coli of Full-length cDNAs of Three Human Glutathione S-Transferase Pi Gene Variants , 1997, The Journal of Biological Chemistry.

[39]  Yusuke Nakamura,et al.  Genome-wide association study of hematological and biochemical traits in a Japanese population , 2010, Nature Genetics.

[40]  A. Furano,et al.  The mutational spectrum of non-CpG DNA varies with CpG content. , 2010, Genome research.

[41]  M. Meyerson,et al.  EGFR mutation and resistance of non-small-cell lung cancer to gefitinib. , 2005, The New England journal of medicine.

[42]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[43]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[44]  A. Chinnaiyan,et al.  AGTR1 as a therapeutic target in ER-positive and ERBB-negative breast cancer cases , 2009, Cell cycle.

[45]  E. Laborde Glutathione transferases as mediators of signaling pathways involved in cell proliferation and cell death , 2010, Cell Death and Differentiation.

[46]  Janet M. Thornton,et al.  Amino Acid Changes in Disease-Associated Variants Differ Radically from Variants Observed in the 1000 Genomes Project Dataset , 2013, PLoS Comput. Biol..

[47]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[48]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[49]  Raymond C. Stevens,et al.  Structural Basis for Ligand Recognition and Functional Selectivity at Angiotensin Receptor*♦ , 2015, The Journal of Biological Chemistry.

[50]  J. Lindh,et al.  Influence of CYP2C9 genotype on warfarin dose requirements—a systematic review and meta-analysis , 2009, European Journal of Clinical Pharmacology.

[51]  T. Beer,et al.  Polymorphisms of GSTP1 and related genes and prostate cancer risk , 2002, Prostate Cancer and Prostatic Diseases.

[52]  A. Daly,et al.  Polymorphisms in GSTP1, GSTM1, and GSTT1 and susceptibility to colorectal cancer. , 1999, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.