An Innovative Sequence-to-Structure-Based Approach to Drug Resistance Interpretation and Prediction: The Use of Molecular Interaction Fields to Detect HIV-1 Protease Binding-Site Dissimilarities

In silico methodologies have opened new avenues of research to understanding and predicting drug resistance, a pressing health issue that keeps rising at alarming pace. Sequence-based interpretation systems are routinely applied in clinical context in an attempt to predict mutation-based drug resistance and thus aid the choice of the most adequate antibiotic and antiviral therapy. An important limitation of approaches based on genotypic data exclusively is that mutations are not considered in the context of the three-dimensional (3D) structure of the target. Structure-based in silico methodologies are inherently more suitable to interpreting and predicting the impact of mutations on target-drug interactions, at the cost of higher computational and time demands when compared with sequence-based approaches. Herein, we present a fast, computationally inexpensive, sequence-to-structure-based approach to drug resistance prediction, which makes use of 3D protein structures encoded by input target sequences to draw binding-site comparisons with susceptible templates. Rather than performing atom-by-atom comparisons between input target and template structures, our workflow generates and compares Molecular Interaction Fields (MIFs) that map the areas of energetically favorable interactions between several chemical probe types and the target binding site. Quantitative, pairwise dissimilarity measurements between the target and the template binding sites are thus produced. The method is particularly suited to understanding changes to the 3D structure and the physicochemical environment introduced by mutations into the target binding site. Furthermore, the workflow relies exclusively on freeware, making it accessible to anyone. Using four datasets of known HIV-1 protease sequences as a case-study, we show that our approach is capable of correctly classifying resistant and susceptible sequences given as input. Guided by ROC curve analyses, we fined-tuned a dissimilarity threshold of classification that results in remarkable discriminatory performance (accuracy ≈ ROC AUC ≈ 0.99), illustrating the high potential of sequence-to-structure-, MIF-based approaches in the context of drug resistance prediction. We discuss the complementarity of the proposed methodology to existing prediction algorithms based on genotypic data. The present work represents a new step toward a more comprehensive and structurally-informed interpretation of the impact of genetic variability on the response to HIV-1 therapies.

[1]  Pawan Gupta,et al.  Prediction of drug-resistance in HIV-1 subtype C based on protease sequences from ART naive and first-line treatment failures in North India using genotypic and docking analysis. , 2011, Antiviral research.

[2]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[3]  Lynne Strasfeld,et al.  Antiviral drug resistance: mechanisms and clinical implications. , 2010, Infectious disease clinics of North America.

[4]  Marcus D. Ruopp,et al.  Youden Index and Optimal Cut‐Point Estimated from Observations Affected by a Lower Limit of Detection , 2008, Biometrical journal. Biometrische Zeitschrift.

[5]  J. Richardson,et al.  Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. , 1999, Journal of molecular biology.

[6]  Arun K. Ghosh,et al.  Structural studies of antiviral inhibitor with HIV-1 protease bearing drug resistant substitutions of V32I, I47V and V82I. , 2019, Biochemical and biophysical research communications.

[7]  José Mario Martínez,et al.  Convergent algorithms for protein structural alignment , 2007, BMC Bioinformatics.

[8]  J. Tukey Comparing individual means in the analysis of variance. , 1949, Biometrics.

[9]  H. Walter,et al.  HIV-GRADE: A Publicly Available, Rules-Based Drug Resistance Interpretation Algorithm Integrating Bioinformatic Knowledge , 2012, Intervirology.

[10]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[11]  Celia A. Schiffer,et al.  Structural and Thermodynamic Basis for the Binding of TMC114, a Next-Generation Human Immunodeficiency Virus Type 1 Protease Inhibitor , 2004, Journal of Virology.

[12]  R. Shafer,et al.  2019 update of the drug resistance mutations in HIV-1. , 2019, Topics in antiviral medicine.

[13]  Tommy F. Liu,et al.  The HIVdb System for HIV-1 Genotypic Resistance Interpretation , 2012, Intervirology.

[14]  J. Pines,et al.  Evidence-based emergency care , 2008 .

[15]  Zoya Khalid,et al.  Prediction of HIV Drug Resistance by Combining Sequence and Structural Properties , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  J. Louis,et al.  Structural Studies of a Rationally Selected Multi-Drug Resistant HIV-1 Protease Reveal Synergistic Effect of Distal Mutations on Flap Dynamics , 2016, PloS one.

[17]  C. Florkowski Sensitivity, specificity, receiver-operating characteristic (ROC) curves and likelihood ratios: communicating the performance of diagnostic tests. , 2008, The Clinical biochemist. Reviews.

[18]  B. Doneley,et al.  Interpreting Diagnostic Tests , 2018, Avian Medicine and Surgery in Practice.

[19]  Stefano Alcaro,et al.  Molecular interaction fields in drug discovery: recent advances and future perspectives , 2013 .

[20]  J. Wu,et al.  Comparison of antiviral resistance across acute and chronic viral infections , 2018, Antiviral research.

[21]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[22]  Thomas Hummel,et al.  SHIVA - a web application for drug resistance and tropism testing in HIV , 2016, BMC Bioinformatics.

[23]  Majid Masso,et al.  Sequence and structure based models of HIV-1 protease and reverse transcriptase drug resistance , 2013, BMC Genomics.

[24]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[25]  Vasant Ranade Molecular Interaction Fields , 2006 .

[26]  Özlem Tastan Bishop,et al.  Characterizing early drug resistance-related events using geometric ensembles from HIV protease dynamics , 2018, Scientific Reports.

[27]  Bryan Chan,et al.  Human immunodeficiency virus reverse transcriptase and protease sequence database , 2003, Nucleic Acids Res..

[28]  Thomas Lengauer,et al.  Innovations: Bioinformatics-assisted anti-HIV therapy , 2006, Nature Reviews Microbiology.

[29]  I. Weber,et al.  Tackling the problem of HIV drug resistance. , 2016, Postepy biochemii.

[30]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[31]  Jie Zhang,et al.  Comparison of the Precision and Sensitivity of the Antivirogram and PhenoSense HIV Drug Susceptibility Assays , 2005, Journal of acquired immune deficiency syndromes.

[32]  V. Poroikov,et al.  A Computational Approach for the Prediction of HIV Resistance Based on Amino Acid and Nucleotide Descriptors , 2018, Molecules.

[33]  Thomas Lengauer,et al.  Geno2pheno: estimating phenotypic drug resistance from HIV-1 genotypes , 2003, Nucleic Acids Res..

[34]  David C. Hoaglin,et al.  John W. Tukey and data analysis , 2003 .

[35]  Rafael Najmanovich,et al.  Detection of Binding Site Molecular Interaction Field Similarities , 2015, J. Chem. Inf. Model..

[36]  P. Goodford A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. , 1985, Journal of medicinal chemistry.

[37]  Louis-Philippe Morency,et al.  NRGsuite: a PyMOL plugin to perform docking simulations in real time using FlexAID , 2015, Bioinform..

[38]  A. Wlodawer,et al.  Structure-based inhibitors of HIV-1 protease. , 1993, Annual review of biochemistry.

[39]  A. Vandamme,et al.  A Genotypic Drug Resistance Interpretation Algorithm that Significantly Predicts Therapy Response in HIV-1-Infected Patients , 2001, Antiviral therapy.

[40]  Ge-Fei Hao,et al.  Structure-based methods for predicting target mutation-induced drug resistance and rational drug design to overcome the problem. , 2012, Drug discovery today.

[41]  A. Walmsley,et al.  Microbial and viral drug resistance mechanisms. , 2002, Trends in microbiology.

[42]  V. Calvez,et al.  Clinically relevant interpretation of genotype for resistance to abacavir , 2003, AIDS.

[43]  J. Jensen,et al.  Antiviral drug resistance as an adaptive process , 2016, Virus evolution.

[44]  Ram Samudrala,et al.  Prediction of HIV-1 Protease Inhibitor Resistance using a Protein–Inhibitor Flexible Docking Approach , 2005, Antiviral therapy.

[45]  Gabriele Cruciani,et al.  Molecular interaction fields : applications in drug discovery and ADME prediction , 2005 .

[46]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[47]  Irene T. Weber,et al.  HIV-1 Protease: Structural Perspectives on Drug Resistance , 2009, Viruses.

[48]  Tingjun Hou,et al.  Molecular dynamics and free energy studies on the wild-type and double mutant HIV-1 protease complexed with amprenavir and two amprenavir-related inhibitors: mechanism for binding and drug resistance. , 2007, Journal of medicinal chemistry.

[49]  S. Singh,et al.  An in silico pharmacological approach toward the discovery of potent inhibitors to combat drug resistance HIV‐1 protease variants , 2018, Journal of cellular biochemistry.

[50]  Simon C. Potter,et al.  The EMBL-EBI search and sequence analysis tools APIs in 2019 , 2019, Nucleic Acids Res..

[51]  T. Lumley,et al.  gplots: Various R Programming Tools for Plotting Data , 2015 .

[52]  I. Weber,et al.  Prediction of HIV drug resistance from genotype with encoded three-dimensional protein structure , 2014, BMC Genomics.

[53]  Hong Huang Lin,et al.  Computer prediction of drug resistance mutations in proteins. , 2005, Drug discovery today.

[54]  A. Vandamme,et al.  Algorithms for the interpretation of HIV-1 genotypic drug resistance information. , 2006, Antiviral research.

[55]  X. Qiu,et al.  Recent developments of peptidomimetic HIV-1 protease inhibitors. , 2011, Current medicinal chemistry.

[56]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[57]  H. Field,et al.  General Mechanisms of Antiviral Resistance , 2010, Genetics and Evolution of Infectious Disease.

[58]  R. Shafer,et al.  Genotypic predictors of human immunodeficiency virus type 1 drug resistance , 2006, Proceedings of the National Academy of Sciences.

[59]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[60]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[61]  D. Powers Evaluation: From Precision, Recall and F-Factor to ROC, Informedness, Markedness & Correlation , 2008 .

[62]  Ge-Fei Hao,et al.  Computational mutation scanning and drug resistance mechanisms of HIV-1 protease inhibitors. , 2010, The journal of physical chemistry. B.