Antibody interface prediction with 3D Zernike descriptors and SVM

MOTIVATION Antibodies are a class of proteins capable of specifically recognizing and binding to a virtually infinite number of antigens. This binding malleability makes them the most valuable category of biopharmaceuticals for both diagnostic and therapeutic applications. The correct identification of the antigen-binding residues in the antibody is crucial for all antibody design and engineering techniques and could also help to understand the complex antigen binding mechanisms. However, the antibody-binding interface prediction field appears to be still rather underdeveloped. RESULTS We present a novel method for antibody interface prediction from their experimentally solved structures based on 3D Zernike Descriptors. Roto-translationally invariant descriptors are computed from circular patches of the antibody surface enriched with a chosen subset of physico-chemical properties from the AAindex1 amino acid index set, and are used as samples for a binary classification problem. An SVM classifier is used to distinguish interface surface patches from non-interface ones. The proposed method was shown to outperform other antigen-binding interface prediction software. AVAILABILITY AND IMPLEMENTATION Linux binaries and Python scripts are available at https://github.com/sebastiandaberdaku/AntibodyInterfacePrediction. The datasets generated and/or analyzed during the current study are available at https://doi.org/10.6084/m9.figshare.5442229. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Neelesh Soni,et al.  In silico methods for design of biological therapeutics. , 2017, Methods.

[2]  Daisuke Kihara,et al.  Potential for Protein Surface Shape Analysis Using Spherical Harmonics and 3D Zernike Descriptors , 2009, Cell Biochemistry and Biophysics.

[3]  Sebastian Kelm,et al.  SAbPred: a structure-based antibody prediction server , 2016, Nucleic Acids Res..

[4]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[5]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[6]  Brian D. Weitzner,et al.  Modeling and docking of antibody structures with Rosetta , 2017, Nature Protocols.

[7]  K. Lindpaintner,et al.  Antibody and Antigen Contact Residues Define Epitope and Paratope Size and Structure , 2013, The Journal of Immunology.

[8]  M. L. Connolly Analytical molecular surface calculation , 1983 .

[9]  Costas D. Maranas,et al.  OptMAVEn – A New Framework for the de novo Design of Antibody Variable Region Models Targeting Specific Antigen Epitopes , 2014, PloS one.

[10]  Zhengwei Zhu,et al.  CD-HIT: accelerated for clustering the next-generation sequencing data , 2012, Bioinform..

[11]  Jean-Christophe Nebel,et al.  Progress and challenges in predicting protein interfaces , 2015, Briefings Bioinform..

[12]  Ruth Nussinov,et al.  Taking geometry to its edge: Fast unbound rigid (and hinge‐bent) docking , 2003, Proteins.

[13]  Minoru Kanehisa,et al.  AAindex: amino acid index database, progress report 2008 , 2007, Nucleic Acids Res..

[14]  B. Li,et al.  Rapid comparison of properties on protein surface , 2008, Proteins.

[15]  Andrew C. R. Martin,et al.  AbDb: antibody structure database—a database of PDB-derived antibody structures , 2018, Database J. Biol. Databases Curation.

[16]  Carlo Ferrari,et al.  Computing voxelised representations of macromolecular surfaces , 2018, Int. J. High Perform. Comput. Appl..

[17]  Daisuke Kihara,et al.  Protein-protein docking using region-based 3D Zernike descriptors , 2009, BMC Bioinformatics.

[18]  C. Dumontet,et al.  Strategies and challenges for the next generation of antibody–drug conjugates , 2017, Nature Reviews Drug Discovery.

[19]  L. Cavacini,et al.  Structure and function of immunoglobulins. , 2010, The Journal of allergy and clinical immunology.

[20]  Carlo Ferrari,et al.  Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction , 2018, BMC Bioinformatics.

[21]  Yanay Ofran,et al.  Paratome: an online tool for systematic identification of antigen-binding regions in antibodies based on sequence or structure , 2012, Nucleic Acids Res..

[22]  C. Oostenbrink,et al.  Antibody humanization by molecular dynamics simulations—in‐silico guided selection of critical backmutations , 2016, Journal of molecular recognition : JMR.

[23]  A. Lesk,et al.  Canonical structures for the hypervariable regions of immunoglobulins. , 1987, Journal of molecular biology.

[24]  Yanay Ofran,et al.  Structural Consensus among Antibodies Defines the Antigen Binding Site , 2012, PLoS Comput. Biol..

[25]  Janice M. Reichert,et al.  Antibodies to watch in 2017 , 2016, mAbs.

[26]  Bin Li,et al.  Fast protein tertiary structure retrieval based on global surface shape similarity , 2008, Proteins.

[27]  A. Lesk,et al.  Standard conformations for the canonical structures of immunoglobulins. , 1997, Journal of molecular biology.

[28]  Klaus Schulten,et al.  Computational de novo design of antibodies binding to a peptide with high affinity , 2017, Biotechnology and bioengineering.

[29]  Paolo Marcatili,et al.  Prediction of site-specific interactions in antibody-antigen complexes: the proABC method and server , 2013, Bioinform..

[30]  Andrew J. Martin,et al.  Antibody-antigen interactions: contact analysis and binding site topography. , 1996, Journal of molecular biology.

[31]  Alfonso Valencia,et al.  Progress and challenges in predicting protein-protein interaction sites , 2008, Briefings Bioinform..

[32]  T. T. Wu,et al.  AN ANALYSIS OF THE SEQUENCES OF THE VARIABLE REGIONS OF BENCE JONES PROTEINS AND MYELOMA LIGHT CHAINS AND THEIR IMPLICATIONS FOR ANTIBODY COMPLEMENTARITY , 1970, The Journal of experimental medicine.

[33]  Joan Wicks,et al.  Novel anti-Sialyl-Tn monoclonal antibodies and antibody-drug conjugates demonstrate tumor specificity and anti-tumor activity , 2017, mAbs.

[34]  Daisuke Kihara,et al.  3D-SURFER: software for high-throughput protein surface comparison and analysis , 2009, Bioinform..

[35]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[36]  Pietro Liò,et al.  Parapred: antibody paratope prediction using convolutional and recurrent neural networks , 2018, Bioinform..

[37]  Xiaolian Gao,et al.  Effective Optimization of Antibody Affinity by Phage Display Integrated with High-Throughput DNA Synthesis and Sequencing Technologies , 2015, PloS one.

[38]  Olan Dolezal,et al.  i-bodies, Human Single Domain Antibodies That Antagonize Chemokine Receptor CXCR4* , 2016, The Journal of Biological Chemistry.

[39]  Carlo Ferrari,et al.  Computing Discrete Fine-Grained Representations of Protein Surfaces , 2015, CIBB.

[40]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[41]  C. Deane,et al.  Antibody i-Patch prediction of the antibody binding site improves rigid local antibody-antigen docking. , 2013, Protein engineering, design & selection : PEDS.

[42]  Oberdan Leo,et al.  Key concepts in immunology. , 2010, Vaccine.

[43]  Jiye Shi,et al.  ABodyBuilder: Automated antibody structure prediction with data–driven accuracy estimation , 2016, mAbs.

[44]  Kouhei Tsumoto,et al.  Affinity Improvement of a Therapeutic Antibody by Structure-Based Computational Design: Generation of Electrostatic Interactions in the Transition State Stabilizes the Antibody-Antigen Complex , 2014, PloS one.

[45]  Haruki Nakamura,et al.  Computer-aided antibody design , 2012, Protein engineering, design & selection : PEDS.

[46]  Mathieu Rouard,et al.  IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains. , 2005, Developmental and comparative immunology.

[47]  P. Trail,et al.  Antibody drug conjugates for treatment of breast cancer: Novel targets and diverse approaches in ADC design. , 2018, Pharmacology & therapeutics.