ABodyBuilder: Automated antibody structure prediction with data–driven accuracy estimation

ABSTRACT Computational modeling of antibody structures plays a critical role in therapeutic antibody design. Several antibody modeling pipelines exist, but no freely available methods currently model nanobodies, provide estimates of expected model accuracy, or highlight potential issues with the antibody's experimental development. Here, we describe our automated antibody modeling pipeline, ABodyBuilder, designed to overcome these issues. The algorithm itself follows the standard 4 steps of template selection, orientation prediction, complementarity-determining region (CDR) loop modeling, and side chain prediction. ABodyBuilder then annotates the ‘confidence’ of the model as a probability that a component of the antibody (e.g., CDRL3 loop) will be modeled within a root–mean square deviation threshold. It also flags structural motifs on the model that are known to cause issues during in vitro development. ABodyBuilder was tested on 4 separate datasets, including the 11 antibodies from the Antibody Modeling Assessment–II competition. ABodyBuilder builds models that are of similar quality to other methodologies, with sub–Angstrom predictions for the ‘canonical’ CDR loops. Its ability to model nanobodies, and rapidly generate models (∼30 seconds per model) widens its potential usage. ABodyBuilder can also help users in decision–making for the development of novel antibodies because it provides model confidence and potential sequence liabilities. ABodyBuilder is freely available at http://opig.stats.ox.ac.uk/webapps/abodybuilder.

[1]  Haruki Nakamura,et al.  Kotai Antibody Builder: automated high-resolution structural modeling of antibodies , 2014, Bioinform..

[2]  Ylva Gavel,et al.  Sequence differences between glycosylated and non-glycosylated Asn-X-Thr/Ser acceptor sites: implications for protein engineering , 1990, Protein engineering.

[3]  Charlotte M. Deane,et al.  ANARCI: antigen receptor numbering and receptor classification , 2015, Bioinform..

[4]  Lisa Yan,et al.  Automated antibody structure prediction using Accelrys tools: Results and best practices , 2014, Proteins.

[5]  Yoonjoo Choi,et al.  FREAD revisited: Accurate loop structure prediction using a database search algorithm , 2010, Proteins.

[6]  V. Giudicelli,et al.  IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. , 2003, Developmental and comparative immunology.

[7]  I. Benhar,et al.  Selection of antibodies from synthetic antibody libraries. , 2012, Archives of biochemistry and biophysics.

[8]  Apollon Papadimitriou,et al.  Structure-Based Prediction of Asparagine and Aspartate Degradation Sites in Antibody Variable Regions , 2014, PloS one.

[9]  Jeffrey J. Gray,et al.  Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires , 2016, Proceedings of the National Academy of Sciences.

[10]  S. Quake,et al.  The promise and challenge of high-throughput sequencing of the antibody repertoire , 2014, Nature Biotechnology.

[11]  Haruki Nakamura,et al.  High‐resolution modeling of antibody structures by a combination of bioinformatics, expert knowledge, and molecular simulations , 2014, Proteins.

[12]  C. Deane,et al.  ABangle: characterising the VH-VL orientation in antibodies. , 2013, Protein engineering, design & selection : PEDS.

[13]  Inbal Sela-Culang,et al.  The Structural Basis of Antibody-Antigen Recognition , 2013, Front. Immunol..

[14]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[15]  C. Deane,et al.  CODA: A combined algorithm for predicting the structurally variable regions of protein models , 2001, Protein science : a publication of the Protein Society.

[16]  W. Robinson Sequencing the functional antibody repertoire—diagnostic and therapeutic discovery , 2015, Nature Reviews Rheumatology.

[17]  Alexey Teplyakov,et al.  Antibody modeling assessment II. Structures and models , 2014, Proteins.

[18]  T. T. Wu,et al.  AN ANALYSIS OF THE SEQUENCES OF THE VARIABLE REGIONS OF BENCE JONES PROTEINS AND MYELOMA LIGHT CHAINS AND THEIR IMPLICATIONS FOR ANTIBODY COMPLEMENTARITY , 1970, The Journal of experimental medicine.

[19]  Roland L. Dunbrack,et al.  A new clustering of antibody CDR loop conformations. , 2011, Journal of molecular biology.

[20]  Thomas B. Kepler,et al.  Affinity maturation in an HIV broadly neutralizing B-cell lineage through reorientation of variable domains , 2014, Proceedings of the National Academy of Sciences.

[21]  Marco Biasini,et al.  pv: v1.8.1 , 2015 .

[22]  Harald Kolmar,et al.  Single-domain antibodies for biomedical applications , 2016, Immunopharmacology and immunotoxicology.

[23]  Anna Tramontano,et al.  A database of immunoglobulins with integrated tools: DIGIT , 2011, Nucleic Acids Res..

[24]  Asher Mullard 2014 FDA drug approvals , 2015, Nature Reviews Drug Discovery.

[25]  Brian D. Weitzner,et al.  Blind prediction performance of RosettaAntibody 3.0: Grafting, relaxation, kinematic loop modeling, and full CDR optimization , 2014, Proteins.

[26]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[27]  Haruki Nakamura,et al.  Computer-aided antibody design , 2012, Protein engineering, design & selection : PEDS.

[28]  Anna Tramontano,et al.  Antibody structural modeling with prediction of immunoglobulin structure (PIGS) , 2014 .

[29]  A. Bujotzek,et al.  MoFvAb: Modeling the Fv region of antibodies , 2015, mAbs.

[30]  Daniel Seeliger,et al.  Boosting antibody developability through rational sequence optimization , 2015, mAbs.

[31]  James Dunbar,et al.  Prediction of VH–VL domain orientation for antibody variable domain modeling , 2015, Proteins.

[32]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[33]  Manuel Berrondo,et al.  Automated Aufbau of antibody structures from given sequences using Macromoltek's SmrtMolAntibody , 2014, Proteins.

[34]  Li Yang,et al.  Structural characterization of N-linked oligosaccharides on monoclonal antibody cetuximab by the combination of orthogonal matrix-assisted laser desorption/ionization hybrid quadrupole-quadrupole time-of-flight tandem mass spectrometry and sequential enzymatic digestion. , 2007, Analytical biochemistry.

[35]  Richard Friesner,et al.  Antibody structure determination using a combination of homology modeling, energy‐based refinement, and loop prediction , 2014, Proteins.

[36]  P. Labute,et al.  Antibody modeling assessment , 2011, Proteins.

[37]  Jiye Shi,et al.  SAbDab: the structural antibody database , 2013, Nucleic Acids Res..

[38]  Paolo Marcatili,et al.  PIGS: automatic prediction of antibody structures , 2008, Bioinform..

[39]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[40]  A. Lesk,et al.  Canonical structures for the hypervariable regions of immunoglobulins. , 1987, Journal of molecular biology.

[41]  Charlotte M Deane,et al.  Predicting antibody complementarity determining region structures without classification. , 2011, Molecular bioSystems.

[42]  Jeffrey J. Gray,et al.  Toward high‐resolution homology modeling of antibody Fv regions and application to antibody–antigen docking , 2009, Proteins.

[43]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[44]  Guy Georges,et al.  VH-VL orientation prediction for antibody humanization candidate selection: A case study , 2016, mAbs.

[45]  George Georgiou,et al.  In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire , 2014, Nature Medicine.

[46]  H. Kettenberger,et al.  Developability assessment during the selection of novel therapeutic antibodies. , 2015, Journal of pharmaceutical sciences.

[47]  Juan C Almagro,et al.  Second antibody modeling assessment (AMA‐II) , 2014, Proteins.