The origin of CDR H3 structural diversity.

Antibody complementarity-determining region (CDR) H3 loops are critical for adaptive immunological functions. Although the other five CDR loops adopt predictable canonical structures, H3 conformations have proven unclassifiable, apart from an unusual C-terminal "kink" present in most antibodies. To determine why the majority of H3 loops are kinked and to learn whether non-antibody proteins have loop structures similar to those of H3, we searched a set of 15,679 high-quality non-antibody structures for regions geometrically similar to the residues immediately surrounding the loop. By incorporating the kink into our search, we identified 1,030 H3-like loops from 632 protein families. Some protein families, including PDZ domains, appear to use the identified region for recognition and binding. Our results suggest that the kink is conserved in the immunoglobulin heavy-chain fold because it disrupts the β-strand pairing at the base of the loop. Thus, the kink is a critical driver of the observed structural diversity in CDR H3.
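
The abstract does not state how geometric similarity was measured. As a rough illustration only, the sketch below assumes that a short backbone region is described by a pseudo bond angle and a pseudo dihedral angle computed from four consecutive C-alpha positions, and that a candidate region counts as similar when both descriptors fall within a tolerance of the query. The function names, the choice of descriptors, and the 20-degree tolerance are all hypothetical and are not taken from the study.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def pseudo_angle(p0, p1, p2):
    """Angle in degrees at p1 formed by three points (e.g. C-alpha atoms)."""
    u, v = _sub(p0, p1), _sub(p2, p1)
    cos_t = _dot(u, v) / math.sqrt(_dot(u, u) * _dot(v, v))
    # Clamp to guard against rounding slightly outside [-1, 1].
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_t))))

def pseudo_dihedral(p0, p1, p2, p3):
    """Signed dihedral in degrees for four points, via the atan2 formula."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

def angular_diff(a, b):
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)

def is_geometrically_similar(query, candidate, tol_deg=20.0):
    """Compare two sets of four (x, y, z) points; tol_deg is an arbitrary choice."""
    angle_ok = abs(pseudo_angle(*query[:3]) - pseudo_angle(*candidate[:3])) <= tol_deg
    dihedral_ok = angular_diff(pseudo_dihedral(*query),
                               pseudo_dihedral(*candidate)) <= tol_deg
    return angle_ok and dihedral_ok
```

In a search of this kind, `is_geometrically_similar` would be applied to every four-residue window of each candidate structure, with the query geometry taken from a reference antibody. Angle-based descriptors are used here because they do not require superposing the two structures first.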
