Clonify: unseeded antibody lineage assignment from next-generation sequencing data

Defining the dynamics and maturation processes of antibody clonal lineages is crucial to understanding the humoral response to infection and immunization. Although individual antibody lineages have been previously analyzed in isolation, these studies provide only a narrow view of the total antibody response. Comprehensive study of antibody lineages has been limited by the lack of an accurate clonal lineage assignment algorithm capable of operating on next-generation sequencing datasets. To address this shortcoming, we developed Clonify, which is able to perform unseeded lineage assignment on very large sets of antibody sequences. Application of Clonify to IgG+ memory repertoires from healthy individuals revealed a surprising lack of influence of large extended lineages on the overall repertoire composition, indicating that this composition is driven less by the order and frequency of pathogen encounters than previously thought. Clonify is freely available at www.github.com/briney/clonify-python.

[1]  Chaim A. Schramm,et al.  Developmental pathway for potent V1V2-directed HIV-neutralizing antibodies , 2014, Nature.

[2]  Ning Ma,et al.  IgBLAST: an immunoglobulin variable domain sequence analysis tool , 2013, Nucleic Acids Res..

[3]  Tongqing Zhou,et al.  Structure and immune recognition of trimeric prefusion HIV-1 Env , 2014, Nature.

[4]  Tongqing Zhou,et al.  PGV04, an HIV-1 gp120 CD4 Binding Site Antibody, Is Broad and Potent in Neutralization but Does Not Induce Conformational Changes Characteristic of CD4 , 2012, Journal of Virology.

[5]  Daphne Koller,et al.  The Effects of Somatic Hypermutation on Neutralization and Binding in the PGT121 Family of Broadly Neutralizing HIV Antibodies , 2013, PLoS pathogens.

[6]  W. Koff,et al.  Toward a Human Vaccines Project , 2014, Nature Immunology.

[7]  C. Nusbaum,et al.  High-Resolution Description of Antibody Heavy-Chain Repertoires in Humans , 2011, PloS one.

[8]  Pham Phung,et al.  Broad neutralization coverage of HIV by multiple highly potent antibodies , 2011, Nature.

[9]  J. Mullikin,et al.  Somatic Populations of PGT135–137 HIV-1-Neutralizing Antibodies Identified by 454 Pyrosequencing and Bioinformatics , 2012, Front. Microbio..

[10]  Mario Roederer,et al.  Rational Design of Envelope Identifies Broadly Neutralizing Human Monoclonal Antibodies to HIV-1 , 2010, Science.

[11]  S. Quake,et al.  The promise and challenge of high-throughput sequencing of the antibody repertoire , 2014, Nature Biotechnology.

[12]  Ron Diskin,et al.  Sequence and Structural Convergence of Broad and Potent HIV Antibodies That Mimic CD4 Binding , 2011, Science.

[13]  V. Pascual,et al.  Amino acid insertions and deletions contribute to diversify the human Ig repertoire , 1998, Immunological reviews.

[14]  Cassandra B. Jabara,et al.  Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID , 2011, Proceedings of the National Academy of Sciences.

[15]  Tongqing Zhou,et al.  De novo identification of VRC01 class HIV-1–neutralizing antibodies by next-generation sequencing of B-cell transcripts , 2013, Proceedings of the National Academy of Sciences.

[16]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[17]  Baoshan Zhang,et al.  Mining the antibodyome for HIV-1–neutralizing antibodies with next-generation sequencing and phylogenetic pairing of heavy/light chains , 2013, Proceedings of the National Academy of Sciences.

[18]  B A McKinney,et al.  High-throughput antibody sequencing reveals genetic evidence of global regulation of the naïve and memory repertoires that extends across individuals , 2012, Genes and Immunity.

[19]  James E Crowe,et al.  Impact of new sequencing technologies on studies of the human B cell repertoire. , 2013, Current opinion in immunology.

[20]  D. Sheward,et al.  Degenerate Primer IDs and the Birthday Problem , 2012, Proceedings of the National Academy of Sciences.

[21]  Dennis R. Burton,et al.  Toward a more accurate view of human B-cell repertoire by next-generation sequencing, unbiased repertoire capture and single-molecule barcoding , 2014, Scientific Reports.

[22]  J. Crowe,et al.  Evidence for preferential Ig gene usage and differential TdT and exonuclease activities in human naïve and memory B cells. , 2007, Molecular immunology.

[23]  Wayne C Koff,et al.  Broadly neutralizing HIV antibodies define a glycan-dependent epitope on the prefusion conformation of gp41 on cleaved envelope trimers. , 2014, Immunity.

[24]  J. Crowe,et al.  Immunodominance of the VH1–46 Antibody Gene Segment in the Primary Repertoire of Human Rotavirus-Specific B Cells Is Reduced in the Memory Compartment through Somatic Mutation of Nondominant Clones1 , 2008, The Journal of Immunology.

[25]  Baoshan Zhang,et al.  Broad and potent neutralization of HIV-1 by a gp41-specific human antibody , 2012, Nature.

[26]  D. Koller,et al.  High-resolution antibody dynamics of vaccine-induced immune responses , 2014, Proceedings of the National Academy of Sciences.

[27]  James E. Crowe,et al.  Location and length distribution of somatic hypermutation-associated DNA insertions and deletions reveals regions of antibody structural plasticity , 2012, Genes and Immunity.

[28]  Philip R. Johnson,et al.  Accelerating Next-Generation Vaccine Development for Global Disease Prevention , 2013, Science.

[29]  Chaim A. Schramm,et al.  Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus , 2013, Nature.

[30]  Mario Roederer,et al.  Focused Evolution of HIV-1 Neutralizing Antibodies Revealed by Structures and Deep Sequencing , 2011, Science.

[31]  Young Do Kwon,et al.  Multidonor analysis reveals structural elements, genetic determinants, and maturation pathway for HIV-1 neutralization by VRC01-class antibodies. , 2013, Immunity.

[32]  Mark M. Davis,et al.  Human responses to influenza vaccination show seroconversion signatures and convergent antibody rearrangements. , 2014, Cell host & microbe.

[33]  William R. Schief,et al.  Promiscuous Glycan Site Recognition by Antibodies to the High-Mannose Patch of gp120 Broadens Neutralization of HIV , 2014, Science Translational Medicine.

[34]  John R Mascola,et al.  Antibody responses to envelope glycoproteins in HIV-1 infection , 2015, Nature Immunology.

[35]  Mark M. Davis,et al.  Lineage Structure of the Human Antibody Repertoire in Response to Influenza Vaccination , 2013, Science Translational Medicine.

[36]  Daniel G. Brown,et al.  PANDAseq: paired-end assembler for illumina sequences , 2012, BMC Bioinformatics.

[37]  Pham Phung,et al.  Broad and Potent Neutralizing Antibodies from an African Donor Reveal a New HIV-1 Vaccine Target , 2009, Science.

[38]  Scott Boyd,et al.  Benchmarking the performance of human antibody gene alignment utilities using a 454 sequence dataset , 2010, Bioinform..

[39]  Elizabeth Ernestina Godoy-Lozano,et al.  Reconstructing and mining the B cell repertoire with ImmunediveRsity , 2015, mAbs.

[40]  John P. Moore,et al.  Recombinant HIV envelope trimer selects for quaternary-dependent antibodies targeting the trimer apex , 2014, Proceedings of the National Academy of Sciences.

[41]  Stephen R. Quake,et al.  Genetic measurement of memory B-cell recall using antibody repertoire sequencing , 2013, Proceedings of the National Academy of Sciences.

[42]  S. Zolla-Pazner,et al.  Preferential use of the VH5-51 gene segment by the human immune response to code for antibodies against the V3 domain of HIV-1. , 2009, Molecular immunology.