Commonality despite exceptional diversity in the baseline human antibody repertoire

In principle, humans can produce an antibody response to any non-self-antigen molecule in the appropriate context. This flexibility is achieved by the presence of a large repertoire of naive antibodies, the diversity of which is expanded by somatic hypermutation following antigen exposure1. The diversity of the naive antibody repertoire in humans is estimated to be at least 1012 unique antibodies2. Because the number of peripheral blood B cells in a healthy adult human is on the order of 5 × 109, the circulating B cell population samples only a small fraction of this diversity. Full-scale analyses of human antibody repertoires have been prohibitively difficult, primarily owing to their massive size. The amount of information encoded by all of the rearranged antibody and T cell receptor genes in one person—the ‘genome’ of the adaptive immune system—exceeds the size of the human genome by more than four orders of magnitude. Furthermore, because much of the B lymphocyte population is localized in organs or tissues that cannot be comprehensively sampled from living subjects, human repertoire studies have focused on circulating B cells3. Here we examine the circulating B cell populations of ten human subjects and present what is, to our knowledge, the largest single collection of adaptive immune receptor sequences described to date, comprising almost 3 billion antibody heavy-chain sequences. This dataset enables genetic study of the baseline human antibody repertoire at an unprecedented depth and granularity, which reveals largely unique repertoires for each individual studied, a subpopulation of universally shared antibody clonotypes, and an exceptional overall diversity of the antibody repertoire.A genetic study of the baseline human antibody repertoire, based on the circulating B cell populations of ten subjects, reveals universally shared antibody clonotypes within repertoires that are largely unique to the individual.

[1]  H. S. Horn,et al.  Measurement of "Overlap" in Comparative Ecological Studies , 1966, The American Naturalist.

[2]  A. Chao Estimating the population size for capture-recapture data with unequal catchability. , 1987, Biometrics.

[3]  A. Meyerhans,et al.  DNA recombination during PCR. , 1990, Nucleic acids research.

[4]  K. Rajewsky Clonal selection and learning in the antibody system , 1996, Nature.

[5]  A Tramontano,et al.  Conformations of the third hypervariable region in the VH domain of immunoglobulins. , 1998, Journal of molecular biology.

[6]  Generation of Antibody Diversity , 2002 .

[7]  M Hummel,et al.  Design and standardization of PCR primers and protocols for detection of clonal immunoglobulin and T-cell receptor gene recombinations in suspect lymphoproliferations: Report of the BIOMED-2 Concerted Action BMH4-CT98-3936 , 2003, Leukemia.

[8]  J. Liese,et al.  Reference values for B cell subpopulations from infancy to adulthood , 2010, Clinical and experimental immunology.

[9]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[10]  Daniel G. Brown,et al.  PANDAseq: paired-end assembler for illumina sequences , 2012, BMC Bioinformatics.

[11]  C. Nusbaum,et al.  High-Resolution Description of Antibody Heavy-Chain Repertoires in Humans , 2011, PloS one.

[12]  Wen-Han Hwang,et al.  Estimating the Richness of a Population When the Maximum Number of Classes Is Fixed: A Nonparametric Solution to an Archaeological Problem , 2012, PloS one.

[13]  Brett A. McKinney,et al.  Tissue-Specific Expressed Antibody Variable Gene Repertoires , 2014, PloS one.

[14]  George Georgiou,et al.  In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire , 2014, Nature Medicine.

[15]  Jens Meiler,et al.  Improving Loop Modeling of the Antibody Complementarity-Determining Region 3 Using Knowledge-Based Restraints , 2016, PloS one.

[16]  Dennis R. Burton,et al.  Clonify: unseeded antibody lineage assignment from next-generation sequencing data , 2016, Scientific Reports.

[17]  Anne Chao,et al.  Nonparametric Estimation and Comparison of Species Richness , 2016 .

[18]  Joseph Kaplinsky,et al.  Robust estimates of overall immune-repertoire diversity from high-throughput measurements on samples , 2016, Nature Communications.

[19]  Scott D Boyd,et al.  Deep sequencing and human antibody repertoire analysis. , 2016, Current opinion in immunology.

[20]  Bryan Briney,et al.  Zika virus activates de novo and cross-reactive memory B cell responses in dengue-experienced donors , 2017, Science Immunology.

[21]  IGoR: a tool for high-throughput immune repertoire analysis , 2017, 1705.08246.

[22]  Quentin Marcou,et al.  High-throughput immune repertoire analysis with IGoR , 2017, Nature Communications.

[23]  Lynn Morris,et al.  Multi-Donor Longitudinal Antibody Repertoire Sequencing Reveals the Existence of Public Antibody Clonotypes in HIV-1 Infection , 2018, Cell host & microbe.

[24]  Dennis R. Burton,et al.  Massively scalable genetic analysis of antibody repertoires , 2018, bioRxiv.