Detecting selection in immunoglobulin sequences

The ability to detect selection by analyzing mutation patterns in experimentally derived immunoglobulin (Ig) sequences is a critical part of many studies. Such techniques are useful not only for understanding the response to pathogens, but also for determining the role of antigen-driven selection in autoimmunity, B cell cancers and the diversification of pre-immune repertoires in certain species. Despite its importance, quantifying selection in experimentally derived sequences is fraught with difficulties. The parameters required for statistical tests (such as the expected frequency of replacement mutations in the absence of selection) are non-trivial to calculate, and results are not easily interpretable when more than a handful of sequences are analyzed. We have developed a web server that implements our previously proposed Focused binomial test for detecting selection. Several features are integrated into the web server to facilitate analysis, including V(D)J germline segment identification with IMGT alignment, batch submission of sequences and integration of additional test statistics proposed by other groups. We also implement a Z-score-based statistic that increases the power to detect selection while maintaining specificity, and further allows for the combined analysis of sequences from different germlines. The tool is freely available at http://clip.med.yale.edu/selection.
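To make the statistical idea concrete, the following is a minimal sketch of a per-sequence binomial test followed by a weighted Z-score combination across sequences. It is not the web server's implementation. The function names, the example counts, the expected replacement probabilities and the choice to weight each sequence by its mutation count are all assumptions made for illustration; in practice the expected probabilities would have to come from a model of mutation in the absence of selection.

```python
from math import comb, sqrt
from statistics import NormalDist


def binom_upper_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p): a one-sided test for an excess."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def p_to_z(p_value, eps=1e-12):
    """Convert a one-sided p-value to a Z-score (small p gives large positive Z)."""
    clipped = min(max(p_value, eps), 1 - eps)  # keep inv_cdf inside its open domain
    return NormalDist().inv_cdf(1 - clipped)


def weighted_z(z_scores, weights):
    """Combine Z-scores as sum(w * z) / sqrt(sum(w^2))."""
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    return numerator / sqrt(sum(w * w for w in weights))


# Hypothetical input, one tuple per sequence:
# (observed replacement mutations in the region of interest,
#  total mutations counted by the test,
#  expected probability of a replacement in that region without selection)
sequences = [(6, 10, 0.35), (4, 9, 0.30), (7, 12, 0.40)]

p_values = [binom_upper_tail(k, n, p) for k, n, p in sequences]
z_scores = [p_to_z(pv) for pv in p_values]
weights = [n for _, n, _ in sequences]  # assumption: weight by mutation count

combined_z = weighted_z(z_scores, weights)
combined_p = 1 - NormalDist().cdf(combined_z)

for (k, n, p), pv in zip(sequences, p_values):
    print(f"k={k} n={n} expected={p:.2f} p-value={pv:.4f}")
print(f"combined Z={combined_z:.3f} combined p-value={combined_p:.4f}")
```

Because each sequence is tested against its own expected probability before the scores are pooled, sequences derived from different germlines can enter the same combined statistic, which is the property the abstract highlights.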
