IgSimulator: a versatile immunosequencing simulator

MOTIVATION The recent introduction of next-generation sequencing technologies to antibody studies have resulted in a growing number of immunoinformatics tools for antibody repertoire analysis. However, benchmarking these newly emerging tools remains problematic since the gold standard datasets that are needed to validate these tools are typically not available. RESULTS Since simulating antibody repertoires is often the only feasible way to benchmark new immunoinformatics tools, we developed the IgSimulator tool that addresses various complications in generating realistic antibody repertoires. IgSimulator's code has modular structure and can be easily adapted to new requirements to simulation. AVAILABILITY AND IMPLEMENTATION IgSimulator is open source and freely available as a C++ and Python program running on all Unix-compatible platforms. The source code is available from yana-safonova.github.io/ig_simulator. CONTACT safonova.yana@gmail.com SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Thierry Mora,et al.  Statistical inference of the generation probability of T-cell receptors from sequence repertoires , 2012, Proceedings of the National Academy of Sciences.

[2]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[3]  Klaus Rajewsky,et al.  Somatic hypermutation in normal and transformed human B cells , 1998, Immunological reviews.

[4]  Jordan R. Willis,et al.  Frequency and genetic characterization of V(DD)J recombinants in the human peripheral blood antibody repertoire , 2012, Immunology.

[5]  R. White,et al.  High-Throughput Sequencing of the Zebrafish Antibody Repertoire , 2009, Science.

[6]  Pavel A. Pevzner,et al.  Immunoglobulin Classification Using the Colored Antibody Graph , 2015, RECOMB.

[7]  Leping Li,et al.  ART: a next-generation sequencing read simulator , 2012, Bioinform..

[8]  N A Kolchanov,et al.  Somatic hypermutagenesis in immunoglobulin genes. II. Influence of neighbouring base sequences on mutagenesis. , 1992, Biochimica et biophysica acta.

[9]  M. Egholm,et al.  Measurement and Clinical Monitoring of Human Lymphocyte Clonality by Massively Parallel V-D-J Pyrosequencing , 2009, Science Translational Medicine.

[10]  Mikhail Shugay,et al.  Towards error-free profiling of immune repertoires , 2014, Nature Methods.

[11]  Marie-Paule Lefranc,et al.  IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis , 2008, Nucleic Acids Res..

[12]  K. P. Murphy,et al.  Janeway's immunobiology , 2007 .