Prospective identification of parasitic sequences in phage display screens

Phage display empowered the development of proteins with new function and ligands for clinically relevant targets. In this report, we use next-generation sequencing to analyze phage-displayed libraries and uncover a strong bias induced by amplification preferences of phage in bacteria. This bias favors fast-growing sequences that collectively constitute <0.01% of the available diversity. Specifically, a library of 109 random 7-mer peptides (Ph.D.-7) includes a few thousand sequences that grow quickly (the ‘parasites’), which are the sequences that are typically identified in phage display screens published to date. A similar collapse was observed in other libraries. Using Illumina and Ion Torrent sequencing and multiple biological replicates of amplification of Ph.D.-7 library, we identified a focused population of 770 ‘parasites’. In all, 197 sequences from this population have been identified in literature reports that used Ph.D.-7 library. Many of these enriched sequences have confirmed function (e.g. target binding capacity). The bias in the literature, thus, can be viewed as a selection with two different selection pressures: (i) target-binding selection, and (ii) amplification-induced selection. Enrichment of parasitic sequences could be minimized if amplification bias is removed. Here, we demonstrate that emulsion amplification in libraries of ∼106 diverse clones prevents the biased selection of parasitic clones.

[1]  K. A. Noren,et al.  Construction of high-complexity combinatorial phage display peptide libraries. , 2001, Methods.

[2]  D. Marvin,et al.  Role of capsid structure and membrane protein processing in determining the size and copy number of peptides displayed on the major coat protein of filamentous bacteriophage. , 1996, Journal of molecular biology.

[3]  Benjamin Bolduc,et al.  A target-unrelated peptide in an M13 phage display library traced to an advantageous mutation in the gene II ribosome-binding site. , 2008, Analytical biochemistry.

[4]  W. Delano,et al.  Convergent solutions to binding at a protein-protein interface. , 2000, Science.

[5]  Ping Zhu,et al.  MimoDB 2.0: a mimotope database and beyond , 2011, Nucleic Acids Res..

[6]  S. Dübel,et al.  Mutations in the N-Terminus of the Major Coat Protein (pVIII, gp8) of Filamentous Bacteriophage Affect Infectivity , 2003, Journal of Molecular Microbiology and Biotechnology.

[7]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[8]  R. Barrett,et al.  Peptides on phage: a vast library of peptides for identifying ligands. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Lee Makowski,et al.  One from column A and two from column B: the benefits of phage display in molecular-recognition studies. , 2002, Current opinion in chemical biology.

[10]  H. Mihara,et al.  Dense surface functionalization using peptides that recognize differences in organized structures of self-assembling nanomaterials. , 2012, Molecular bioSystems.

[11]  Jesse J. Salk,et al.  Detection of ultra-rare mutations by next-generation sequencing , 2012, Proceedings of the National Academy of Sciences.

[12]  U. Sack,et al.  Human Monoclonal Rheumatoid Synovial B Lymphocyte Hybridoma with a New Disease-Related Specificity for Cartilage Oligomeric Matrix Protein1 , 2001, The Journal of Immunology.

[13]  Feng-Biao Guo,et al.  MimoDB: a New Repository for Mimotope Data Derived from Phage Display Technology , 2010, Molecules.

[14]  W. Dower,et al.  Membrane insertion defects caused by positive charges in the early mature region of protein pIII of filamentous phage fd can be corrected by prlA suppressors , 1994, Journal of bacteriology.

[15]  J. Scott,et al.  Searching for peptide ligands with an epitope library. , 1990, Science.

[16]  B. Finlay,et al.  Phage display: applications, innovations, and issues in phage and host biology. , 1998, Canadian journal of microbiology.

[17]  A. Folgori,et al.  A general strategy to identify mimotopes of pathological antigens using only random peptide libraries and human sera. , 1994, The EMBO journal.

[18]  J. Bernués,et al.  HMGB1 Interacts with Many Apparently Unrelated Proteins by Recognizing Short Amino Acid Sequences* , 2002, The Journal of Biological Chemistry.

[19]  Sindy K. Y. Tang,et al.  Diversity of Phage-Displayed Libraries of Peptides during Panning and Amplification , 2011, Molecules.

[20]  Jamie K. Scott,et al.  The nature of target-unrelated peptides recovered in the screening of phage-displayed random peptide libraries with antibodies. , 2005, Analytical biochemistry.

[21]  A. Theberge,et al.  Microdroplets in microfluidics: an evolving platform for discoveries in chemistry and biology. , 2010, Angewandte Chemie.

[22]  Timur Shtatland,et al.  PepBank - a database of peptides based on sequence text mining and public peptide data sources , 2007, BMC Bioinformatics.

[23]  Andreas Plückthun,et al.  Signal sequences directing cotranslational translocation expand the range of proteins amenable to phage display , 2006, Nature Biotechnology.

[24]  V. Petrenko,et al.  Diversity and censoring of landscape phage libraries. , 2009, Protein engineering, design & selection : PEDS.

[25]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[26]  Lei Wang,et al.  Capillary-composited microfluidic device for heat shock transformation of Escherichia coli. , 2011, Journal of bioscience and bioengineering.

[27]  Ratmir Derda,et al.  High-throughput discovery of synthetic surfaces that support proliferation of pluripotent cells. , 2010, Journal of the American Chemical Society.

[28]  Cristian S. Calude,et al.  Proceedings of the Workshop on Multiset Processing: Multiset Processing, Mathematical, Computer Science, and Molecular Computing Points of View , 2000 .

[29]  Ratmir Derda,et al.  Deep sequencing analysis of phage libraries using Illumina platform. , 2012, Methods.

[30]  Sindy K. Y. Tang,et al.  Uniform amplification of phage display libraries in monodisperse emulsions. , 2012, Methods.

[31]  Lei Wang,et al.  Heat-shock transformation of Escherichia coli in nanolitre droplets formed in a capillary-composited microfluidic device , 2011 .

[32]  Lee Makowski,et al.  Quantitative assessment of peptide sequence diversity in M13 combinatorial peptide phage display libraries. , 2002, Journal of molecular biology.

[33]  A. Fierabracci Unravelling autoimmune pathogenesis by screening random peptide libraries with human sera. , 2009, Immunology letters.

[34]  W. Thomas,et al.  Corruption of phage display libraries by target-unrelated clones: diagnosis and countermeasures. , 2010, Analytical biochemistry.

[35]  R. Hosse,et al.  In vitro display technologies reveal novel biopharmaceutics , 2006, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[36]  Wei Li,et al.  Multiple modular microfluidic (M3) reactors for the synthesis of polymer particles. , 2009, Lab on a chip.

[37]  Margaret C. Linak,et al.  Sequence-specific error profile of Illumina sequencers , 2011, Nucleic acids research.

[38]  Sindy K. Y. Tang,et al.  Uniform amplification of phage with different growth characteristics in individual compartments consisting of monodisperse droplets. , 2010, Angewandte Chemie.

[39]  E. Seidemann,et al.  Probability model for molecular recognition in biological receptor repertoires: significance to the olfactory system. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Kim-Anh Do,et al.  Steps toward mapping the human vasculature by phage display , 2002, Nature Medicine.

[41]  A. Meola,et al.  Selection of antigenic and immunogenic mimics of hepatitis C virus using sera from patients. , 1996, Journal of immunology.

[42]  Tal Pupko,et al.  Deep Panning: Steps towards Probing the IgOme , 2012, PloS one.

[43]  M. Robinson,et al.  Small-sample estimation of negative binomial dispersion, with applications to SAGE data. , 2007, Biostatistics.

[44]  Lee Makowski,et al.  Estimating the diversity of peptide populations from limited sequence data , 2003, Bioinform..

[45]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[46]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[47]  Doris Chen,et al.  Monitoring Genomic Sequences during SELEX Using High-Throughput Sequencing: Neutral SELEX , 2010, PloS one.

[48]  Selection of peptides that target the aminoacyl-tRNA site of bacterial 16S ribosomal RNA. , 2009, Biochemistry.

[49]  R R Breaker,et al.  Emergence of a replicating species from an in vitro RNA evolution reaction. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[50]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[51]  Johan T den Dunnen,et al.  Phage display screening without repetitious selection rounds. , 2012, Analytical biochemistry.

[52]  J. Reichert,et al.  Development trends for human monoclonal antibody therapeutics , 2010, Nature Reviews Drug Discovery.

[53]  Amy E. Keating Methods in protein design , 2013 .

[54]  L. Work,et al.  Development of renal-targeted vectors through combined in vivo phage display and capsid engineering of adenoviral fibers from serotype 19p. , 2007, Molecular therapy : the journal of the American Society of Gene Therapy.

[55]  M. Nahm,et al.  Monoclonal Antibodies Specific for Neisseria meningitidis Group B Polysaccharide and Their Peptide Mimotopes , 2001, Infection and Immunity.

[56]  Yue Cui,et al.  Preferential binding of peptides to graphene edges and planes. , 2011, Journal of the American Chemical Society.

[57]  J. Devlin,et al.  Random peptide libraries: a source of specific protein binding molecules. , 1990, Science.

[58]  R. Perham,et al.  Factors limiting display of foreign peptides on the major coat protein of filamentous bacteriophage capsids and a potential role for leader peptidase , 1998, FEBS letters.

[59]  D. Klepacki,et al.  Selection of small peptides, inhibitors of translation. , 2009, Journal of molecular biology.

[60]  Dario Neri,et al.  20 years of DNA-encoded chemical libraries. , 2011, Chemical communications.

[61]  David R. Liu,et al.  Reaction discovery enabled by DNA-templated synthesis and in vitro selection , 2004, Nature.

[62]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[63]  Lee Makowski,et al.  RELIC – A bioinformatics server for combinatorial peptide analysis and identification of protein‐ligand interaction sites , 2004, Proteomics.

[64]  Piotr J. Balwierz,et al.  Methods for analyzing deep sequencing expression data: constructing the human and mouse promoterome with deepCAGE data , 2009, Genome Biology.

[65]  L. Farinelli,et al.  By-passing in vitro screening—next generation sequencing technologies applied to antibody display and in silico candidate selection , 2010, Nucleic acids research.

[66]  Mark D. Robinson,et al.  Moderated statistical tests for assessing differences in tag abundance , 2007, Bioinform..

[67]  Emmanuel Dias-Neto,et al.  Next-Generation Phage Display: Integrating and Comparing Available Molecular Tools to Enable Cost-Effective High-Throughput Analysis , 2009, PloS one.

[68]  G. Cesareni,et al.  Modifying filamentous phage capsid: limits in the size of the major capsid protein. , 1995, Journal of molecular biology.

[69]  Ali Torkamani,et al.  Phenotype-information-phenotype cycle for deconvolution of combinatorial antibody libraries selected against complex systems , 2011, Proceedings of the National Academy of Sciences.

[70]  Wadih Arap,et al.  Synchronous selection of homing peptides for multiple tissues by in vivo phage display , 2006, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[71]  Jamie K. Scott,et al.  Random-peptide libraries and antigen-fragment libraries for epitope mapping and the development of vaccines and diagnostics , 2001, Current Opinion in Chemical Biology.