Engineering and analysis of peptide-recognition domain specificities by phage display and deep sequencing.

Protein interaction networks depend in part on the specific recognition of unstructured peptides by folded domains. Understanding how members of a domain family use a similar fold to recognize different peptide sequences selectively is a fundamental question. One way to advance our understanding of peptide recognition is to apply an existing model of peptide recognition for a particular domain toward engineering synthetic domain variants with desired properties. Successes, failures, and unintended outcomes can help refine the model and can illuminate more general principles of peptide recognition. Using the PDZ domain fold as an example, we describe methods for (1) structure-based combinatorial library design and directed evolution of domain variants and (2) specificity profiling of large repertoires of synthetic variants using multiplexed deep sequencing. Peptide-binding preferences for hundreds of variants can be decoded in parallel, enabling comparisons between different library designs and selection pressures. The tremendous depth of coverage of the binding peptide profiles also permits robust computational analysis. This approach to studying peptide recognition can be applied to other domains and to a variety of structural and functional models by tailoring the combinatorial library design and selection pressures accordingly.

[1]  L. Cantley,et al.  Recognition of Unique Carboxyl-Terminal Motifs by Distinct PDZ Domains , 1997, Science.

[2]  Richard Durbin,et al.  A large genome center's improvements to the Illumina sequencing system , 2008, Nature Methods.

[3]  Comprehensive mutational analysis of the M13 major coat protein: improved scaffolds for C-terminal phage display. , 2004, Journal of molecular biology.

[4]  M. Teresa Pisabarro,et al.  Analysis of PDZ Domain-Ligand Interactions Using Carboxyl-terminal Phage Display* , 2000, The Journal of Biological Chemistry.

[5]  Robert P. St.Onge,et al.  Highly-multiplexed barcode sequencing: an efficient method for parallel analysis of pooled samples , 2010, Nucleic acids research.

[6]  Gary D Bader,et al.  Rapid Evolution of Functional Complexity in a Domain Family , 2009, Science Signaling.

[7]  Sachdev S Sidhu,et al.  Origins of PDZ Domain Ligand Specificity , 2003, The Journal of Biological Chemistry.

[8]  Thomas A. Kunkel,et al.  Rapid and efficient site-specific mutagenesis without phenotypic selection. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[9]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[10]  Gary D. Bader,et al.  MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets , 2011, Nucleic acids research.

[11]  S. Sidhu,et al.  Phage display for selection of novel binding peptides. , 2000, Methods in enzymology.

[12]  T. Pawson,et al.  Assembly of Cell Regulatory Systems Through Protein Interaction Domains , 2003, Science.

[13]  D. Summers,et al.  A method for generating sticky-end PCR products which facilitates unidirectional cloning and the one-step assembly of complex DNA constructs. , 2008, Plasmid.

[14]  F. Studier,et al.  Protein production by auto-induction in high density shaking cultures. , 2005, Protein expression and purification.

[15]  Raffi Tonikian,et al.  Identifying specificity profiles for peptide recognition modules from phage-displayed peptide libraries , 2007, Nature Protocols.

[16]  Anton Nekrutenko,et al.  Manipulation of FASTQ data with Galaxy , 2010, Bioinform..

[17]  Gary D. Bader,et al.  Coevolution of PDZ domain-ligand interactions analyzed by high-throughput phage display and deep sequencing. , 2010, Molecular bioSystems.

[18]  Chris Sander,et al.  A Specificity Map for the PDZ Domain Family , 2008, PLoS biology.