A neural strategy for the inference of SH3 domain-peptide interaction specificity

Background
The SH3 domain family is one of the most representative and widely studied cases of so-called Peptide Recognition Modules (PRMs). The polyproline II motif PxxP that generally characterizes its ligands does not reflect the complex interaction spectrum of the more than 1500 different SH3 domains, and a more refined knowledge of their specificity requires appropriate experimental and theoretical strategies. Due to the limitations of the current technology for peptide synthesis, several experimental high-throughput approaches have been devised to elucidate protein-protein interaction mechanisms. Such approaches can take advantage of computational techniques, such as regular expressions or position-specific scoring matrices (PSSMs), to pre-process entire proteomes in the search for putative SH3 targets. In this regard, a reliable inference methodology for reducing the sequence space of putative binding peptides represents a valuable support for molecular and cellular biologists.

Results
Using as benchmark the peptide sequences obtained from in vitro binding experiments, we set up a neural network model that performs better than a PSSM in the detection of SH3 domain interactors. In particular, our model is more precise in its predictions, although its performance varies among SH3 domains and depends strongly on the number of binding peptides in the benchmark.

Conclusion
We show that a neural network can be more effective than standard methods in SH3 domain specificity detection. Neural classifiers identify general SH3 domain binders and domain-specific interactors from a PxxP peptide population, provided that there is a sufficient proportion of true positives in the training sets. This capability can also improve peptide selection for library definition in array experiments. Further advances can be achieved by including properly encoded domain sequences and structural information as input to a global neural network.
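
To illustrate the kind of pre-processing the Background refers to, the following is a minimal sketch of the standard baseline: a regular expression selects peptide windows containing a PxxP core, and a log-odds PSSM scores them. It is not the method of the paper. The function names, the window width, the uniform background, the pseudocount and the toy peptides are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues only


def pxxp_windows(sequence, width=10):
    """Yield (start, peptide) for each window of `width` residues
    that contains a PxxP core anywhere inside it."""
    core = re.compile(r"P..P")
    for start in range(len(sequence) - width + 1):
        peptide = sequence[start:start + width]
        if core.search(peptide):
            yield start, peptide


def build_pssm(binders, pseudocount=1.0):
    """Build a log-odds PSSM from equal-length binder peptides.
    Assumes a uniform background frequency (an illustrative choice)."""
    width = len(binders[0])
    background = 1.0 / len(AMINO_ACIDS)
    total = len(binders) + pseudocount * len(AMINO_ACIDS)
    pssm = []
    for pos in range(width):
        counts = Counter(peptide[pos] for peptide in binders)
        column = {
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        }
        pssm.append(column)
    return pssm


def score(peptide, pssm):
    """Sum the per-position log-odds of a peptide under the PSSM."""
    return sum(column[aa] for aa, column in zip(peptide, pssm))


if __name__ == "__main__":
    # Toy binders, invented for this example (not experimental data).
    toy_binders = ["PAAP", "PAAP", "PLAP"]
    toy_pssm = build_pssm(toy_binders)
    for start, peptide in pxxp_windows("AAAPAAPAAAAA", width=6):
        print(start, peptide)
    print(score("PAAP", toy_pssm) > score("GGGG", toy_pssm))
```

A neural classifier of the kind described in the Results would replace `score` with a trained model over encoded peptides, while the PxxP filter would still define the candidate population.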
