Discovering Interaction Motifs from Protein Interaction Networks

Recent breakthroughs in high-throughput experiments for detecting protein-protein interactions have generated a vast amount of protein interaction data. However, most of these experiments can only answer whether two proteins interact, not how they interact. Understanding the mechanisms of interaction is crucial for understanding the protein interactions of an organism as a whole (the interactome) and even for predicting novel protein interactions. Protein interactions usually occur at specific sites on the proteins and, given their importance, these sites are usually well conserved throughout the evolution of proteins of the same family. Based on this observation, a number of works on finding protein patterns, or motifs, conserved in interacting proteins have emerged in the last few years. Such motifs are collectively termed interaction motifs. This chapter reviews the different approaches to finding interaction motifs and discusses their implications, potential, and possible areas for future improvement.
