Delayed Comparison and Apriori Algorithm (DCAA): A Tool for Discovering Protein–Protein Interactions From Time-Series Phosphoproteomic Data

Analysis of high-throughput omics data is one of the most important approaches for obtaining information about interactions between proteins or genes. Time-series omics data are omics measurements indexed in time order, and they normally contain more information about the interactions between biological macromolecules than static omics data. In addition, phosphorylation is a key posttranslational modification (PTM) that indicates possible changes in protein function during cellular processes. Analysis of time-series phosphoproteomic data should therefore provide more meaningful information about protein interactions. However, although many algorithms, databases, and websites have been developed to analyze omics data, tools dedicated to discovering molecular interactions from time-series omics data, especially time-series phosphoproteomic data, are still scarce. Moreover, most reported tools ignore the lag between functional alterations and the corresponding changes in protein synthesis/PTM and depend heavily on previous knowledge, resulting in high false-positive rates and difficulty in discovering new protein–protein interactions (PPIs). In the present study, we therefore developed a new method, the delayed comparison and Apriori algorithm (DCAA), to discover PPIs while addressing these problems. DCAA is based on the idea that there is a lag between functional alterations and the corresponding changes in protein synthesis/PTM. The Apriori algorithm is used to mine association rules from the relationships between items in a dataset and thereby find PPIs from time-series phosphoproteomic data. The advantage of DCAA is that it does not rely on previous knowledge or on PPI databases. Analysis of actual time-series phosphoproteomic data showed that more than 68% of the protein interactions/regulatory relationships predicted by DCAA were accurate. As an analytical tool for PPIs that does not rely on a priori knowledge, DCAA should be useful for predicting PPIs from time-series omics data, and the approach is not limited to phosphoproteomic data.
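
To make the idea of a delayed comparison concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each phosphosite profile is first reduced to up/down/unchanged symbols, that a single fixed lag is used, and that only pairwise rules are scored with support and confidence (the first level of an Apriori-style search, not the full algorithm). The function names, thresholds, and toy profiles are hypothetical.

```python
from itertools import permutations


def discretize(series, threshold=0.0):
    """Map consecutive differences to +1 (up), -1 (down), or 0 (unchanged)."""
    symbols = []
    for prev, curr in zip(series, series[1:]):
        diff = curr - prev
        if diff > threshold:
            symbols.append(1)
        elif diff < -threshold:
            symbols.append(-1)
        else:
            symbols.append(0)
    return symbols


def delayed_rules(profiles, lag=1, min_support=0.5, min_confidence=0.8):
    """Score directed pairs A -> B in which a change in A at step t is
    followed by a change in the same direction in B at step t + lag.

    Assumption: support is the fraction of aligned steps where both change
    together, and confidence is that count divided by the steps where A changes.
    """
    symbols = {name: discretize(values) for name, values in profiles.items()}
    rules = []
    for a, b in permutations(symbols, 2):
        sa, sb = symbols[a], symbols[b]
        n = len(sa) - lag
        if n <= 0:
            continue
        # Align A at step t with B at step t + lag.
        pairs = list(zip(sa[:n], sb[lag:lag + n]))
        a_active = sum(1 for x, _ in pairs if x != 0)
        both = sum(1 for x, y in pairs if x != 0 and x == y)
        support = both / n
        confidence = both / a_active if a_active else 0.0
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, b, support, confidence))
    return rules


# Hypothetical toy profiles: substrate_B follows kinase_A one step later.
profiles = {
    "kinase_A": [1.0, 2.0, 3.0, 2.0, 1.0, 2.0],
    "substrate_B": [1.0, 1.0, 2.0, 3.0, 2.0, 1.0],
    "unrelated_C": [5.0, 5.0, 5.0, 5.0, 5.0, 5.0],
}
rules = delayed_rules(profiles)
print(rules)
```

With these toy profiles, only the directed pair from kinase_A to substrate_B passes both thresholds, because the reverse direction and the flat profile never show a matching delayed change. A real analysis would need to choose the lag, the change threshold, and the support and confidence cutoffs from the data.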
