StreaM - A Stream-Based Algorithm for Counting Motifs in Dynamic Graphs

Determining the occurrence of motifs yields profound insight into many biological systems, such as metabolic, protein-protein interaction, and protein structure networks. Meaningful spatial protein-structure motifs include enzyme active sites and ligand-binding sites, which are essential for the function, shape, and performance of an enzyme. Analyzing their dynamics over time leads to a better understanding of underlying properties and processes. In this work, we present StreaM, a stream-based algorithm for counting undirected 4-vertex motifs in dynamic graphs. We evaluate StreaM against the four predominant approaches from the current state of the art on generated and real-world datasets, the latter taken from a simulation of a highly dynamic enzyme. For this case, we show that StreaM is capable of capturing essential molecular protein dynamics and thereby provides a powerful method for evaluating large molecular dynamics trajectories. Compared to related work, our approach achieves speedups of up to 2,300 times on real-world datasets.
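To make the idea of stream-based counting concrete, the following is a minimal illustrative sketch, not the StreaM implementation itself. It assumes that a single edge update between vertices a and b can only change 4-vertex subgraphs containing both a and b, so only those are re-examined. The class `MotifCounter`, the function `classify`, and the motif names are hypothetical, and the naive enumeration over all remaining vertex pairs is chosen for clarity rather than speed.

```python
from itertools import combinations

# The six connected undirected 4-vertex graphs, keyed by sorted degree sequence.
# The motif names are illustrative labels, not taken from the paper.
MOTIFS = {
    (1, 1, 2, 2): "path",
    (1, 1, 1, 3): "star",
    (2, 2, 2, 2): "cycle",
    (1, 2, 2, 3): "tailed_triangle",
    (2, 2, 3, 3): "diamond",
    (3, 3, 3, 3): "clique",
}


def classify(adj, nodes):
    """Return the motif name induced by the four nodes, or None if disconnected."""
    node_set = set(nodes)
    degrees = tuple(sorted(len(adj[v] & node_set) for v in nodes))
    return MOTIFS.get(degrees)


class MotifCounter:
    """Hypothetical incremental counter; a naive sketch, not the authors' code."""

    def __init__(self):
        self.adj = {}
        self.counts = {name: 0 for name in MOTIFS.values()}

    def _apply(self, a, b, add):
        if a == b:
            raise ValueError("self-loops are not supported")
        self.adj.setdefault(a, set())
        self.adj.setdefault(b, set())
        if (b in self.adj[a]) == add:
            return  # the update would not change the graph
        others = [v for v in self.adj if v != a and v != b]
        pairs = list(combinations(others, 2))
        # Retract every motif that contains both endpoints before the update.
        for c, d in pairs:
            motif = classify(self.adj, (a, b, c, d))
            if motif is not None:
                self.counts[motif] -= 1
        # Apply the edge update itself.
        if add:
            self.adj[a].add(b)
            self.adj[b].add(a)
        else:
            self.adj[a].discard(b)
            self.adj[b].discard(a)
        # Re-count the same vertex sets after the update.
        for c, d in pairs:
            motif = classify(self.adj, (a, b, c, d))
            if motif is not None:
                self.counts[motif] += 1

    def add_edge(self, a, b):
        self._apply(a, b, add=True)

    def remove_edge(self, a, b):
        self._apply(a, b, add=False)


counter = MotifCounter()
for u, v in [(0, 1), (1, 2), (2, 3)]:
    counter.add_edge(u, v)
print(counter.counts["path"])   # 1: the vertices 0-1-2-3 form a single path
counter.add_edge(3, 0)
print(counter.counts["cycle"])  # 1: closing the path turns it into a cycle
```

Each update in this sketch touches a number of vertex pairs that is quadratic in the graph size; a practical implementation would restrict the candidates to the neighborhoods of the two endpoints.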
