Network motifs are over-represented patterns within a network, and signify the fundamental building blocks of that network. The process of finding network motifs is closely related to the traditional subgraph isomorphism problem in computer science, which finds instances of a particular subgraph in a graph. This problem has been proven NP-complete, and thus even for relatively small subgraphs and graphs, today’s most efficient approaches require a large amount of computation time. Here we present parallel algorithms for network motif finding, including both query parallelization, where different subgraphs are searched for in parallel, and network parallelization, where the original network is partitioned into overlapping regions, and a single subgraph is searched for in parallel among the different regions. The network partitioning algorithm is a novel application of hierarchical clustering and divides the network into strongly connected components, while minimizing the overlap between them. Our results show query parallel network motif finding speeds up the computation time linearly with the number of workers, and allows us to systematically search for larger motifs than ever before. In addition, our novel network partitioning algorithms are effective at partitioning the network, which enables network parallelization to effectively reduce the time required to count instances of a particular subgraph.
[1]
Anil K. Jain,et al.
Data clustering: a review
,
1999,
CSUR.
[2]
Mihalis Yannakakis,et al.
The Complexity of Multiterminal Cuts
,
1994,
SIAM J. Comput..
[3]
Joshua A. Grochow,et al.
Network Motif Discovery Using Subgraph Enumeration and Symmetry-Breaking
,
2007,
RECOMB.
[4]
S. Shen-Orr,et al.
Network motifs: simple building blocks of complex networks.
,
2002,
Science.
[5]
B. D. Mckay,et al.
Practical graph isomorphism, Numerical mathematics and computing, Proc. 10th Manitoba Conf., Winnipeg/Manitoba 1980
,
1981
.
[6]
Mohit Singh,et al.
Approximating the k-multicut problem
,
2006,
SODA '06.
[7]
S. L. Wong,et al.
Motifs, themes and thematic maps of an integrated Saccharomyces cerevisiae interaction network
,
2005,
Journal of biology.