On the Relative Efficiency of Maximal Clique Enumeration Algorithms , with Application to High-Throughput Computational Biology

The efficient enumeration of maximal cliques has applications in microarray analysis and a number of other foundational problems of computational biology. In this paper, we analyze and test existing maximal clique enumeration algorithms for various classes of graphs. The classic branch and bound algorithm of Bron and Kerbosch proves to be relatively fast for sparse graphs, but slows considerably as edge density increases. Attempts to improve this algorithm are discussed. Experimental results demonstrate the difficulty of making improvements, especially when analyzing the overlap between cliques. Novel strategies for maximal clique enumeration algorithms are also described and placed in the context of ongoing research.

[1]  J. Moon,et al.  On cliques in graphs , 1965 .

[2]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[3]  W. Reznikoff,et al.  The lactose operon‐controlling elements: a complex paradigm , 1992, Molecular microbiology.

[4]  Panos M. Pardalos,et al.  The maximum clique problem , 1994, J. Glob. Optim..

[5]  Thomas Linke,et al.  Visualizing plant metabolomic correlation networks using clique-metabolite matrices , 2001, Bioinform..

[6]  P. Marin,et al.  Synaptic multiprotein complexes associated with 5‐HT2C receptors: a proteomic approach , 2002, The EMBO journal.

[7]  Expression and distribution of phocein and members of the striatin family in neurones of rat peripheral ganglia , 2003, Histochemistry and Cell Biology.

[8]  Small Maximal Independent Sets and Faster Exact Graph Coloring , 2003 .

[9]  Michael A. Langston,et al.  High performance computational tools for Motif discovery , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[10]  Zhigang Jin,et al.  Characterization and promoter analysis of the mouse nestin gene , 2004, FEBS letters.

[11]  Akira Tanaka,et al.  The Worst-Case Time Complexity for Generating All Maximal Cliques , 2004, COCOON.

[12]  Faisal N. Abu-Khzam,et al.  Scalable parallel algorithms for difficult combinatorial problems: A case study in optimization , 2004, Parallel and Distributed Computing and Networks.

[13]  Michael A. Langston,et al.  Computational, Integrative, and Comparative Methods for the Elucidation of Genetic Coexpression Networks , 2005, Journal of biomedicine & biotechnology.