On the Approximability of the Minimum Weight $t$-partite Clique Problem

The Minimum Weight t-partite Clique Problem MWtCP is the problem of finding a t-clique with minimum weight in a complete edgeweighted t-partite graph. The motivation for studying this problem is its potential in modelling the problem of identifying sets of commonly existing putative co-regulated, co-expressed genes, called gene clusters. In this paper, we show that MWtCP is NP-hard, APX-hard in the general case. We also present a 2-approximation algorithm that runs in O(n) for the metric case and has 1+ 1 t -approximation performance guarantee for the ultrametric subclass of instances. We further show how relaxing or tightening the application of the metricity property affects the approximation ratio. Finally insights on the application MWtCP to gene cluster discovery are presented. Submitted: August 2018 Reviewed: January 2019 Revised: March 2019 Reviewed: July 2019 Revised: September 2019 Accepted: March 2020 Final: March 2020 Published: April 2020 Article type: Regular Paper Communicated by: F. Vandin E-mail addresses: gasolano@up.edu.ph (Geoffrey Solano) guillaume.blin@labri.fr (Guillaume Blin) mathieu.raffinot@labri.fr (Mathieu Raffinot) jhoiclemente@gmail.com (Jhoirene Clemente) jdlcaro@up.edu.ph (Jaime Caro) 172 Solano, Blin, Raffinot, Clemente & Caro Approximability of MWtCP

[1]  Pablo Moscato,et al.  Combinatorial and Algorithmic Issues for Microarray Analysis , 2007, Handbook of Approximation Algorithms and Metaheuristics.

[2]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[3]  Wayne J. Pullan,et al.  Approximating the maximum vertex/edge weighted clique using local search , 2008, J. Heuristics.

[4]  M. Yu. Khachai,et al.  2-Approximation algorithm for finding a clique with minimum weight of vertices and edges , 2014 .

[5]  Jack Minker,et al.  An Analysis of Some Graph Theoretical Cluster Techniques , 1970, JACM.

[6]  박경철,et al.  An Extended Formulation Approach to the Edge-weighted Maximal Clique Problem , 1995 .

[7]  Albert-László Barabási,et al.  Systems biology. Life's complexity pyramid. , 2002, Science.

[8]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[9]  Shuji Tsukiyama,et al.  A New Algorithm for Generating All the Maximal Independent Sets , 1977, SIAM J. Comput..

[10]  Fred W. Glover,et al.  Solving the maximum edge weight clique problem via unconstrained quadratic programming , 2007, Eur. J. Oper. Res..

[11]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[12]  Gang Yu,et al.  Approximation Algorithms for the k-Clique Covering Problem , 1996, SIAM J. Discret. Math..

[13]  Frank Harary,et al.  A Procedure for Clique Detection Using the Group Matrix , 1957 .

[14]  Sven Rahmann,et al.  Integer Linear Programming Techniques for Discovering Approximate Gene Clusters , 2007 .

[15]  R. Prim Shortest connection networks and some generalizations , 1957 .

[16]  Geoffrey A. Solano,et al.  InteGene: An Integer Linear Programming Tool for Discovering Approximate Gene Clusters , 2019, 2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA).

[17]  Mathieu Raffinot,et al.  A Clique Finding Algorithm for the Approximate Gene Cluster Discovery Problem , 2018 .

[18]  Geoffrey A. Solano,et al.  Cluster center genes as candidate biomarkers for the classification of Leukemia , 2014, IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications.

[19]  J. Jeffry Howbert,et al.  The Maximum Clique Problem , 2007 .

[20]  Geoffrey A. Solano,et al.  Clique-finding Tool for Detecting Approximate Gene Clusters , 2019, 2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA).

[21]  Luca Trevisan,et al.  Inapproximability of Combinatorial Optimization Problems , 2004, Electron. Colloquium Comput. Complex..

[22]  Mathieu Raffinot,et al.  The Algorithmic of Gene Teams , 2002, WABI.

[23]  M. Yu. Khachai,et al.  Efficient algorithms with performance guarantees for some problems of finding several cliques in a complete undirected weighted graph , 2015 .

[24]  Henry N. Adorna,et al.  PROBING THE HARDNESS OF THE APPROXIMATE GENE CLUSTER DISCOVERY PROBLEM (AGCDP) , 2014 .

[25]  U. Alon Biological Networks: The Tinkerer as an Engineer , 2003, Science.

[26]  D. Kumlander,et al.  A new exact algorithm for the maximum-weight clique problem based on a heuristic vertex-coloring and a backtrack search , 2022, International Journal of Global Operations Research.

[27]  Rainer Breitling,et al.  What is Systems Biology? , 2010, Front. Physiology.

[28]  Michael M. Sørensen,et al.  New facets and a branch-and-cut algorithm for the weighted clique problem , 2004, Eur. J. Oper. Res..

[29]  J. Håstad Clique is hard to approximate withinn1−ε , 1999 .

[30]  Jiping Liu,et al.  Approximation Algorithms for Some Graph Partitioning Problems , 2000, J. Graph Algorithms Appl..

[31]  Dannie Durand,et al.  The Incompatible Desiderata of Gene Cluster Properties , 2005, Comparative Genomics.

[32]  Jean Walrand,et al.  Maximal Cliques in Unit Disk Graphs: Polynomial Approximation , 2006 .