A Comparison Between Block CEM and Two-Way CEM Algorithms to Cluster a Contingency Table

When the data consists of a set of objects described by a set of variables, we have recently proposed a new mixture model which takes into account the block clustering problem on the both sets and have developed the block CEM algorithm. In this paper, we embed the block clustering problem of contingency table in the mixture approach. In using a Poisson model and adopting the classification maximum likelihood principle we perform an adapted version of block CEM. We evaluate its performance and compare it to a simple use of CEM applied on the both sets separately. We present detailed experimental results on simulated data and we show the interest of this new algorithm.

[1]  F. Marcotorchino,et al.  Block seriation problems: A unified approach. Reply to the problem of H. Garcia and J. M. Proth (Applied Stochastic Models and Data Analysis, 1, (1), 25–34 (1985)) , 1987 .

[2]  Mohamed Nadif,et al.  Fuzzy clustering to estimate the parameters of block mixture models , 2006, Soft Comput..

[3]  Gérard Govaert,et al.  Clustering with block mixture models , 2003, Pattern Recognit..

[4]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[5]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[6]  Gilbert Ritschard,et al.  Maximisation de l'association par regroupement de lignes ou de colonnes d'un tableau croisé , 2001 .

[7]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[8]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[9]  Gérard Govaert La classification croisée , 1989, Monde des Util. Anal. Données.

[10]  G. Celeux,et al.  A Classification EM algorithm for clustering and two stochastic versions , 1992 .

[11]  Michael J. Symons,et al.  Clustering criteria and multivariate normal mixtures , 1981 .

[12]  Gérard Govaert,et al.  An EM algorithm for the block mixture model , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[14]  Phipps Arabie,et al.  The bond energy algorithm revisited , 1990, IEEE Trans. Syst. Man Cybern..

[15]  D. Duffy,et al.  A permutation-based algorithm for block clustering , 1991 .