A neural-network approach for biclustering of gene expression data based on the plaid model

Biclustering techniques, for simultaneous row-column clustering, are widely used in the analysis of the gene expression data. Many different biclustering techniques have been proposed, such as the iterative signature algorithm (ISA) (Bergmann et al., 2003), global biclustering (Wolf et al., 2006), evolutionary fuzzy biclustering (Mitra et al., 2007), etc. Among these techniques, the plaid model is often used for multivariate data analysis. However, difficulties exist because there are mixed binary and continuous variables in this model for which the traditionally used optimization algorithms suitable for continuous variables cannot be employed in the realization of the biclustering process. In this paper, a novel neural-network approach is proposed to tackle such a mixed binary and continuous optimization problem in the plaid model. Experiment results show that the accuracy of the biclustering can be significantly improved with the proposed algorithm.

[1]  G. Churchill,et al.  Experimental design for gene expression microarrays. , 2001, Biostatistics.

[2]  Hong Yan,et al.  HoughFeature, a novel method for assessing drug effects in three-color cDNA microarray experiments , 2007, BMC Bioinformatics.

[3]  Sven Bergmann,et al.  Iterative signature algorithm for the analysis of large-scale gene expression data. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[5]  Thomas Hofmann,et al.  Global Biclustering of Microarray Data , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[6]  Hong Yan,et al.  A New Strategy of Geometrical Biclustering for Microarray Data Analysis , 2007, APBC.

[7]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[8]  L. Lazzeroni Plaid models for gene expression data , 2000 .

[9]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[10]  Sushmita Mitra,et al.  Evolutionary Fuzzy Biclustering of Gene Expression Data , 2007, RSKT.

[11]  Hong Yan,et al.  Biclustering gene expression data based on a high dimensional geometric method , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[12]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.