TRIQ: A Comprehensive Evaluation Measure for Triclustering Algorithms

Triclustering has shown to be a valuable tool for the analysis of microarray data since its appearance as an improvement of classical clustering and biclustering techniques. Triclustering relaxes the constraints for grouping and allows genes to be evaluated under a subset of experimental conditions and a subset of time points simultaneously. The authors previously presented a genetic algorithm, TriGen, that finds triclusters of gene expression dasta. They also defined three different fitness functions for TriGen: \(MSR_{3D}\), LSL and MSL. In order to asses the results obtained by application of TriGen, a validity measure needs to be defined. Therefore, we present TRIQ, a validity measure which combines information from three different sources: (1) correlation among genes, conditions and times, (2) graphic validation of the patterns extracted and (3) functional annotations for the genes extracted.

[1]  Jan Hauke,et al.  Comparison of Values of Pearson's and Spearman's Correlation Coefficients on the Same Sets of Data , 2011 .

[2]  Cristina Rubio-Escudero,et al.  LSL: A new measure to evaluate triclusters , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[3]  K. Tan,et al.  Finding Time-Lagged 3D Clusters , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[4]  Zhoujun Li,et al.  Multi-objective evolutionary algorithm for mining 3D clusters in gene-sample-time microarray data , 2008, 2008 IEEE International Conference on Granular Computing.

[5]  Shuigeng Zhou,et al.  gTRICLUSTER: A More General and Effective 3D Clustering Algorithm for Gene-Sample-Time Microarray Data , 2006, BioDM.

[6]  Oscar Cordón,et al.  A Multiobjective Evolutionary Conceptual Clustering Methodology for Gene Annotation Within Structural Databases: A Case of Study on the Gene Ontology Database , 2008, IEEE Transactions on Evolutionary Computation.

[7]  Vincent S. Tseng,et al.  A novel method for mining temporally dependent association rules in three-dimensional microarray datasets , 2010, 2010 International Computer Symposium (ICS2010).

[8]  Anand Swaroop,et al.  A role for prenylated rab acceptor 1 in vertebrate photoreceptor development , 2012, BMC Neuroscience.

[9]  Mohammed J. Zaki,et al.  TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data , 2005, SIGMOD '05.

[10]  C. Spearman CORRELATION CALCULATED FROM FAULTY DATA , 1910 .

[11]  Cristina Rubio-Escudero,et al.  MSL: A Measure to Evaluate Three-dimensional Patterns in Gene Expression Data , 2015, Evolutionary bioinformatics online.

[12]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[13]  Zhen Hu,et al.  BMC Bioinformatics BioMed Central Methodology article CLEAN: CLustering Enrichment ANalysis , 2009 .

[14]  Rocío Romero-Záliz,et al.  Classification of Gene Expression Profiles: Comparison of K-means and Expectation Maximization Algorithms , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[15]  Gerhard Nahler,et al.  Pearson Correlation Coefficient , 2020, Definitions.

[16]  Cristina Rubio-Escudero,et al.  Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure , 2014, TheScientificWorldJournal.

[17]  Jan Koster,et al.  OTX2 directly activates cell cycle genes and inhibits differentiation in medulloblastoma cells , 2012, International journal of cancer.

[18]  Martin Vingron,et al.  Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration , 2008, Bioinform..

[19]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[20]  Zhen Hu,et al.  Algorithm for Discovering Low-Variance 3-Clusters from Real-Valued Datasets , 2010, 2010 IEEE International Conference on Data Mining.

[21]  José Cristóbal Riquelme Santos,et al.  TriGen: A genetic algorithm to mine triclusters in temporal gene expression data , 2014, Neurocomputing.

[22]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[23]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.