An application of kernel methods to gene cluster temporal meta-analysis

The application of various clustering techniques for large-scale gene-expression measurement experiments is a well-established method in bioinformatics. Clustering is also usually accompanied by functional characterization of gene sets by assessing statistical enrichments of structured vocabularies, such as the gene ontology (GO) [Gene Ontology Consortium. The gene ontology (GO) project in 2006. Nucleic Acids Research (Database issue), vol. 34; 2006. p. D322-6]. If different clusters are generated for correlated experiments, a machine learning step termed cluster meta-analysis may be performed, in order to discover relations among the components of such sets. Several approaches have been proposed: in particular, kernel methods may be used to exploit the graphical structure of typical ontologies such as GO. Following up the formulation of such approach [Merico D, Zoppis I, Antoniotti M, Mauri G. Evaluating graph kernel methods for relation discovery in GO-annotated clusters. In: KES-2007/WIRN-2007, Part IV, Lecture notes in artificial intelligence, vol. 4694. Berlin: Springer; 2007. p. 892-900; Zoppis I, Merico D, Antoniotti M, Mishra B, Mauri G. Discovering relations among GO-annotated clusters by graph kernel methods. In: Proceedings of the 2007 international symposium on bioinformatics research and applications. Lecture notes in computer science, vol. 4463. Berlin: Springer; 2007], in this paper we discuss, from an information-theoretic point of view, further results about its applicability and its performance.

[1]  Naren Ramakrishnan,et al.  Reconstructing formal temporal models of cellular events using the GO process ontology , 2005 .

[2]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[3]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[4]  Risi Kondor,et al.  Diffusion kernels on graphs and other discrete structures , 2002, ICML 2002.

[5]  A. Gupta,et al.  Extracting Dynamics from Static Cancer Expression Data , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[6]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[7]  R. Courant,et al.  Methods of Mathematical Physics , 1962 .

[8]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[9]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[10]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[11]  Jason C. Mills,et al.  GOurmet: A tool for quantitative comparison and visualization of gene expression profiles based on gene ontology (GO) distributions , 2006, BMC Bioinformatics.

[12]  Giancarlo Mauri,et al.  Discovering Relations Among GO-Annotated Clusters by Graph Kernel Methods , 2007, ISBRA.

[13]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[14]  Junhyong Kim,et al.  Reconstructing the Temporal Ordering of Biological Samples Using Microarray Data , 2003, Bioinform..

[15]  Roded Sharan,et al.  Center CLICK: A Clustering Algorithm with Applications to Gene Expression Analysis , 2000, ISMB.

[16]  William Stafford Noble,et al.  Kernel methods for predicting protein-protein interactions , 2005, ISMB.

[17]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[18]  Naren Ramakrishnan,et al.  Simultaneously Segmenting Multiple Gene Expression Time Courses by Analyzing Cluster Dynamics , 2009, APBC.

[19]  Hisashi Kashima,et al.  Marginalized Kernels Between Labeled Graphs , 2003, ICML.

[20]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[21]  Patrick Brézillon,et al.  Lecture Notes in Artificial Intelligence , 1999 .

[22]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[23]  D. Edwards,et al.  Statistical Analysis of Gene Expression Microarray Data , 2003 .

[24]  Giancarlo Mauri,et al.  Evaluating Graph Kernel Methods for Relation Discovery in GO-Annotated Clusters , 2007, KES.

[25]  T. Speed,et al.  GOstat: find statistically overrepresented Gene Ontologies within a group of genes. , 2004, Bioinformatics.

[26]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[27]  Hans-Peter Kriegel,et al.  Shortest-path kernels on graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).