Mining pathway signatures from microarray data and relevant biological knowledge

High-throughput technologies such as DNA microarray are in the process of revolutionizing the way modern biological research is being done. Bioinformatics tools are becoming increasingly important to assist biomedical scientists in their quest in understanding complex biological processes. Gene expression analysis has attracted a large amount of attention over the last few years mostly in the form of algorithms, exploring cluster and regulatory relationships among genes of interest, and programs that try to display the multidimensional microarray data in appropriate formats so that they make biological sense. To reduce the dimensionality of microarray data and make the corresponding analysis more biologically relevant, in this paper we propose a biologically-led approach to biochemical pathway analysis using microarray data and relevant biological knowledge. The method selects a subset of genes for each pathway that describes the behaviour of the pathway at a given experimental condition, and transforms them into pathway signatures. The metabolic pathways of Escherichia coli are used as a case study.

[1]  Robert Giegerich,et al.  PathFinder: reconstruction and dynamic visualization of metabolic pathways , 2002, Bioinform..

[2]  J. Claverie Computational methods for the identification of differential and coordinated gene expression. , 1999, Human molecular genetics.

[3]  Xiaohui Liu,et al.  Biochemical Pathway Analysis via Signature Mining , 2005, CompLife.

[4]  Annette Payne,et al.  Signature Mining : a Heuristic Approach to Biochemical Pathway Analysis , .

[5]  Xiaohui Liu,et al.  Consensus clustering and functional interpretation of gene-expression data , 2004, Genome Biology.

[6]  Steven C. Lawlor,et al.  GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways , 2002, Nature Genetics.

[7]  P. Brown,et al.  Parallel human genome analysis: microarray-based expression monitoring of 1000 genes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Peter D. Karp,et al.  The EcoCyc Database , 2002, Nucleic Acids Res..

[9]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[10]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[12]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[13]  Akihiko Konagaya,et al.  GSCope: a clipped fisheye viewer effective for highly complicated biomolecular network graphs , 2003, Bioinform..

[14]  Dr. Zbigniew Michalewicz,et al.  How to Solve It: Modern Heuristics , 2004 .

[15]  L. Ohno-Machado Journal of Biomedical Informatics , 2001 .

[16]  An-Ping Zeng,et al.  Reconstruction of metabolic networks from genome data and analysis of their global structure for various organisms , 2003, Bioinform..

[17]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[18]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[19]  D. Botstein,et al.  DNA microarray analysis of gene expression in response to physiological and genetic changes that affect tryptophan metabolism in Escherichia coli. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Ajay N. Jain,et al.  Pathway recognition and augmentation by computational analysis of microarray expression data , 2006, Bioinform..

[21]  Sui Huang,et al.  Back to the biology in systems biology: what can we learn from biomolecular networks? , 2004, Briefings in functional genomics & proteomics.

[22]  Kelvin H Lee,et al.  Functional genomics and proteomics as a foundation for systems biology. , 2003, Briefings in functional genomics & proteomics.

[23]  Nikolay A. Kolchanov,et al.  GeneNet: a gene network database and its automated visualization , 1998, Bioinform..