Design of an Integrated and Effective Platform for Gene Expression Data Mining

Gene expression data mining is an important issue in bioinformatics research and applications since it can effectively help discover the functions of genes in high throughput way. Although there exists some software for gene expression analysis, they are insufficient in terms of efficiency, automation, flexibility and degree of integration. In this research, we design an integrated platform, namely GeneFilter, which carries the merits of high efficiency, high degree of automation, web-based anywhere flexibility and seamless integration. A standard and automated analysis flow is designed, which includes data normalization, genes filtering, genes clustering, and genes scoring so that users can easily discover the list of interesting genes, interesting expression patterns or gene clusters across different chip-array experiments. Rich kinds of visualization modules are also provided in web-based manner such that users can visualize the analysis results flexibly at anywhere. Meanwhile, useful biological knowledge like Gene Ontology and pathway information can also be retrieved easily in the same integrated platform. Hence, the GeneFilter platform serves as a highly effective platform for gene expression data mining and has been applied to different kinds of disease analysis successfully.

[1]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[2]  Hongfang Liu,et al.  Identifying significant genes from microarray data , 2004, Proceedings. Fourth IEEE Symposium on Bioinformatics and Bioengineering.

[3]  Anoop Grewal,et al.  Tools for Analyzing Microarray Expression Data , 2000 .

[4]  Jung-Hsien Chiang,et al.  MeKE: Discovering the Functions of Gene Products from Biomedical Literature Via Sentence Alignment , 2003, Bioinform..

[5]  Yan Wu,et al.  Quantitative Quality Control in Microarray Experiments and the Application in Data Filtering, Normalization and False Positive Rate Prediction , 2003, Bioinform..

[6]  Ron Shamir,et al.  Clustering Gene Expression Patterns , 1999, J. Comput. Biol..

[7]  Li-Ju Chen,et al.  An Ancova Approach to Normalize Microarray Data, and its Performance to Existing Methods , 2005, J. Bioinform. Comput. Biol..

[8]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[9]  M. Rajeevan,et al.  Copyright © American Society for Investigative Pathology and the Association for Molecular Pathology Validation of Array-Based Gene Expression Profiles by Real-Time (Kinetic) RT-PCR , 2022 .

[10]  M. Rajeevan,et al.  Use of real-time quantitative PCR to validate the results of cDNA array and differential display PCR technologies. , 2001, Methods.

[11]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[12]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[13]  Zohar Yakhini,et al.  Clustering gene expression patterns , 1999, J. Comput. Biol..

[14]  Terence P. Speed,et al.  Normalization for cDNA microarry data , 2001, SPIE BiOS.

[15]  J. Hudson,et al.  Development of a prostate cDNA microarray and statistical gene expression analysis package , 2000, Molecular carcinogenesis.

[16]  E. Dougherty,et al.  Multivariate measurement of gene expression relationships. , 2000, Genomics.

[17]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Cheng-Yan Kao,et al.  Ranking Genes for Discriminability on Microarray Data , 2003, J. Inf. Sci. Eng..

[19]  Nir Friedman,et al.  Tissue classification with gene expression profiles. , 2000 .

[20]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .