ICA based supervised gene classification of Microarray data in yeast functional genome

In order to study the function of unknown genes in functional genome, traditional supervised classification algorithms were applied to gene classification with Microarray expression profiles. But the results show that the classification precision is poor and the accuracies achieved for different classes varies dramatically. Because the gene expression profiles are mixed with different biologically meaningful information and there is much noise in the genomic-scale dataset. Independent component analysis (ICA) is a method for multi-channel signal processing to separate mixed signals. Through linear transformation, ICA minimizes the statistical dependence of the components of the represented variables. So in this paper ICA based supervised gene classification in yeast functional genome is presented, which is a hybrid method of ICA with supervised classification approaches. This method recognizes the hidden patterns under the gene expression profiles and reduces the noise that is abundant in the gene expression profiles efficiently. Experimental results show that this method improves the performance of precision and recall

[1]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Aidong Zhang,et al.  Cluster analysis for gene expression data: a survey , 2004, IEEE Transactions on Knowledge and Data Engineering.

[3]  E. Oja,et al.  Independent Component Analysis , 2001 .

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  George Karypis,et al.  Gene classification using expression profiles: a feasibility study , 2001, Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001).

[6]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[7]  Jan Komorowski,et al.  Predicting Gene Function from Gene Expressions and Ontologies , 2000, Pacific Symposium on Biocomputing.

[8]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[10]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[11]  Masato Inoue,et al.  BLIND GENE CLASSIFICATION BASED ON ICA OF MICROARRAY DATA , 2001 .

[12]  Wolfram Liebermeister,et al.  Linear modes of gene expression determined by independent component analysis , 2002, Bioinform..

[13]  Junying Zhang,et al.  Support vector machine classifications for microarray expression data set , 2003, Proceedings Fifth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2003.

[14]  Dmitrij Frishman,et al.  MIPS: a database for protein sequences and complete genomes , 1998, Nucleic Acids Res..