Fast network component analysis (FastNCA) for gene regulatory network reconstruction from microarray data

MOTIVATION Recently developed network component analysis (NCA) approach is promising for gene regulatory network reconstruction from microarray data. The existing NCA algorithm is an iterative method which has two potential limitations: computational instability and multiple local solutions. The subsequently developed NCA-r algorithm with Tikhonov regularization can help solve the first issue but cannot completely handle the second one. Here we develop a novel Fast Network Component Analysis (FastNCA) algorithm which has an analytical solution that is much faster and does not have the above limitations. RESULTS Firstly FastNCA is compared to NCA and NCA-r using synthetic data. The reconstruction of FastNCA is more accurate than that of NCA-r and comparable to that of properly converged NCA. FastNCA is not sensitive to the correlation among the input signals, while its performance does degrade a little but not as dramatically as that of NCA. Like NCA, FastNCA is not very sensitive to small inaccuracies in a priori information on the network topology. FastNCA is about several tens times faster than NCA and several hundreds times faster than NCA-r. Then, the method is applied to real yeast cell-cycle microarray data. The activities of the estimated cell-cycle regulators by FastNCA and NCA-r are compared to the semi-quantitative results obtained independently by Lee et al. (2002). It is shown here that there is a greater agreement between the results of FastNCA and Lee's, which is represented by the ratio 23/33, than that between the results of NCA-r and Lee's, which is 14/33. AVAILABILITY Software and supplementary materials are available from http://www.eee.hku.hk/~cqchang/FastNCA.htm

[1]  L. Scharf,et al.  Statistical Signal Processing: Detection, Estimation, and Time Series Analysis , 1991 .

[2]  M. Savageau Biochemical Systems Analysis: A Study of Function and Design in Molecular Biology , 1976 .

[3]  L. Johnston,et al.  Getting started: regulating the initiation of DNA replication in yeast. , 1997, Annual review of microbiology.

[4]  Mike Tyers,et al.  The fork'ed path to mitosis , 2000, Genome Biology.

[5]  Gene H. Golub,et al.  Matrix computations , 1983 .

[6]  Katy C. Kao,et al.  Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Y.S. Hung,et al.  Network Component Analysis for Blind Source Separation , 2006, 2006 International Conference on Communications, Circuits and Systems.

[8]  Dimitris N. Politis,et al.  Computer-intensive methods in statistical analysis , 1998, IEEE Signal Process. Mag..

[9]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[10]  Vwani P. Roychowdhury,et al.  A generalized framework for network component analysis , 2005, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Jonas S. Almeida,et al.  Decoupling dynamical systems for pathway identification from metabolic profiles , 2004, Bioinform..

[12]  E. Voit,et al.  Pathway Analysis and Optimization in Metabolic Engineering , 2002 .

[13]  James C Liao,et al.  A Global Regulatory Role of Gluconeogenic Genes in Escherichia coli Revealed by Transcriptome Network Analysis* , 2005, Journal of Biological Chemistry.

[14]  Nicola J. Rinaldi,et al.  Serial Regulation of Transcriptional Regulators in the Yeast Cell Cycle , 2001, Cell.

[15]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[16]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[17]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[18]  Chiara Sabatti,et al.  Network component analysis: Reconstruction of regulatory signals in biological systems , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  James C. Liao,et al.  Transcriptome network component analysis with limited microarray data , 2006, Bioinform..

[20]  Tianwei Yu,et al.  Inference of transcriptional regulatory network by two-stage constrained space factor analysis , 2005, Bioinform..

[21]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[22]  Shi-Shang Jang,et al.  Biological network mapping and source signal deduction , 2007, Bioinform..

[23]  S. Batzoglou,et al.  Application of independent component analysis to microarrays , 2003, Genome Biology.

[24]  James C Liao,et al.  Inferring yeast cell cycle regulators and interactions using transcription factor activities , 2005, BMC Genomics.

[25]  D. Botstein,et al.  Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Yu Sun,et al.  The discovery of transcriptional modules by a two-stage matrix decomposition approach , 2007, Bioinform..

[27]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[28]  Zhi Ding,et al.  Fast Network Component Analysis for Gene Regulation Networks , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[29]  Katy C. Kao,et al.  gNCA: a framework for determining transcription factor activity based on transcriptome: identifiability and numerical implementation. , 2005, Metabolic engineering.

[30]  Chiara Sabatti,et al.  Bayesian sparse hidden components analysis for transcription regulation networks , 2005, Bioinform..

[31]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[32]  Timothy S Gardner,et al.  Reverse-engineering transcription control networks. , 2005, Physics of life reviews.

[33]  Wolfram Liebermeister,et al.  Linear modes of gene expression determined by independent component analysis , 2002, Bioinform..

[34]  Louis L. Scharf,et al.  Rank reduction for modeling stationary signals , 1987, IEEE Trans. Acoust. Speech Signal Process..

[35]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[36]  D. Botstein,et al.  Two yeast forkhead genes regulate the cell cycle and pseudohyphal growth , 2000, Nature.