Transcription Network Analysis by A Sparse Binary Factor Analysis Algorithm

Summary Transcription factor activities (TFAs), rather than expression levels, control gene expres- sion and provide valuable information for investigating TF-gene regulations. The underly- ing bimodal or switch-like patterns of TFAs may play important roles in gene regulation. Network Component Analysis (NCA) is a popular method to deduce TFAs and TF-gene control strengths from microarray data. However, it does not directly examine the bimodal- ity of TFAs and it needs the TF-gene connection topology to be a priori known. In this paper, we modify NCA to model gene expression regulation by Binary Factor Analysis (BFA), which directly captures switch-like patterns of TFAs. Moreover, sparse technique is employed on the mixing matrix of BFA, and thus the proposed sparse BYY-BFA al- gorithm, developed under Bayesian Ying-Yang (BYY) learning framework, can not only uncover the latent TFA profile’s switch-like patterns, but also be capable of automatically shutting off the unnecessary connections. Simulation study demonstrates the effectiveness of BYY-BFA, and a preliminary application to Saccharomyces cerevisiae cell cycle data and Escherichia coli carbon source transition data shows that the reconstructed binary pat- terns of TFAs by BYY-BFA are consistent with the ups and downs of TFAs by NCA, and that BYY-BFA also works well when the network topology is unknown.

[1]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[2]  Chiara Sabatti,et al.  Network component analysis: Reconstruction of regulatory signals in biological systems , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Lei Xu,et al.  Advances on BYY harmony learning: information theoretic perspective, generalized projection geometry, and independent factor autodetermination , 2004, IEEE Transactions on Neural Networks.

[4]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[5]  Lei Xu,et al.  On essential topics of BYY harmony learning: Current status, challenging issues, and gene analysis applications , 2012 .

[6]  L. Xu Independent Component Analysis and Extensions with Noise and Time: A Bayesian Ying-Yang Learning Perspective , 2003 .

[7]  Jeff Hasty,et al.  A DNA methylation–based switch generates bistable gene expression , 2007, Nature Genetics.

[8]  Lei Xu,et al.  Emerging themes on information theory and Bayesian approach , 2010 .

[9]  Grace X. Y. Zheng,et al.  MicroRNAs can generate thresholds in target gene expression , 2011, Nature Genetics.

[10]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[11]  Lei Xu,et al.  Canonical Dual Approach to Binary Factor Analysis , 2009, ICA.

[12]  X. Lei,et al.  Codimensional matrix pairing perspective of BYY harmony learning: hierarchy of bilinear systems, joint decomposition of data-covariance, and applications of network biology , 2011 .

[13]  Matthew N. McCall,et al.  The Gene Expression Barcode: leveraging public data repositories to begin cataloging the human and murine transcriptomes , 2010, Nucleic Acids Res..

[14]  Geoffrey E. Hinton,et al.  Modeling Human Motion Using Binary Latent Variables , 2006, NIPS.

[15]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[16]  H. Bussemaker,et al.  Regulatory element detection using correlation with expression , 2001, Nature Genetics.

[17]  Václav Snásel,et al.  Binary Factor Analysis with Help of Formal Concepts , 2004, CLA.

[18]  Jennifer G. Dy,et al.  Sparse Probabilistic Principal Component Analysis , 2009, AISTATS.

[19]  Katy C. Kao,et al.  gNCA: a framework for determining transcription factor activity based on transcriptome: identifiability and numerical implementation. , 2005, Metabolic engineering.

[20]  Lei Xu,et al.  A Comparative Investigation on Model Selection in Binary Factor Analysis , 2005, Foundations of Data Mining and knowledge Discovery.

[21]  Lei Xu,et al.  BYY harmony learning, independent state space, and generalized APT financial analyses , 2001, IEEE Trans. Neural Networks.

[22]  L. Xu Bayesian Ying-Yang system, best harmony learning, and five action circling , 2010 .

[23]  L. MacNeil,et al.  Gene regulatory networks and the role of robustness and stochasticity in the control of gene expression. , 2011, Genome research.

[24]  James C. Liao,et al.  Trimming of mammalian transcriptional networks using network component analysis , 2010, BMC Bioinformatics.

[25]  James C. Liao,et al.  Transcriptome network component analysis with limited microarray data , 2006, Bioinform..

[26]  Peter M. Williams,et al.  Bayesian Regularization and Pruning Using a Laplace Prior , 1995, Neural Computation.

[27]  Katy C. Kao,et al.  Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Michael Gormley,et al.  Expression profiles of switch-like genes accurately classify tissue and infectious disease phenotypes in model-based classification , 2008, BMC Bioinformatics.