Reverse-engineering biological interaction networks from noisy data using Regularized Least Squares and Instrumental Variables

The problem of reverse engineering the topology of a biological network from noisy time-series measurements is one of the most important challenges in the field of Systems Biology. In this work, we develop a new inference approach which combines the Regularized Least Squares (RLS) technique with a technique to avoid the introduction of bias and non-consistency due to measurement noise in the estimation of the parameters in the standard Least Squares (LS) formulation, the Instrumental Variables (IV) method. We test our approach on a set of nonlinear in silico networks and show that the combined exploitation of RLS and IV methods improves the predictions with respect to other standard approaches.

[1]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[2]  Reinhard Laubenbacher,et al.  Modeling and Simulation of Biological Networks , 2007 .

[3]  Gene H. Golub,et al.  Generalized cross-validation as a method for choosing a good ridge parameter , 1979, Milestones in Matrix Computation.

[4]  J. Collins,et al.  Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks , 2005, Nature Biotechnology.

[5]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[6]  Jesper Tegnér,et al.  Reverse engineering gene networks using singular value decomposition and robust regression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[7]  D. Floreano,et al.  Revealing strengths and weaknesses of methods for gene network inference , 2010, Proceedings of the National Academy of Sciences.

[8]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[9]  F. Amato,et al.  Exploiting prior knowledge and preferential attachment to infer biological interaction networks , 2009, 2009 17th Mediterranean Conference on Control and Automation.

[10]  Carsten O. Daub,et al.  The mutual information: Detecting and evaluating dependencies between variables , 2002, ECCB.

[11]  Diego di Bernardo,et al.  Robust Identification of Large Genetic Networks , 2003, Pacific Symposium on Biocomputing.

[12]  Francesco Amato,et al.  Identification of regulatory pathways of the cell cycle in fission yeast , 2006 .

[13]  D. di Bernardo,et al.  Linear matrix inequalities approach to reconstruction of biological networks. , 2007, IET systems biology.

[14]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[15]  Kevin Y. Yip,et al.  Improved Reconstruction of In Silico Gene Regulatory Networks by Integrating Knockout and Perturbation Data , 2010, PloS one.

[16]  Jongrae Kim,et al.  Reverse Engineering Partially-Known Interaction Networks from Noisy Data , 2011 .

[17]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[18]  D di Bernardo,et al.  Inference of gene networks from temporal gene expression profiles. , 2007, IET systems biology.

[19]  Gail D. Baura,et al.  Nonlinear System Identification , 2002 .

[20]  Jerry M. Mendel,et al.  The constrained total least squares technique and its applications to harmonic superresolution , 1991, IEEE Trans. Signal Process..

[21]  Y. Tu,et al.  Quantitative noise analysis for gene expression microarray experiments , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Kwang-Hyun Cho,et al.  Linear time-varying models can reveal non-linear interactions of biomolecular regulatory networks using multiple time-series data , 2008, Bioinform..

[23]  Dario Floreano,et al.  Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods , 2009, J. Comput. Biol..

[24]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.