Non-linear dimensionality reduction of signaling networks

Background

Systems-wide modeling and analysis of signaling networks is essential for understanding complex cellular behaviors, such as the biphasic responses to different combinations of cytokines and growth factors. For example, tumor necrosis factor (TNF) can act as a proapoptotic or prosurvival factor depending on its concentration, the current state of the signaling network and the presence of other cytokines. To understand combinatorial regulation in such systems, new computational approaches are required that can take into account non-linear interactions in signaling networks and provide tools for clustering, visualization and predictive modeling.

Results

Here we extended and applied an unsupervised non-linear dimensionality reduction approach, Isomap, to find clusters of similar treatment conditions in two cell signaling networks: (I) the apoptosis signaling network in human epithelial cancer cells treated with different combinations of TNF, epidermal growth factor (EGF) and insulin and (II) a combination of signal transduction pathways stimulated by 21 different ligands, based on the AfCS double ligand screen data. For the analysis of the apoptosis signaling network we used the Cytokine compendium dataset, in which the activity and concentration of 19 intracellular signaling molecules were measured to characterize the apoptotic response to TNF, EGF and insulin. By projecting the original 19-dimensional space of intracellular signals into a low-dimensional space, Isomap was able to reconstruct clusters corresponding to different cytokine treatments, which were identified with graph-based clustering. In comparison, Principal Component Analysis (PCA) and Partial Least Squares Discriminant Analysis (PLS-DA) were unable to find biologically meaningful clusters.

We also showed that by using Isomap components for supervised classification with k-nearest neighbor (k-NN) and quadratic discriminant analysis (QDA), apoptosis intensity can be predicted for different combinations of TNF, EGF and insulin. Prediction accuracy was highest when early activation time points in the apoptosis signaling network were used to predict apoptosis rates at later time points. Extended Isomap also outperformed PCA on the AfCS double ligand screen data: Isomap identified more functionally coherent clusters than PCA and captured more information in the first two components. The Isomap projection performs slightly worse when more signaling networks are analyzed, suggesting that the mapping function between cues and responses becomes increasingly non-linear when large signaling pathways are considered.

Conclusion

We developed and applied an extended Isomap approach for the analysis of cell signaling networks. Potential biological applications of this method include characterization, visualization and clustering of different treatment conditions (e.g. low and high doses of TNF) in terms of the changes in intracellular signaling they induce.
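
To make the projection step concrete, the following is a minimal sketch of the standard Isomap procedure (k-nearest-neighbor graph, shortest-path geodesic distances, classical multidimensional scaling). It is not the authors' extended Isomap, whose details are not given here. The data are synthetic points on a curved arc standing in for measured signaling profiles, and all function names, the neighbor count and the iteration count are assumptions chosen for illustration.

```python
import math

def pairwise_distances(points):
    """Euclidean distance between every pair of points."""
    return [[math.dist(p, q) for q in points] for p in points]

def geodesic_distances(dist, n_neighbors):
    """Approximate manifold distances: k-NN graph plus Floyd-Warshall."""
    n = len(dist)
    inf = float("inf")
    graph = [[inf] * n for _ in range(n)]
    for i in range(n):
        graph[i][i] = 0.0
        # Index 0 of the sorted list is the point itself, so skip it.
        nearest = sorted(range(n), key=lambda j: dist[i][j])[1:n_neighbors + 1]
        for j in nearest:
            graph[i][j] = graph[j][i] = dist[i][j]  # keep the graph symmetric
    for k in range(n):
        for i in range(n):
            via_k = graph[i][k]
            for j in range(n):
                if via_k + graph[k][j] < graph[i][j]:
                    graph[i][j] = via_k + graph[k][j]
    return graph

def classical_mds(geo, n_components=2, n_iter=500):
    """Embed points from a distance matrix using power iteration with deflation."""
    n = len(geo)
    sq = [[d * d for d in row] for row in geo]
    row_mean = [sum(row) / n for row in sq]
    total_mean = sum(row_mean) / n
    # Double centering turns squared distances into a Gram matrix.
    gram = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + total_mean)
             for j in range(n)] for i in range(n)]
    coords = [[0.0] * n_components for _ in range(n)]
    for c in range(n_components):
        v = [math.sin(i + 1 + c) for i in range(n)]  # deterministic start vector
        for _ in range(n_iter):
            w = [sum(gram[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        eigval = sum(v[i] * sum(gram[i][j] * v[j] for j in range(n))
                     for i in range(n))
        scale = math.sqrt(max(eigval, 0.0))  # negative eigenvalues are clipped
        for i in range(n):
            coords[i][c] = v[i] * scale
        # Deflation removes the component just found from the Gram matrix.
        for i in range(n):
            for j in range(n):
                gram[i][j] -= eigval * v[i] * v[j]
    return coords

# Synthetic data (an assumption): 30 points along three quarters of a unit circle.
n_points = 30
angles = [1.5 * math.pi * i / (n_points - 1) for i in range(n_points)]
points = [(math.cos(t), math.sin(t), 0.0) for t in angles]

dist = pairwise_distances(points)
geo = geodesic_distances(dist, n_neighbors=3)
embedding = classical_mds(geo, n_components=2)

print("Euclidean end-to-end distance:", round(dist[0][-1], 3))
print("Geodesic end-to-end distance:", round(geo[0][-1], 3))
```

On this toy curve the straight-line distance between the two end points is much shorter than the distance along the arc, which is the kind of non-linear structure a linear projection such as PCA cannot unfold. The first Isomap coordinate orders the points along the arc, and coordinates of this kind are what a downstream classifier such as k-NN would take as input.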
