Setting the Standards for Signal Transduction Research

Standardized methods to link large, complex data sets to models are needed. Major advances in high-throughput technology platforms, coupled with increasingly sophisticated computational methods for systematic data analysis, have provided scientists with tools to better understand the complexity of signaling networks. In this era of massive and diverse data collection, standardization efforts that streamline data gathering, analysis, storage, and sharing are becoming a necessity. Here, we give an overview of current technologies to study signal transduction. We argue that along with the opportunities the new technologies open, their heterogeneous nature poses critical challenges for data handling that are further increased when data are to be integrated in mathematical models. Efficient standardization through markup languages and data annotation is a sine qua non condition for a systems-level analysis of signaling processes. It remains to be seen the extent to which and the speed at which the emerging standardization efforts will be embraced by the signaling community.

[1]  C. Sander,et al.  The HUPO PSI's Molecular Interaction format—a community standard for the representation of protein interaction data , 2004, Nature Biotechnology.

[2]  Nigel W. Hardy,et al.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project , 2008, Nature Biotechnology.

[3]  Aris Floratos,et al.  geWorkbench: an open source platform for integrative genomics , 2010, Bioinform..

[4]  Scott D. Tanner,et al.  Multiplex bio-assay with inductively coupled plasma mass spectrometry: Towards a massively multivariate single-cell technology , 2007 .

[5]  Mudita Singhal,et al.  Enabling high-throughput data management for systems biology: The Bioinformatics Resource Manager , 2007, Bioinform..

[6]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[7]  Patrick G. A. Pedrioli,et al.  Phosphoproteomic Analysis Reveals Interconnected System-Wide Responses to Perturbations of Kinases and Phosphatases in Yeast , 2010, Science Signaling.

[8]  D. Lauffenburger,et al.  A Systems Model of Signaling Identifies a Molecular Basis Set for Cytokine-Induced Apoptosis , 2005, Science.

[9]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[10]  Piotr J. Durka,et al.  SignalML: metaformat for description of biomedical time series , 2004, Comput. Methods Programs Biomed..

[11]  Oliver Hofmann,et al.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level , 2010, Bioinform..

[12]  Julio Saez-Rodriguez,et al.  Flexible informatics for linking experimental data to mathematical models via DataRail , 2008, Bioinform..

[13]  A. Brazma,et al.  Standards for systems biology , 2006, Nature Reviews Genetics.

[14]  John Quackenbush,et al.  Standardizing the standards , 2006 .

[15]  R. Aebersold,et al.  Applying mass spectrometry-based proteomics to genetics, genomics and network biology , 2009, Nature Reviews Genetics.

[16]  G. Nolan,et al.  Computational solutions to large-scale data management and analysis , 2010, Nature Reviews Genetics.

[17]  Charlotte Harrison High-content screening: Integrating information , 2008, Nature Reviews Drug Discovery.

[18]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.

[19]  Michael L. Creech,et al.  Integration of biological networks and gene expression data using Cytoscape , 2007, Nature Protocols.

[20]  E. Marcotte,et al.  It's the machine that matters: Predicting gene function and phenotype from protein networks. , 2010, Journal of proteomics.

[21]  Matthew R. Clutter,et al.  Fluorescent Cell Barcoding for Multiplex Flow Cytometry , 2011, Current protocols in cytometry.

[22]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[23]  M. Mann,et al.  Decoding signalling networks by mass spectrometry-based proteomics , 2010, Nature Reviews Molecular Cell Biology.

[24]  D. Lauffenburger,et al.  Systems Analysis of EGF Receptor Signaling Dynamics with Micro-Western Arrays , 2010, Nature Methods.

[25]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[26]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[27]  David J. Reiss,et al.  The Gaggle: An open-source software system for integrating bioinformatics software and data sources , 2006, BMC Bioinformatics.

[28]  Forest M White,et al.  Oncogenic EGFR Signaling Networks in Glioma , 2009, Science Signaling.

[29]  D. Lauffenburger,et al.  Networks Inferred from Biochemical Data Reveal Profound Differences in Toll-like Receptor and Inflammatory Signaling between Normal and Transformed Hepatocytes* , 2010, Molecular & Cellular Proteomics.

[30]  S. Brunak,et al.  Quantitative Phosphoproteomics Reveals Widespread Full Phosphorylation Site Occupancy During Mitosis , 2010, Science Signaling.

[31]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[32]  Gavin MacBeath,et al.  Dissecting protein function and signaling using protein microarrays. , 2009, Current opinion in chemical biology.

[33]  Norman W. Paton,et al.  SBRML: a markup language for associating systems biology data with models , 2010, Bioinform..

[34]  Eric W. Deutsch,et al.  SBEAMS-Microarray: database software supporting genomic expression analyses for systems biology , 2006, BMC Bioinformatics.

[35]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[36]  Steven P. Gygi,et al.  Large-scale phosphorylation analysis of mouse liver , 2007, Proceedings of the National Academy of Sciences.

[37]  Chris T. A. Evelo,et al.  Presenting and exploring biological pathways with PathVisio , 2008, BMC Bioinformatics.

[38]  Alison Abbott,et al.  Germans cook up liver project , 2010, Nature.

[39]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[40]  J. Mesirov,et al.  GenePattern 2.0 , 2006, Nature Genetics.

[41]  Nicolas Le Novère,et al.  BioModels.net Web Services, a free and integrated toolkit for computational modelling software , 2010, Briefings Bioinform..

[42]  Gaudenz Danuser,et al.  Linking data to models: data regression , 2006, Nature Reviews Molecular Cell Biology.