Data driven linear algebraic methods for analysis of molecular pathways : application to disease progression in shock / trauma

MOTIVATION Although trauma is the leading cause of death for those below 45years of age, there is a dearth of information about the temporal behavior of the underlying biological mechanisms in those who survive the initial trauma only to later suffer from syndromes such as multiple organ failure. Levels of serum cytokines potentially affect the clinical outcomes of trauma; understanding how cytokine levels modulate intra-cellular signaling pathways can yield insights into molecular mechanisms of disease progression and help to identify targeted therapies. However, developing such analyses is challenging since it necessitates the integration and interpretation of large amounts of heterogeneous, quantitative and qualitative data. Here we present the Pathway Semantics Algorithm (PSA), an algebraic process of node and edge analyses of evoked biological pathways over time for in silico discovery of biomedical hypotheses, using data from a prospective controlled clinical study of the role of cytokines in multiple organ failure (MOF) at a major US trauma center. A matrix algebra approach was used in both the PSA node and PSA edge analyses with different matrix configurations and computations based on the biomedical questions to be examined. In the edge analysis, a percentage measure of crosstalk called XTALK was also developed to assess cross-pathway interference. RESULTS In the node/molecular analysis of the first 24h from trauma, PSA uncovered seven molecules evoked computationally that differentiated outcomes of MOF or non-MOF (NMOF), of which three molecules had not been previously associated with any shock/trauma syndrome. In the edge/molecular interaction analysis, PSA examined four categories of functional molecular interaction relationships--activation, expression, inhibition, and transcription--and found that the interaction patterns and crosstalk changed over time and outcome. The PSA edge analysis suggests that a diagnosis, prognosis or therapy based on molecular interaction mechanisms may be most effective within a certain time period and for a specific functional relationship.

[1]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Ricard V Solé,et al.  Distributed robustness in cellular networks: insights from synthetic evolved circuits , 2009, Journal of The Royal Society Interface.

[3]  B. Maier,et al.  EARLY VERSUS LATE ONSET OF MULTIPLE ORGAN FAILURE IS ASSOCIATED WITH DIFFERING PATTERNS OF PLASMA CYTOKINE BIOMARKER EXPRESSION AND OUTCOME AFTER SEVERE TRAUMA , 2007, Shock.

[4]  Jin Zhang Powerful goodness‐of‐fit tests based on the likelihood ratio , 2002 .

[5]  M. Korc,et al.  Signaling pathways in pancreatic cancer. , 2011, Critical reviews in eukaryotic gene expression.

[6]  M. Bianchi DAMPs, PAMPs and alarmins: all we need to know about danger , 2007, Journal of leukocyte biology.

[7]  M. Harboe,et al.  Innate Immune Responses to Danger Signals in Systemic Inflammatory Response Syndrome and Sepsis , 2009, Scandinavian journal of immunology.

[8]  Roger Brent,et al.  Cell biology. A fishing buddy for hypothesis generators. , 2005, Science.

[9]  T. Senda,et al.  Histone chaperones: 30 years from isolation to elucidation of the mechanisms of nucleosome assembly and disassembly , 2008, Cellular and Molecular Life Sciences.

[10]  M. Newton,et al.  Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis , 2007, 0708.4350.

[11]  G. Núñez,et al.  Gut Immunity: A NOD to the Commensals , 2009, Current Biology.

[12]  Kwang-Hyun Cho,et al.  Investigations Into the Analysis and Modeling of the TNFα-Mediated NF-κB-Signaling Pathway , 2003 .

[13]  Gabriel Núñez,et al.  The innate immune receptor Nod1 protects the intestine from inflammation-induced tumorigenesis. , 2008, Cancer research.

[14]  Luis Emilio Bruni,et al.  Cellular Semiotics And Signal Transduction , 2008 .

[15]  Insuk Sohn,et al.  Multiple testing for gene sets from microarray experiments , 2011, BMC Bioinformatics.

[16]  J. Davis Bioinformatics and Computational Biology Solutions Using R and Bioconductor , 2007 .

[17]  Pat Levitt,et al.  Molecular Characterization of Schizophrenia Viewed by Microarray Analysis of Gene Expression in Prefrontal Cortex , 2000, Neuron.

[18]  Andrew N Hoofnagle,et al.  The fundamental flaws of immunoassays and potential solutions using tandem mass spectrometry. , 2009, Journal of immunological methods.

[19]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[20]  C. Thiemermann,et al.  Selective NOD1 agonists cause shock and organ injury/dysfunction in vivo. , 2007, American journal of respiratory and critical care medicine.

[21]  R. Accolla,et al.  The dual function of the MHC class II transactivator CIITA against HTLV retroviruses. , 2009, Frontiers in bioscience.

[22]  Clifford A. Meyer,et al.  MYC regulation of a “poor-prognosis” metastatic cancer cell state , 2010, Proceedings of the National Academy of Sciences.

[23]  J. Lamb,et al.  Cyclin D1 and Molecular Chaperones: Implications for Tumorigenesis , 2003, Cell cycle.

[24]  Qi Liu,et al.  Improving gene set analysis of microarray data by SAM-GS , 2007, BMC Bioinformatics.

[25]  M. West,et al.  Bayesian Modelling for Biological Annotation of Gene Expression Pathway Signatures , 2009 .

[26]  Sayan Mukherjee,et al.  Analysis of sample set enrichment scores: assaying the enrichment of sets of genes for individual samples in genome-wide expression profiles , 2006, ISMB.

[27]  Di Wu,et al.  ROAST: rotation gene set tests for complex microarray experiments , 2010, Bioinform..

[28]  Reddy Sa Signaling pathways in pancreatic cancer. , 2001 .

[29]  A. Kho,et al.  Identification of genes expressed with temporal-spatial restriction to developing cerebellar neuron precursors by a functional genomic approach , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  W. Tong,et al.  Transactivation of EGF receptor and ErbB2 protects intestinal epithelial cells from TNF-induced apoptosis , 2008, Proceedings of the National Academy of Sciences.

[31]  H. Simon,et al.  Studies of Scientific Discovery: Complementary Approaches and Convergent Findings , 1999 .

[32]  Xiaomeng Zhang,et al.  Identification of a Topological Characteristic Responsible for the Biological Robustness of Regulatory Networks , 2009, PLoS Comput. Biol..

[33]  G. Edelman,et al.  Degeneracy and complexity in biological systems , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Min Xu,et al.  Unraveling complex temporal associations in cellular systems across multiple time-series microarray datasets , 2010, J. Biomed. Informatics.

[35]  D. Philpott,et al.  The innate immune molecule, NOD1, regulates direct killing of Helicobacter pylori by antimicrobial peptides , 2010, Cellular microbiology.

[36]  J. Downing,et al.  Gene Expression Profiling of Pediatric Acute Myelogenous Leukemia Materials and Methods , 2022 .

[37]  David Klahr,et al.  Dual Space Search During Scientific Reasoning , 1988, Cogn. Sci..

[38]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[39]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[40]  E. Deitch,et al.  Multiple organ failure. Pathophysiology and potential future therapy. , 1992, Annals of surgery.

[41]  Ronald G. Tompkins,et al.  A Genomic Score Prognostic of Outcome in Trauma Patients , 2009, Molecular medicine.

[42]  Xing Qiu,et al.  Correlation Between Gene Expression Levels and Limitations of the Empirical Bayes Methodology for Finding Differentially Expressed Genes , 2005, Statistical applications in genetics and molecular biology.

[43]  Harald Mischak,et al.  Identification and Validation of Urinary Biomarkers for Differential Diagnosis and Evaluation of Therapeutic Intervention in Anti-neutrophil Cytoplasmic Antibody-associated Vasculitis* , 2009, Molecular & Cellular Proteomics.

[44]  Lawrence Hunter,et al.  Leveraging existing biological knowledge in the identification of candidate genes for facial dysmorphology , 2009, BMC Bioinformatics.

[45]  L. Koenderman,et al.  Postinjury immune monitoring: can multiple organ failure be predicted? , 2008, Current opinion in critical care.

[46]  F. Moore,et al.  Inducible nitric oxide synthase mediates gut ischemia/reperfusion-induced ileus only after severe insults. , 2001, The Journal of surgical research.

[47]  R. Hotchkiss,et al.  Molecular biology of multiple organ dysfunction syndrome: injury, adaptation, and apoptosis. , 2000, Surgical infections.

[48]  Yuval Shahar Dimension of Time in Illness: An Objective View , 2000, Annals of Internal Medicine.

[49]  Melonie P. Heron,et al.  Deaths: preliminary data for 2004. , 2006, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[50]  Thomas Lengauer,et al.  Analysis of Gene Expression Data with Pathway Scores , 2000, ISMB.

[51]  Zhen Su,et al.  EasyGO: Gene Ontology-based annotation and functional enrichment analysis tool for agronomical species , 2007, BMC Genomics.

[52]  Abdul Salam Jarrah,et al.  Polynomial algebra of discrete models in systems biology , 2010, Bioinform..

[53]  William Stafford Noble,et al.  Exploring Gene Expression Data with Class Scores , 2001, Pacific Symposium on Biocomputing.

[54]  Pablo Tamayo,et al.  An Erythroid Differentiation Signature Predicts Response to Lenalidomide in Myelodysplastic Syndrome , 2008, PLoS medicine.

[55]  Jiaquan Xu,et al.  Deaths: preliminary data for 2011. , 2012 .

[56]  Robert E. Lewis,et al.  KSR2 is an essential regulator of AMP kinase, energy expenditure, and insulin sensitivity. , 2009, Cell metabolism.

[57]  Frank Emmert-Streib,et al.  Pathway Analysis of Expression Data: Deciphering Functional Building Blocks of Complex Diseases , 2011, PLoS Comput. Biol..

[58]  Garrett Birkhoff,et al.  A survey of modern algebra , 1942 .

[59]  Philip Hahnfeldt,et al.  Transcriptional network governing the angiogenic switch in human pancreatic cancer , 2007, Proceedings of the National Academy of Sciences.

[60]  Nick Patterson,et al.  Reply to "Statistical concerns about the GSEA procedure" , 2004, Nature Genetics.

[61]  J. Cuozzo,et al.  Identification of a Novel Human Kinase Supporter of Ras (hKSR-2) That Functions as a Negative Regulator of Cot (Tpl2) Signaling* , 2003, Journal of Biological Chemistry.

[62]  Mary F. McGuire,et al.  Measurement units may impact results of pathway analysis , 2007 .

[63]  Ben S. Wittner,et al.  Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1 , 2009, Nature.

[64]  Joaquín Dopazo,et al.  FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes , 2004, Bioinform..

[65]  A. Nobel,et al.  Heading Down the Wrong Pathway: on the Influence of Correlation within Gene Sets , 2010, BMC Genomics.

[66]  C. Coopersmith,et al.  EPIDERMAL GROWTH FACTOR TREATMENT DECREASES MORTALITY AND IS ASSOCIATED WITH IMPROVED GUT INTEGRITY IN SEPSIS , 2008, Shock.

[67]  Doheon Lee,et al.  Inferring Pathway Activity toward Precise Disease Classification , 2008, PLoS Comput. Biol..

[68]  Y. Yoshikawa,et al.  The M-Ras-RA-GEF-2-Rap1 pathway mediates tumor necrosis factor-alpha dependent regulation of integrin activation in splenocytes. , 2007, Molecular biology of the cell.

[69]  Mingzhu Zhu,et al.  MEGO: gene functional module expression based on gene ontology. , 2005, BioTechniques.

[70]  J. Cavaillon,et al.  Compensatory anti-inflammatory response syndrome , 2008, Thrombosis and Haemostasis.

[71]  C. Coopersmith,et al.  INTESTINAL CROSSTALK: A NEW PARADIGM FOR UNDERSTANDING THE GUT AS THE "MOTOR" OF CRITICAL ILLNESS , 2007, Shock.

[72]  Pablo Tamayo,et al.  Loss of the tumor suppressor Snf5 leads to aberrant activation of the Hedgehog-Gli pathway , 2010, Nature Medicine.

[73]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[74]  J. A. Bondy,et al.  Graph Theory , 2008, Graduate Texts in Mathematics.

[75]  Patrik Edén,et al.  Comparing Functional Annotation Analyses with Catmap Comparing Functional Annotation Analyses with Catmap , 2004 .

[76]  Yoram Vodovotz,et al.  Translational systems biology of inflammation and healing , 2010, Wound repair and regeneration : official publication of the Wound Healing Society [and] the European Tissue Repair Society.

[77]  Dougu Nam,et al.  De-correlating expression in gene-set analysis , 2010, Bioinform..

[78]  Mary F. McGuire,et al.  Pathway Semantics: An Algebraic Data Driven Algorithm to Generate Hypotheses about Molecular Patterns Underlying Disease Progression , 2011 .

[79]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[80]  Bing Zhang,et al.  An Integrated Approach for the Analysis of Biological Pathways using Mixed Models , 2008, PLoS genetics.

[81]  C. Gibbs,et al.  The 14-3-3 brain protein in cerebrospinal fluid as a marker for transmissible spongiform encephalopathies. , 1996, The New England journal of medicine.

[82]  R. Kozar,et al.  Molecular mechanisms of pharmaconutrients. , 2010, The Journal of surgical research.

[83]  Mary F. McGuire,et al.  Early cytokine production risk stratifies trauma patients for multiple organ failure. , 2009, Journal of the American College of Surgeons.

[84]  U. Bhalla,et al.  Complexity in biological signaling systems. , 1999, Science.

[85]  Sang-Bae Kim,et al.  ADGO: analysis of differentially expressed gene sets using composite GO annotation , 2006, Bioinform..

[86]  Lesly A. Dossett,et al.  Diagnosis-dependent relationships between cytokine levels and survival in patients admitted for surgical critical care. , 2010, Journal of the American College of Surgeons.

[87]  Seon-Young Kim,et al.  PAGE: Parametric Analysis of Gene Set Enrichment , 2005, BMC Bioinform..

[88]  D. Shasha,et al.  Pattern discovery for hypothesis generation in biology , 2006 .

[89]  M. Feinleib National Center for Health Statistics (NCHS) , 2005 .

[90]  May D. Wang,et al.  GoMiner: a resource for biological interpretation of genomic and proteomic data , 2003, Genome Biology.

[91]  B. Efron Correlation and Large-Scale Simultaneous Significance Testing , 2007 .

[92]  Andrew B. Nobel,et al.  A statistical framework for testing functional categories in microarray data , 2008, 0803.3881.

[93]  D. Damian,et al.  Statistical concerns about the GSEA procedure , 2004, Nature Genetics.

[94]  J. Mesirov,et al.  Gene Set Enrichment Analysis Made Right , 2011 .

[95]  Carol Friedman,et al.  Discovery of Protein Interaction Networks Shared by Diseases , 2006, Pacific Symposium on Biocomputing.

[96]  T. Golub,et al.  Modeling genomic diversity and tumor dependency in malignant melanoma. , 2008, Cancer research.

[97]  Andrew B. Nobel,et al.  Significance analysis of functional categories in gene expression studies: a structured permutation approach , 2005, Bioinform..

[98]  James M. Whitacre,et al.  Degeneracy: a link between evolvability, robustness and complexity in biological systems , 2009, Theoretical Biology and Medical Modelling.

[99]  W H Wong,et al.  Genome-wide expression analysis reveals dysregulation of myelination-related genes in chronic schizophrenia , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[100]  Jeffrey T Leek,et al.  A general framework for multiple testing dependence , 2008, Proceedings of the National Academy of Sciences.

[101]  Xiaohui Xie,et al.  Erralpha and Gabpa/b specify PGC-1alpha-dependent oxidative phosphorylation gene expression that is altered in diabetic muscle. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[102]  R. Goris,et al.  Inflammatory mediators in relation to the development of multiple organ failure in patients after severe blunt trauma. , 1995, Critical care medicine.

[103]  Christian P. Robert,et al.  Large-scale inference , 2010 .

[104]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[105]  M. Gerstein,et al.  The current excitement in bioinformatics-analysis of whole-genome expression data: how does it relate to protein structure and function? , 2000, Current opinion in structural biology.

[106]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[107]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[108]  Kwang-Hyun Cho,et al.  Investigations into the analysis and modeling of the TNF alpha-mediated NF-kappa B-signaling pathway. , 2003, Genome research.

[109]  Mary F. McGuire,et al.  Computational Approaches for Translational Clinical Research in Disease Progression , 2011, Journal of Investigative Medicine.

[110]  M Schetz,et al.  Intensive insulin therapy in critically ill patients. , 2001, The New England journal of medicine.

[111]  R. Stewart Injury prevention: why so important? , 2007, The Journal of trauma.

[112]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[113]  A. Peitzman,et al.  Fresh frozen plasma is independently associated with a higher risk of multiple organ failure and acute respiratory distress syndrome. , 2009, The Journal of trauma.

[114]  S. Möller,et al.  Time course transcriptomics of IFNB1b drug therapy in multiple sclerosis , 2010, Autoimmunity.

[115]  F. Boisvert,et al.  The Nucleolus under Stress , 2010, Molecular Cell.

[116]  Xin Lu,et al.  Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures , 2007, BMC Bioinformatics.

[117]  Luay Nakhleh,et al.  Hypothesis Generation in Signaling Networks , 2006, J. Comput. Biol..

[118]  J. Mesirov,et al.  An oncogenic KRAS2 expression signature identified by cross-species gene-expression analysis , 2005, Nature Genetics.

[119]  Sandrine Dudoit,et al.  Multiple tests of association with biological annotation metadata , 2008, 0805.3008.

[120]  Christian D. Schunn,et al.  A 4-Space Model of Scientific Discovery , 1995 .

[121]  K. Seidl,et al.  Pharmacologic Inhibition of Tpl2 Blocks Inflammatory Responses in Primary Human Monocytes, Synoviocytes, and Blood* , 2007, Journal of Biological Chemistry.

[122]  J. Downing,et al.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. , 2002, Cancer cell.

[123]  W. Wong,et al.  GoSurfer: a graphical interactive tool for comparative analysis of large gene sets in Gene Ontology space. , 2004, Applied bioinformatics.

[124]  Min-Sung Kim,et al.  COFECO: composite function annotation enriched by protein complex data , 2009, Nucleic Acids Res..

[125]  Paolo Tieri,et al.  Network, degeneracy and bow tie. Integrating paradigms and architectures to grasp the complexity of the immune system , 2010, Theoretical Biology and Medical Modelling.

[126]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[127]  Paul Pavlidis,et al.  ErmineJ: Tool for functional analysis of gene expression data sets , 2005, BMC Bioinformatics.

[128]  Jean-Daniel Zucker,et al.  FunNet: an integrative tool for exploring transcriptional interactions , 2008, Bioinform..

[129]  G Tononi,et al.  Measures of degeneracy and redundancy in biological networks. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[130]  J. Fallon,et al.  Induction of IG9 monocyte adhesion molecule expression in smooth muscle and endothelial cells after balloon arterial injury in cholesterol-fed rabbits. , 2000, Arteriosclerosis, thrombosis, and vascular biology.

[131]  Robert E. Brown,et al.  Morphoproteomics: exposing protein circuitries in tumors to identify potential therapeutic targets in cancer patients , 2005, Expert review of proteomics.

[132]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[133]  G. Mor,et al.  In vitro and in vivo Effects of Combination of Trastuzumab (Herceptin) and Tamoxifen in Breast Cancer , 2005, Breast Cancer Research and Treatment.

[134]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[135]  John D. Storey A direct approach to false discovery rates , 2002 .

[136]  Roger Brent,et al.  A Fishing Buddy for Hypothesis Generators , 2005, Science.

[137]  G. Birkhoff,et al.  A survey of modern algebra , 1942 .

[138]  M. Fang,et al.  PPARγ enhances IFNγ-mediated transcription and rescues the TGFβ antagonism by stimulating CIITA in vascular smooth muscle cells , 2009 .

[139]  Robert E. Brown,et al.  Morphogenomics and morphoproteomics: a role for anatomic pathology in personalized medicine. , 2009, Archives of pathology & laboratory medicine.

[140]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[141]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[142]  Rafael A Irizarry,et al.  Gene set enrichment analysis made simple , 2009, Statistical methods in medical research.

[143]  R. Tibshirani,et al.  On testing the significance of sets of genes , 2006, math/0610667.

[144]  P. Park,et al.  Discovering statistically significant pathways in expression profiling studies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[145]  Korbinian Strimmer,et al.  Gene ranking and biomarker discovery under correlation , 2009, Bioinform..

[146]  R. Kozar,et al.  Enteral glutamine during active shock resuscitation is safe and enhances tolerance of enteral feeding. , 2008, JPEN. Journal of parenteral and enteral nutrition.

[147]  Pablo Tamayo,et al.  Inactivation of the Snf5 tumor suppressor stimulates cell cycle progression and cooperates with p53 loss in oncogenic transformation. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[148]  Karen Sachs,et al.  Characterization of patient specific signaling via augmentation of bayesian networks with disease and patient state nodes , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[149]  Willis X. Li Canonical and non-canonical JAK-STAT signaling. , 2008, Trends in cell biology.

[150]  Anil Potti,et al.  An integrated genomic-based approach to individualized treatment of patients with advanced-stage ovarian cancer. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[151]  V. Mootha,et al.  mTOR controls mitochondrial oxidative function through a YY1–PGC-1α transcriptional complex , 2007, Nature.

[152]  Nir Friedman,et al.  Tissue classification with gene expression profiles , 2000, RECOMB '00.