Dis2PPI: A Workflow Designed to Integrate Proteomic and Genetic Disease Data

Experiments in bioinformatics are based on protocols that employ different steps for data mining and data integration, collectively known as computational workflows. Considering the use of databases in the biomedical sciences software that is able to query multiple databases is desirable. Systems biology, which encompasses the design of interactomic networks to understand complex biological processes, can benefit from computational workflows. Unfortunately, the use of computational workflows in systems biology is still very limited, especially for applications associated with the study of disease. To address this limitation, we designed Dis2PPI, a workflow that integrates information retrieved from genetic disease databases and interactomes. Dis2PPI extracts protein names from a disease report and uses this information to mine protein-protein interaction PPI networks. The data gathered from this mining can be used in systems biology analyses. To demonstrate the functionality of Dis2PPI for systems biology analyses, the authors mined information about xeroderma pigmentosum and Cockayne syndrome, two monogenic diseases that lead to skin cancer when the patients are exposed to sunlight and neurodegeneration.

[1]  Jerome A. Osheroff,et al.  Bmc Medical Informatics and Decision Making Information Management to Enable Personalized Medicine: Stakeholder Roles in Building Clinical Decision Support , 2009 .

[2]  A. Sancar,et al.  Formation of a ternary complex by human XPA, ERCC1, and ERCC4(XPF) excision repair proteins. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Masahide Takahashi,et al.  Misshapen-like kinase 1 (MINK1) Is a Novel Component of Striatin-interacting Phosphatase and Kinase (STRIPAK) and Is Required for the Completion of Cytokinesis* , 2012, The Journal of Biological Chemistry.

[4]  E. Friedberg How nucleotide excision repair protects against cancer , 2001, Nature Reviews Cancer.

[5]  C. Menck,et al.  The eukaryotic nucleotide excision repair pathway. , 2003, Biochimie.

[6]  Burkhard Linke,et al.  Conveyor: a workflow engine for bioinformatics analyses , 2011, Bioinform..

[7]  J. Hoeijmakers Genome maintenance mechanisms for preventing cancer , 2001, Nature.

[8]  J. Egly,et al.  TFIIH: when transcription met DNA repair , 2012, Nature Reviews Molecular Cell Biology.

[9]  G. Baffet,et al.  The knock-down of ERCC1 but not of XPF causes multinucleation. , 2011, DNA repair.

[10]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[11]  E. Friedberg,et al.  Correction of xeroderma pigmentosum complementation group D mutant cell phenotypes by chromosome and gene transfer: involvement of the human ERCC2 DNA repair gene. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[13]  Chiara Romualdi,et al.  A-MADMAN: Annotation-based microarray data meta-analysis tool , 2009, BMC Bioinformatics.

[14]  J. Rahn,et al.  Multiple roles of ERCC1‐XPF in mammalian interstrand crosslink repair , 2010, Environmental and molecular mutagenesis.

[15]  Lei Li,et al.  Characterization of molecular defects in xeroderma pigmentosum group C , 1993, Nature Genetics.

[16]  Péter Horváth,et al.  Enhanced CellClassifier: a multi-class classification tool for microscopy images , 2010, BMC Bioinformatics.

[17]  Kiyoji Tanaka,et al.  Molecular basis of group A xeroderma pigmentosum: A missense mutation and two deletions located in a zinc finger consensus sequence of the XPAC gene , 1992, Human Genetics.

[18]  D. Dickson,et al.  Neuropathology of Cockayne syndrome: Evidence for impaired development, premature aging, and neurodegeneration , 2009, Mechanisms of Ageing and Development.

[19]  M. F. White Structure, function and evolution of the XPD family of iron-sulfur-containing 5'-->3' DNA helicases. , 2009, Biochemical Society transactions.

[20]  Y. Nakatsu,et al.  Additive roles of XPA and MSH2 genes in UVB-induced skin tumorigenesis in mice. , 2002, DNA repair.

[21]  Matthew A. Hibbs,et al.  Visualization of omics data for systems biology , 2010, Nature Methods.

[22]  Alberto M. R. Dávila,et al.  In Services: Data Management for In Silico Workflows , 2006, 17th International Workshop on Database and Expert Systems Applications (DEXA'06).

[23]  F. Hanaoka,et al.  A novel interaction between human DNA polymerase eta and MutLalpha. , 2009, Biochemical and biophysical research communications.

[24]  Leroy Hood,et al.  Systems biology, proteomics, and the future of health care: toward predictive, preventative, and personalized medicine. , 2004, Journal of proteome research.

[25]  Huafeng Xie Statistical physics of complex networks , 2008 .

[26]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[27]  M. Oda,et al.  Neurodegeneration in hereditary nucleotide repair disorders , 1999, Brain and Development.

[28]  Yi-Ping Hsueh,et al.  CTTNBP2, but not CTTNBP2NL, regulates dendritic spinogenesis and synaptic distribution of the striatin–PP2A complex , 2012, Molecular biology of the cell.

[29]  Neena Grover,et al.  Principles of biochemistry (4th ed.) , 2006 .

[30]  Arthur M. Lesk,et al.  Introduction to bioinformatics , 2002 .

[31]  Guo-Min Li,et al.  Mechanisms and functions of DNA mismatch repair , 2008, Cell Research.

[32]  H. Naegeli,et al.  The xeroderma pigmentosum pathway: decision tree analysis of DNA quality. , 2011, DNA repair.

[33]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[34]  Angelo Nuzzo,et al.  Genephony: a knowledge management tool for genome-wide research , 2009, BMC Bioinformatics.

[35]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[36]  Walter Ricciardi,et al.  The effectiveness of computerized clinical guidelines in the process of care: a systematic review , 2010, BMC health services research.

[37]  Adrian Paschke,et al.  A journey to Semantic Web query federation in the life sciences , 2009, BMC Bioinformatics.

[38]  Alan F. Scott,et al.  McKusick's Online Mendelian Inheritance in Man (OMIM®) , 2008, Nucleic Acids Res..

[39]  Franz Kummert,et al.  Database driven test case generation for protein?Cprotein docking , 2005, Bioinform..

[40]  H. Pospiech,et al.  Nucleotide excision repair of DNA with recombinant human proteins: definition of the minimal set of factors, active forms of TFIIH, and modulation by CAK. , 2000, Genes & development.

[41]  Valerie A. I. Natale A comprehensive description of the severity groups in Cockayne syndrome , 2011, American journal of medical genetics. Part A.

[42]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[43]  E. Friedberg,et al.  DNA Repair and Mutagenesis , 2006 .

[44]  M. Amalric,et al.  Down-regulation of striatin, a neuronal calmodulin-binding protein, impairs rat locomotor activity. , 1999, Journal of neurobiology.

[45]  Giovanni Scardoni,et al.  Analyzing biological network parameters with CentiScaPe , 2009, Bioinform..

[46]  D. Reinberg,et al.  Human cyclin-dependent kinase-activating kinase exists in three distinct complexes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[47]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[48]  Laura Inés Furlong,et al.  DisGeNET: a Cytoscape plugin to visualize, integrate, search and analyze gene-disease networks , 2010, Bioinform..

[49]  T. Kanda,et al.  Peripheral neuropathy in xeroderma pigmentosum. , 1990, Brain : a journal of neurology.

[50]  Xinxia Peng,et al.  Computational identification of hepatitis C virus associated microRNA-mRNA regulatory modules in human livers , 2009, BMC Genomics.

[51]  R. Legerski,et al.  XPC interacts with both HHR23B and HHR23A in vivo. , 1997, Mutation research.