Graph Data Representation in Oracle Database 10g: Case Studies in Life Sciences

New technologies have been developed in the life sciences that allow researchers to study biological systems in rich detail. These advances have resulted in an abundance of data that describes the relations between the fundamental components of biological systems, such as genes, proteins, and metabolites. The network of relations between the components holds insights as to how biological systems function, and consequently can help researchers understand the mechanisms behind disease. Biological networks are commonly managed and analyzed in a graph representation. Oracle Database 10g has the functionality to model data as a graph, and thereby has the potential to greatly facilitate research. In this paper we describe the Oracle implementation and provide case studies from the life sciences.

[1]  Cigdem Demir,et al.  The cell graphs of cancer , 2004, ISMB/ECCB.

[2]  Martin Vingron,et al.  IntAct: an open source molecular interaction database , 2004, Nucleic Acids Res..

[3]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[4]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[5]  A. Butte,et al.  Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[7]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[8]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[9]  Zoltan Szallasi,et al.  Genetic Network Analysis in Light of Massively Parallel Biological Data Acquisition , 1998, Pacific Symposium on Biocomputing.

[10]  Michael Lappe,et al.  From gene networks to gene function. , 2003, Genome research.

[11]  Christopher W. V. Hogue,et al.  Analysis of domain correlations in yeast protein complexes , 2004, ISMB/ECCB.

[12]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[13]  B. Bollobás The evolution of random graphs , 1984 .

[14]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[15]  T. Jenssen,et al.  A literature network of human genes for high-throughput analysis of gene expression , 2001, Nature Genetics.

[16]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[17]  M. Kanehisa A database for post-genome analysis. , 1997, Trends in genetics : TIG.

[18]  Julien Gagneur,et al.  Modular decomposition of protein-protein interaction networks , 2004, Genome Biology.

[19]  T. Golub,et al.  DNA microarrays in clinical oncology. , 2002, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[20]  Ian M. Donaldson,et al.  BIND: the Biomolecular Interaction Network Database , 2001, Nucleic Acids Res..

[21]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[22]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[23]  J. Mattick Genome research , 1990, Nature.

[24]  David B. Searls,et al.  Data integration—connecting the dots , 2003, Nature Biotechnology.

[25]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[26]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[27]  U. Alon Biological Networks: The Tinkerer as an Engineer , 2003, Science.

[28]  Nir Friedman,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004, Science.

[29]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[30]  Peter D. Karp,et al.  The EcoCyc and MetaCyc databases , 2000, Nucleic Acids Res..

[31]  Benno Schwikowski,et al.  Predicting protein-peptide interactions via a network-based motif sampler , 2004, ISMB/ECCB.

[32]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.