From Molecular Activities and Processes to Biological Function

This paper describes how biological function can be represented in terms of molecular activities and processes. It presents several key features of a data model that is based on a conceptual description of the network of interactions between molecular entities within the cell and between cells. This model is implemented in the aMAZE database that presently deals with information on metabolic pathways, gene regulation, sub- or supracellular locations, and transport. It is shown that this model constitutes a useful generalisation of data representations currently implemented in metabolic pathway databases, and that it can furthermore include multiple schemes for categorising and classifying molecular entities, activities, processes and localisations. In particular, we highlight the flexibility offered by our system in representing multiple molecular activities and their control, in viewing biological function at different levels of resolution and in updating this view as our knowledge evolves.

[1]  M. Riley Systems for categorizing functions of gene products. , 1998, Current Opinion in Structural Biology.

[2]  Peter D. Karp,et al.  An ontology for biological function based on molecular interactions , 2000, Bioinform..

[3]  D. Bray,et al.  Reductionism for biochemists: how to survive the protein jungle. , 1997, TIBS -Trends in Biochemical Sciences. Regular ed.

[4]  Julio Collado-Vides,et al.  RegulonDB (version 2.0): a database on transcriptional regulation in Escherichia coli , 1999, Nucleic Acids Res..

[5]  Julio Collado-Vides,et al.  RegulonDB (version 3.0): transcriptional regulation and operon organization in Escherichia coli K-12 , 2000, Nucleic Acids Res..

[6]  A. Bairoch The ENZYME data bank. , 1993, Nucleic acids research.

[7]  Xin Chen,et al.  TRANSFAC: an integrated system for gene expression regulation , 2000, Nucleic Acids Res..

[8]  S. Wodak,et al.  Representing and Analysing Molecular and Cellular Function Using the Computer , 2000, Biological chemistry.

[9]  H. Mewes,et al.  Overview of the yeast genome. , 1997, Nature.

[10]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[11]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[12]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[13]  Amos Bairoch,et al.  The ENZYME database in 2000 , 2000, Nucleic Acids Res..

[14]  Tsuguchika Kaminuma,et al.  A Database for Cell Signaling Networks , 1998, J. Comput. Biol..

[15]  Peter D. Karp,et al.  The EcoCyc and MetaCyc databases , 2000, Nucleic Acids Res..

[16]  Gary D. Bader,et al.  BIND-a data specification for storing and describing biomolecular interactions, molecular complexes and pathways , 2000, Bioinform..

[17]  Alain Rey,et al.  Dictionnaire alphabétique et analogique de la Langue française : le petit Robert , 1983 .

[18]  M. Riley,et al.  Functions of the gene products of Escherichia coli , 1993, Microbiological reviews.

[19]  Natalia Maltsev,et al.  WIT: integrated system for high-throughput genome sequence analysis and metabolic reconstruction , 2000, Nucleic Acids Res..

[20]  Evelyn Camon,et al.  The EMBL Nucleotide Sequence Database , 2004, Nucleic acids research.

[21]  Janet M. Thornton,et al.  Comparison of functional annotation schemes for genomes , 2000, Functional & Integrative Genomics.

[22]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.